Cellular towers communicate with each other so can seamlessly handoff your phone to the next tower when the time is right, or even block your phone entirely if it is seen by hundreds of towers at once (if you leave it on in an airplane).
Wifi is not like that, and clients often will refuse to move to an AP with a stronger signal until the signal to the one it's connected to drops entirely. 802.11r fast transition reduces the time to connect to a different AP to reduce the interruption (it's still not seamless) by allowing the key to be cached, which is of course what's targeted by KRACK.
Theoretically, if the APs/nodes communicate with each other they could decide only the one with the strongest signal should connect and all the others should kick the device off (similar to how the band-steering feature works, where 5GHz-capable clients are identified and get periodically kicked from 2.4GHz), but I'm not aware of any MESH setup that actually does this. Instead MESH is usually just sold as an expensive but convenient way to administer all of the APs at once using one GUI screen, and with a wireless backhaul to avoid having to run cable to APs.
It's usually enough to just adjust the roaming sensitivity of your clients so they will switch easier, and in the APs to reduce the Roaming Assistant setting until they occasionally kick a client with lower signal, so that client will then reconnect to whatever is strongest then. Yes, this will still incur a slight delay when switching, long-distance low-signal connections will be repeatedly disconnected, and switching necessarily won't be that frequent because in order to force the client to do so, an AP has no option other than to kick the client off to force a renegotiation.
As for overlapping channels, we are used to thinking in terms of 20MHz-wide blocks (so will manually set 2.4GHz to the non-overlapping ch1, 6 and 11) but OFDMA in Wifi 6 and above can automatically divide things even more to 10MHz or even 5MHz wide "resource units" as needed, perfect for many low-bandwidth but latency-sensitive streams. Selecting different channels than your neighbors helps minimize co-channel interference, but they may not be aware each 2.4GHz channel is only 5MHz wide so for example choosing a 20MHz-wide block centered on ch4 would interfere with channels 2, 3, 5 and 6, leaving only a 20MHz block centered on ch9-11 as non-overlapping for their neighbors
With 5GHz people usually choose either 80MHz-wide ch36-48 or ch149-161, or even 160MHz-wide using both non-contiguously (80+80 mode) just to avoid the long connection delay inherent in using DFS channels.
Historically 802.11b was spread-spectrum which helped to reduce the effects of interference, since packets were widely spread over different frequencies, leaving empty space over the air (so with luck interference could happen to just fit in the unused parts). 802.11a on 5GHz and 802.11g on 2.4GHz improved bandwidth by basically using every bit of their 20MHz-wide single pipes with no empty space, and this meant of course that any interference anywhere in that pipe would result in corrupted packets and a retransmit. It's been a solid pipe ever since, but at least 320MHz and 160MHz can either fall back to unbonded, or even use OFDMA to fall back to even 5MHz-wide if needed due to interference.