Yes, I was strictly speaking about internal busses, and yes, with "wide" busses in mind. The question, and the "FPGA vs. ASIC" point were a hint, but after re-reading my OP, I admit it wasn't perfectly clear.
External interconnections are still a different beast, even though the same question could still be interesting. But in both cases, multiplexers are always more expensive in terms of area. (And yes again, this is a general question, as on most modern FPGAs, internal tristate buffers do not exist, so strictly for FPGAs, the question is more a question of code style, assuming the synthesis tool can infer the MUXes properly, which frankly, having witnessed a number of inferring bugs in many of these tools, I wouldn't even play with these days...)
And the idea of crossbar matrices sounds interesting. I wonder why FPGAs don't embed those? Maybe that would just be hard selecting a reasonable number of them and distributing them properly on die so they are effectively usable... whereas LUTs are general-purpose structures that are much easier to deal with.