You could consider something like drawn in the attachment (sorry poor quality).
Firstly, there is one buffer driving the other buffers so that the 32kHz clock signal driving the buffers will be loaded as little as possible.
Then, you should notice that each of the shared clock nets contain a buffer, thus the nets are essentially isolated from each other. For example, if you place a scope to one of the clock nets, the others will not be affected and the signal quality driving each net will be as good as possible. Also, if there are any reflections in the PCB traces, the reflections are isolated and do not interfere each other. Otherwise you can experience multiple clock edges which will ruin your day. So, you should consider isolation technique as a good thing.
Next you should notice that each output buffer contains a 50 ohm series resistor. This resistor will provide series termination and impedance matching for driving the nominal 50 ohm PCB traces. I would use the series resistors even if the PCB track is short that it necessarily require termination - just in case there is something that needs to be tweaked or measured.
There is an optional receiver buffer (the dashed blob in the middle driving the device A). In your case this is not necessary, but it it drawn there to indicate where the receiver buffer should reside: At the far end of the PCB trace, that is.
Finally, then there are two resistor networks at the far end of the PCB trace next to the devices A and B which can be used to provide fancy biasing and amplitude scaling, unless you are willing to use proper receiver buffer with proper level translation. The resistor tweaks are typically usable up to few megahertz. At higher frequencies one should seriously consider using proper level translators and/or buffers. The impedance of the resistor network should be at least 1kohm ohms in order not to affect the signal amplitude too much as the signal is driven through a 50 ohm resistor.
The scheme presented here scales well up to few megahertz or more. For higher frequencies and long PCB traces one should consider using differential drivers for the PCB clock networks for better signal quality and less EMI interference.
Btw, if I understood the STM32 datasheet correctly the STM32 clock inputs can be driven by an external logic-level clock signal.