That's not how decoupling capacitors work. They only actually useful if they are close to the power pin. You need a separate capacitor for each IC. On more complicated ICs with multiple power pins, you need a separate capacitor per pin.
The circuits are not equivalent, and the second one is wrong.
Only "wrong" in the dogmatic sense that it violates someone's arbitrary condition.
It's only "wrong" in the true practical sense, if it causes the ICs to malfunction, however that might be. Examples might include: excessive supply or output ripple/noise, digital jitter, DC offsets in op-amps, etc.
In practice, the bypass caps can be many decimeters away from the device in question. Or maybe they have to be right up against the device, and multiple are required. Or furthermore, multiple are required, and they must be arranged around a supply plane with lossy capacitors as well,
How to avoid the dogma and approach it analytically?
First, determine how much ripple the device should experience. This will probably be less than 10% of total supply voltage, but beyond that, it depends. A sensitive analog circuit will need its supply quiet enough, so that, supply noise * PSRR does not affect the output noise floor. Use PSRR (which is a function of frequency!) in your calculations, where possible. Digital gates should be considered as 50% PSRR, i.e., CMOS input threshold is at about half supply. An analog or digital circuit with very good PSRR (like emitter followers, and ECL) might be perfectly fine with relatively noisy supplies, even for relatively critical (sensitive or low-jitter) applications. You won't know until you calculate the requirement!
Second, determine how much transient current the device will draw. This also varies wildly with type. An op-amp will be fairly quiet, except for load current being passed directly to the supplies (for obvious reasons). Therefore, an op-amp with a heavy load will generate more transient supply current than a lightly loaded one. By transient, I mean that only
changes in current matter: the DC (quiescent / operating) current need not be a factor. An opamp might draw, say, 3mA operating current, but source/sink 30mA into the load; an ECL gate might draw 10mA, but as long as the outputs are loaded equally, that current draw hardly changes at all; a CMOS gate might draw 100uA average, but gulp pulses of 2 or 20 or even 100mA ("shoot-through current") when the outputs switch state!
Third, put these two factors together to find the maximum supply impedance, ΔV/ΔI. Assume that this impedance must be met at all frequencies of interest to the device. An LM741 might only be critical up to 10MHz (and basically ignorant of anything above there). A CMOS gate might be critical at more than 200MHz.
Incidentally, it is simply physically impossible for a PCB to deliver stable power to heavy loads, at frequencies much above 100MHz -- as a result, big, low voltage devices, like MCUs, FPGAs and SoCs, have on-board bypass caps inside their package, even on die, to supply their GHz-range current demands! At frequencies this high, the length of the device's own pins prevents any degree of PCB-level bypassing from helping up there. Consequently, don't beat yourself up if you can't get a bypass cap on every single stupid pin!
Fourth and finally, synthesize a PDN (power distribution network) that meets the impedance requirement.
What is a PDN? It's a chain of series inductors (trace inductance) and parallel capacitors (bypasses, planes) connecting power to devices. In other words, an ordinary dumb LC filter. What do you do with a filter? You terminate it, at its characteristic impedance, and it behaves.
To make a good filter, place bypass capacitors at more-or-less regular intervals (distances) from each other. Prefer a linear (point-to-point-to-point, single chain) topology with termination at one or both ends, or use a branched topology (some points connect to multiple other points) with termination at the ends of the stubs.
This isn't hard. Computers can do this analysis for you. (Given some setup, of course. Computers aren't THAT smart yet.) Why do people prefer dogma? Because people are lazy, and better at pattern recognition ("these caps don't
look right"!) than analysis.
With some practice, you can run through this list in ten minutes. It's not a big burden. It can be greatly accelerated with assumptions and shortcuts -- keeping in mind the limitations of those assumptions, and being careful not to fall back into a dogmatic approach yourself.
Tim