Yes, it is possible.
As far as I know:
A transmission gate is used. Also known as an analog switch.
These are available as board-level components with quite reasonable dynamic specs (~ohms on-resistance, 10s pF off-capacitance, bandwidth ~1GHz). Though I don't know that you can pull off a S&H with as much bandwidth, because of the particulars of propagation (I suspect they have some multi-stage transmission-line-like magic which is able to deliver higher on-state bandwidth than Zo*Cp alone implies), let-through, and most importantly of all, commutation (how much time is required for the switch to transition from closed to open: the sample aperture).
Note that such performance levels require very small transistors, and so the supply voltages are low, and the signal voltages must be within those supplies. (I forget what the best performance is; there might be a near-1GHz switch with +/-15V range, which is very impressive indeed.)
This is all tightly integrated on chip, to charge the sampling capacitor, which is just as it sounds, a small capacitor (some pF) paired with the relevant switches and sense amps which make the ADC an ADC.
From there, it depends on the architecture, but faster ADCs (starting in the low MS/s) are mostly pipelined SAR type, which pair the sample cap with another cap of exactly equal value, that's been precharged to VREF or 0; then its voltage is amplified by a factor of 2, and so on until the desired bits have been obtained. The secret sauce that constitutes the precious IP of an ADC, includes timing tweaks, gain compensation and so on, so the result is obtained with minimal nonlinearities, no missing codes, and etc.
With everything being neatly on chip, parasitics can be controlled tightly; a capacitor might not be much more than a few transistors and their drain capacitance, or even the capacitance under a transistor's channel*. You can do some of this with board-level components, but it's impossible to achieve nearly as good performance (between all of: size, accuracy/ENOB, sample rate, power consumption).
*IIRC, this is the mechanism behind the famous analog "bucket brigade" device: as a transistor turns on and off, the channel capacitance itself is modulated, essentially squeezing out the charge under one transistor, into the next and so on. So the analogy to a real bucket brigade is amazingly apt. Because charge is conserved between stages, minimal amplification is required, even after many stages. Doing this with analog switches and fixed capacitors, you would need a buffer after every capacitor, or a gain of 2 after every other capacitor, etc., plus you'd incur the total distortion of all those stages.
(No, this doesn't add much to the present conversation, more just rephrasing or hinting at things that have already been mentioned.)
Tim