A few points. (Sorry if this sounds a bit hectoring - it is not meant to be.)
Firstly, scopes are used to look at digital signal such as pulses, the rise time of a 200MHz scope is 1.75 nsecs. You probably want to have at least one point on the slope and one on each end say so it would be nice to have a sample every 0.9nsecs but every 1nsec is probably ok but you don't want to go much below that. 2GS/s would give a sample every 0.5 nsec which gives you two or three on the slope.
Secondly, manufacturers tend to develop families of scopes so it makes sense to develop or use ADCs across the range which generally for these low end scopes goes up to around 500MHz for which at least 1GS/s is needed. The savings from using a different/slower ADC for the 100MHz model are probably not great compared to the savings of ordering/using a large number of the same part across the family. Often on the higher end models interleaving is used with the same speed ADCs e.g. 2GS/s is available if only one channel out of two is used by interleaving two 1GS/s ADCs. (I know that the 2000X only goes up to 200MHz but if you count the 3000X series in as well this can go up to 1GHz.)
Thirdly, scopes are used for measurements so adding additional filters just to enable lower speed ADCs would save little money whilst producing a scope that had a worse specification than rival products - analogue filters with a sharp and accurate frequency cut off without ripple in the pass band are not the easiest things to develop and mass produce and all for the purpose of a net saving that might be only a few pence.
Another, more minor point is that having more sample points enables you to avoid having to do interpolation. A screen worth of points is 500 typically (though some of the wide-screen scopes have more). At 2GS/s 500 points is 250 nsecs or 25 nsec/div. With a 200MHz scope you can accurately look at signals up to about a third of this as a rule of thumb which is around 70MHz for which the period is around 15 nsecs only so you'd probably want to set a time base of 2 nsec/div when even at 2GS/s you will only have 40 points in the wave form.
With a sharp filter and sinc(x) interpolation you may be able to fill in the points but it would be doing a lot of work (and expense) for what can more easily be done with a faster ADC and a simpler filter.