Hello,
I would make an Allan deviation plot.
There you quickly see how many values you have to average to get the "best stability",
i.e. the lowest uncertainty which you can get.
Of course, in the real world you will have to repeat the Allan deviation measurement several times, since it varies with environmental conditions.
But in your case you have a drift-free measurement system, so it should be very repeatable.
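
In case it helps, here is a minimal sketch of how such a curve can be computed from a list of raw readings. It uses the simple non-overlapping Allan deviation estimator, which is an assumption on my side and not necessarily what the attached plots were made with. The function names allan_deviation and best_averaging_count are made up for illustration, and the test data is synthetic (white noise plus a slow linear drift), not real ADC data.

```python
import math
import random


def allan_deviation(samples, m):
    """Non-overlapping Allan deviation for an averaging count of m samples."""
    n_blocks = len(samples) // m
    if n_blocks < 2:
        raise ValueError("need at least two full blocks of m samples")
    # Average each block of m consecutive readings.
    means = [sum(samples[k * m:(k + 1) * m]) / m for k in range(n_blocks)]
    # Sum of squared differences between neighbouring block averages.
    sq = sum((b - a) ** 2 for a, b in zip(means, means[1:]))
    # Allan variance is half the mean of those squared differences.
    return math.sqrt(sq / (2 * (n_blocks - 1)))


def best_averaging_count(samples, counts):
    """Return (m, deviation) for the averaging count with the lowest deviation."""
    curve = [(m, allan_deviation(samples, m))
             for m in counts if len(samples) // m >= 2]
    return min(curve, key=lambda point: point[1])


if __name__ == "__main__":
    # Synthetic readings only (made-up numbers): white noise plus linear drift.
    rng = random.Random(0)
    data = [rng.gauss(0.0, 1.0) + 0.001 * i for i in range(20000)]
    for m in (1, 10, 100, 1000):
        print(m, allan_deviation(data, m))
    print(best_averaging_count(data, [1, 3, 10, 30, 100, 300, 1000, 3000]))
```

With white noise only, the value falls as m grows; as soon as drift dominates it rises again, and the minimum in between is the averaging count to use. Keep in mind that the points at large m rest on only a few blocks, so they are statistically weak.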
Attached are two plots from my ADC#21 as an example.
One was taken right after the ADC had been built and calibrated, so the drift is still large. This shows up as a standard deviation
rising to 0.5 uV for longer integration times.
(x-axis in minutes, y-axis in mV after the 2:1 divider).
The other was taken after some months of operation. It shows that the best stability (standard deviation) of 0.25 uV in the 5 V range can be obtained by averaging around 200-300 measurement (1 minute) values.
(x-axis in raw measurements, y-axis again in mV after the 2:1 divider).
With best regards
Andreas