Watch mfgs do proper calibration in production, probably 2-point i.e. at 2 temps. You have to do that if using a cheap 32768Hz xtal.
Swiss "chrono" watches need to achieve < 7 secs per day, which is what you get on a 10,000 quid IWC etc - well from one 600 quid service to the next a year later

Or you could assume it will be worn, so +35C or so

and then cal it at just that one point. But if it draws as much power as this one will, it likely won't be worn during the night (that's the case for most smartwatches) and then you have a problem...
There are drift offsetting methods used in the precision analog chip business, which let the device warm up and drift due to that and you measure the drift and extrapolate the curve from that. I was doing this in the 1970s on precision HV power supply feedback resistors. It is much quicker than heating up the product during production to get the 2nd cal point.
If there is not enough self-heating then you can incorporate a resistor next to the xtal which is connected to test probes for the heating.
However in this case I suspect the purpose of this exercise is to develop an application note for some software libraries, so it doesn't need to be super polished. One would be hard pushed to compete with the 800 quid Garmin product, which probably runs for a week or so between charges. And which doesn't do a whole lot anyway unless there is a BT-connected smartphone.