The results of your experiment are according to theory (simplified)!
By applying the same power they:
- increased their temperature in the same rate, because they have about the same mass and (as they seem, probably due to the similar materials...) the same thermal capacity.
- reached the same final temperature, because they have about the same external surface area, they are colored similarly and in the same environment, so they dissipated the same amount of energy.
But this test is not relevant with what you wonder!
The power rating of a device (resistor, transistor, transformer ...) it is an indication of its' relative durability when it operates in some ranges, defined by its' specifications: voltage, current, temperature...
So it is needed to study the data-sheets to decide which model is capable for your application, before the purchase. By the way, have in mind that a resistor, except of the resistance and power rating, has also other characteristics, like insulation and inductance.
About testing the resistors:
You can evaluate some characteristics of them, like measuring the resistance and variation of it, the inductance, the insulation, the recovering after a temperature increase or decrease ...
The maximum power of the device can only be a destructive test with prerequisites as already described by others earlier.