Hi everyone,
I have a small question about a suspect relay.
First of all, the device the relay comes from is likely beyond economical repair. But I was tasked to take a look at the device and look for a possible failure reason.
The device in question is an about 4 years old APC Automatic Transfer Switch. This device has two AC Inputs, and can switch between then without interrupting the output. So it is used as fail-over PDU for devices that only have one power supply themselves.
Reported failure mode: Does not switch over anymore, or does not turn on anymore. More details are not known. Could not test myself, since my boss did not want to risk blowing a fuse. (Don't ask

)
After visual inspection there was nothing showing any visible defects. No bulged caps, no burned resistors, no PCB discoloration at all. Except for a bit of dust, looks like fresh out of the factory. Wiring looks high quality, with crimped and isolated spade terminals. The "smarts" of the device are delegated to two sub-modules, one connected with a flatcable, the other module interestingly plugged into an old style 72pin EDO-RAM socked

Measured the input MOVs, no issues. Two fuses I found are both intact. But those are not inline the power switching anyway.
The input relays are two HASCO HATF902ASAC230 (
http://www.hascorelays.com/electromechanical-relays/hat902-series-spdt-30-40-amp.html) for each channel. The coils of those two are directly connected to their respective mains input.
I desoldered one of them and measured it. Coil resistance is a bit above 10k (should be about 15k according to the datasheet). Measuring the NO contact, I got something about 3 Megaohms... so yeah

After I cracked the relay open, I noticed three things: a typical "burned electronics " smell, black marks on the contact area and a slight sticking of the contacts when I manually close and release them.
My conclusion: The power supplied to at least one of the inputs is unstable, having caused many switching cycles, therefore wearing out the relay. As a consequence, in addition to replacing the ATS (already happened), we should also look into the stability of the circuit itself, otherwise more equipment damage may (or will?) happen sooner or later.
Any other opinions?