Here's a bizarre problem I've been fighting with over the past three weeks or so.
One of my Hubitat C4's keeps locking up (unresponsive to ping, it has a DHCP reservation). I already carried out a soft-reset and restore, it seems to last another 2 or 3 days then does it again. Nothing spurious in the logs, temperatures appear normal, memory use normal. Power LED still on and network LEDs still showing signs of activity. This is a ZigBee only hub, ZigBee is "dead" at this point because I have some sockets that start flashing a panic orange LED when they lose ZigBee ....
I need to get round to migrating this C4 over to a C5 or C7 .... I acknowledge that but have been putting it off for various reasons (most mentioned elsewhere in the forum). The C4 crashing isn't the part I'm flummoxed by.
So here's the bizarre part .... after the C4 goes offline (I use Nagios monitoring, it's pretty fast, usually pings me within 30 seconds or so of a host or service problem) various other unrelated devices "die".
Without fail, this happens every time within two or three minutes of the C4 going down.
6 x Foscam cameras - these are all hard-wired, on various different UPS, spread over 3 different network switches. It's not a network problem. These all drop simultaneously and I notice it immediately as they're on my desktop PC in a live view matrix. All 6 disappear at the same time, then shortly after the Nagios alerts come in. When looking physically at the cameras the power LEDs are "offf". Some are PTZ doing patrols and they stop moving. There is no life. Simply pulling the power barrel out and reinserting forcing them to power cycle brings them back online.
Hubitat C7 - has about a dozen Z-Wave devices on it, no ZigBee. Negligible memory and temperatures. Nagios alerts me it's offline and it is. Non responsive to ping. Static IP, still showing power LED and still showing network activity. On an entirely different network switch and UPS to the C4 that goes down first. Have to power cycle to bring it back.
HD HomeRun - a DVB-T streaming device. Nagios alerts that it's down. Again on a different network switch to the other equipment. Power cycle brings it back.
As I said, this is repeatable and has been doing it for a month or so.
I've no idea how this can be happening, for example as far as I'm aware the Foscam cameras don't have a "shutdown" command that could be used to turn them off.
I also have another C4 (about 70 ZigBee devices) and that one remains OK throughout.
So far it's just the devices mentioned above that go down.
Appreciate any thoughts on why it might be happening!
My best guess is the C4 crashing must be sending some spurious "cr4p" over the network that's causing certain other devices to crash. But only some. I have over 150 IP connected devices mixed over WiFi and Ethernet.