In the 2.4.3.133 release im curious about the automatic hub reboot if CPU load remains at a critical level for 15 minutes or more Platform Change.
Is this a single reboot or triggered everytime the condition is met?
Is this disableable?
My concern is, is it possible to get stuck in a ~15min reboot loop if a device/app is misbehaving due to the cpu load trigger? ie Can 1 or more spammy or dying devices cause the cpu on the hub to spike and stay spiked and keep a hub in a reboot loop?
I'd put it as "extremely unlikely". There are other mechanisms to limit the load that a single app or device can create on the hub, although they're not ironclad.
I always do these sort of "process" links in a new tab; that way, the results aren't overriding the display of some thing else and I can just close that tab when done.
I would much rather be notified via my own rules and notifications (triggered from a Hub Info Device (V3)) about resource constraints (CPU, Memory, external PING check, etc.). Then I can check what's going on, and decide on a course of action - I have Wireguard VPN access to my LAN and a WiFi plugs on hubs, if I need to "hard boot" (To date, I never have had to do this while remote).
I just don't like the possibility of some repeated 15M automated reboot cycle, if a hub is in some runaway loop. I would much rather investigate myself and then decide what to do (hard reboot, trying to disable some device or App, etc.).
Fortunately, this has never happened to me (yet?), I've only once had Zwave Mesh repeatedly crash due to a bad Jasco device, regardless, I generally don't like "automagical" fixes, potentially running for days on end, that can mask and make problems worse (corrupted DB from hundreds of reboots?). -
I much prefer alerts, and then human troubleshooting and action. - And this goes for most "things" outside of the HE realm. IMHO, overly smart tools, without options to manually intercede will be the downfall of our species (not HE in this particular case - more like thinking of Boeing and MCAS)...
As always, YMMV, but I have it disabled on all my hubs.
FWIW, auto reboot is disabled for one hour after hub boots up...provides plenty of time to use the endpoint to turn off the reboot if you're getting into a (exceedingly rare) reboot loop from this option.