Severe CPU Load?

thebearmay · January 6, 2022, 11:47pm

That sounds about right…

JoshBI555 · January 6, 2022, 11:49pm

So should I expect to see the system free that up after some time?

thebearmay · January 6, 2022, 11:57pm

My production hub usually drops to around 415000 a short time after reboot and then slowly drops over time, dev hub drops to about 520000 after settling in from a reboot and then very very slowly decreases over time. When I hit about 285000 I schedule a reboot.

sburke781 · January 7, 2022, 12:10am

Not sure if it's of any use, but in terms of memory, I noticed in the last few days a difference in the trend of the memory across my 4 HE hubs, 2 have a gentle decrease over time (although this morning one did drop at one point), and the other two bounce around. I do intend to try and understand what is causing the difference in behaviour, mostly because it may be some code I have written.

thebearmay · January 7, 2022, 12:40am

When I have a big drop in memory it is usually because I’ve updated a lot of code and forced several recompiles, or I have a run away task because I made an error in some code that I’m playing with.

sburke781 · January 7, 2022, 12:44am

Thanks. I was probably more interested in why they trend differently, but I will also look into that as well.

thebearmay · January 7, 2022, 12:51am

Trending differences can have a lot of reasons. Can’t prove it, but my suspicion is that the # of http calls, particularly long running ones or ones with dropped connections, has a direct impact. The recompiles I already mentioned, but I’m sure there are other leaks too.

sburke781 · January 7, 2022, 12:59am

Yep, that's the theory I'm working on as well. I've done a few drivers over my time that do make HTTP calls and have only started to transition them to being asynchronous in the last week or so. So I will start by getting my own house in order, then see where I'm at.

JoshBI555 · January 7, 2022, 1:02am

What is everyone using for auto rebooting?

thebearmay · January 7, 2022, 2:00am

I don’t auto-reboot but if I did:

tycalmc · January 7, 2022, 2:18am

I had this same issue and went back to the built in hubitat ecobee app and I have never had a severe CPU again.

672southmain · January 7, 2022, 2:33am

thebearmay · January 7, 2022, 2:37am

…and you got my good side too.

JoshBI555 · January 8, 2022, 3:59pm

Update.

11AM yesterday - 2022-01-07 10:59:16,441000,0.12

11AM today- 2022-01-08 10:54:15,422848,0.01

Continuing to monitor. Hub up time is about 2.5 days now.

thebearmay · January 8, 2022, 4:04pm

Sounds about right for a production hub.

JoshBI555 · January 9, 2022, 11:50pm

Alright. So I ran al little over three days and experienced the same processor jump to 1.0% sustained with free memory 412516. I have disabled some community drivers, switched over to built in ones, and I will see how that goes.

JoshBI555 · January 10, 2022, 2:46pm

Well…

This morning hub completely frozen and hot to the touch. First total freeze up in a few weeks. I have opened a ticket with support to start getting on their radar officially.

thebearmay · January 10, 2022, 3:34pm

Sounds like a process got into a tight processing loop, became a runaway, and the resultant CPU activity overheated hub, and caused a shut down. Would be interesting to hear what the engineering logs show. @gopher.ny

JoshBI555 · January 15, 2022, 8:46pm

Just wanted to keep updating here. I have written to support and given them information. I hope they can look into my logs a little deeper and determine what might have caused these issues.

I will say, since my last lockup the morning of 1/10, I have had over five days of runtime with no CPU spikes and no Lockups. This is by far the longest I have gone in a very long time. I did remove the Ecobee Driver, but I did also notice a Zigbee device sending tons of temperature change reporting through the day. I disabled that reporting as well.

I remain optimistic, that one of these two might have been the issue. Especially because I saw a ton of reporting from the Zigbee device just prior to the lockup. I eventually plan to try a clean install of the Ecobee Suite to see if the problem may have been the sensor all along. The stock driver does not come close to having the features I want, but I am getting by in the meantime.