I am running the v6 beta on a NanoPi Neo with DietPi.
This is just a bare beta installation, with nothing else.
LAN cable connection. Web access via HTTPS.
Everthing is up to date and all repos are on the v6-development branches.
Today once again (like already a day before) I could not reach the Web Interface due to a connection timeout at the first try.
The DietPi welcome message showed a system's CPU temperature above 70°C this time. (Ok, I am running the system without a heatsink, so the temperature can rise quite fast. In normal operation, however, the CPU always remains between 35 and 45°C.)
On the pihole diagnostics page a message is stating that the 15min avg load was higher then the number or processors.
But while observing the CPU load with htop the 15min avg is then fallen to below 0.3 within 20mins, even with a client connected to the dashboard.
What I noticed BTW:
FTL.log reported an SQLite3 error at 21:55:32.526.
The 'ERROR' wording in the taillog should be in red.
The long term overload warning appears multiple times in the log.
But on the Pi-hole diagnostics page the critical avg load value in the message reduced itself (with every warning occurrence).
It would make sense to state the highest value in the message instead of the last one - right?
Any idea what I could also check or look for at the next time?
What is causing the SQLite3 error?
You may find the atop package useful for this. It's like a recording version of htop. You can review historical loads after something has happened and see what the cause was.
I mentioned it in another post with some shortcut tips, and a link to a larger guide.
No, I didn't delete any log file myself.
But that seems to be normal due to DietPi's RAMlog configuration:
root@DietPi:/var/log/pihole# ls -l
total 0
-rw-r----- 1 pihole pihole 0 Mar 17 23:17 FTL.log
-rw-r----- 1 pihole pihole 0 Mar 18 10:17 pihole.log
-rw-r----- 1 pihole pihole 0 Mar 18 00:17 pihole.log.1
-rw-r----- 1 root pihole 0 Mar 18 00:17 pihole_debug.log
-rw-r----- 1 pihole pihole 0 Mar 17 22:17 pihole_updateGravity.log
-rw-r----- 1 pihole pihole 0 Mar 17 22:17 webserver.log
Looks like unfortunately the logs were cleared on the hour, right before I created the debug log.
Via the web interface the logs are even now visible.
I am not sure yet. I have not created or edited any cron jobs.
I assume that FTL checks the CPU loads every 5 minutes and therefore the warnings were displayed at this interval.
Thanks. I have now installed atop and waiting for a another occurrance.
Load is a multi-dimensional metric. Your disk is slow and your system has to wait for data often? High load. Your network is slow and you are waiting? High load. Your RAM is slow or there are many context changes and cache invalidations? You name it!
It is a good metric to know something is making your system less responsive as it could be. Finding what caused this exactly is more work.
Bad analogy time: Compare it to lending your car to someone else and getting it back the next day without any fuel left. Did they go really fast on the Autobahn and used lots of fuel quickly, or did they drive a long way, or was the engine running in standby without any movement for 24 hours? You will never know from the mere fact of "empty fuel"/"high load".
Checking the atop logs I see repeating high disk busy values while
while FTL.log fills up with this messages
ERROR add_message(type=6, message=excessive load) - SQL error step DELETE: database is locked
ERROR Error while trying to close database: database is locked
ERROR log_resource_shortage(): Failed to add message to database
Looks to me like the SD card is about to fail. I will try a new SD card...