Odd problems: storage space being consumed & UI errors / timing out, etc.

verulian

Renowned Member
Feb 18, 2019
I've been having some strange problems with my Proxmox installation.

First, when I run `apt update && apt upgrade`, after it has been running for a while I see out-of-storage errors:
Job for pve-ha-lrm.service failed because a timeout was exceeded.
See "systemctl status pve-ha-lrm.service" and "journalctl -xe" for details.
Failed to reload daemon: Refusing to reload, not enough space available on /run/systemd. Currently, 0B are free, but a safety buffer of 16.0M is enforced.


I basically can't do anything at that point. Even `df` won't execute and the system hangs. A `reboot` gets things back to a functional state, where I can force dpkg to continue on its merry way without any problem.

However, I'm seeing other problems. When I'm in the web GUI I get this quite often and have to keep refreshing and reloading the UI:
error.png

Any idea of what could be going on here?
 
Is this server running on a very small amount of RAM?

Quite a few of the /run folders are on tmpfs, which is storage backed by RAM.
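To see how full /run is relative to the RAM in the box, something like the following could be used. It is only a rough sketch: `usage_pct` and `mem_total_mib` are made-up helper names (not Proxmox tools), and it assumes a Linux host with `/proc/meminfo` and a POSIX `df`.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; not part of Proxmox.

# Print how full the filesystem behind a path is, as a bare percentage.
# Defaults to /run when no path is given.
usage_pct() {
    # df -P prints one POSIX-format line per filesystem; field 5 is "Capacity".
    df -P "${1:-/run}" | awk 'NR == 2 { sub(/%/, "", $5); print $5 }'
}

# Print total RAM in MiB, read from /proc/meminfo (Linux only).
mem_total_mib() {
    awk '/^MemTotal:/ { printf "%d\n", $2 / 1024 }' /proc/meminfo
}

# Example call:
#   echo "/run is $(usage_pct /run)% full, RAM: $(mem_total_mib) MiB"
```

If the percentage for /run creeps up over time while total RAM is small, that would fit the "not enough space available on /run/systemd" message.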
 
Image+2020-01-02+at+11.56.39+PM.png
 
One of the crazy things that keeps happening, intermittently but often enough to interrupt all of my work, is this in the web UI:

connection_error.jpg


$ htop
htop.jpg
 
Have you checked the browser console (normally F12) when this happens to see what errors, if any, are shown?

On the apt error: the /run folder has 1.6GB on a ramdisk, so there must have been a lot of temp files in /run that maxed out the tmpfs.

That is why a reboot fixed it: the contents of a ramdisk are not kept across a reboot.
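If it fills up again, it may be worth listing the biggest entries under /run before rebooting. A minimal sketch, assuming standard `du`, `sort` and `head` are available; `largest_entries` is a made-up helper name:

```shell
#!/bin/sh
# Hypothetical helper for illustration; not part of Proxmox.

# List the largest files and directories under a path, biggest first.
#   $1 = directory to scan (default: /run)
#   $2 = number of lines to show (default: 10)
largest_entries() {
    # -a: include files as well as directories
    # -k: report sizes in KiB
    # -x: stay on the same filesystem as the starting directory
    du -akx "${1:-/run}" 2>/dev/null | sort -rn | head -n "${2:-10}"
}

# Example call:
#   largest_entries /run 15
```

The first line is the total for the directory itself; the lines after it point at whatever is taking the space. Note that `du` cannot see files that were deleted but are still held open by a process, and a tmpfs can also run out of inodes rather than bytes, which `df -i /run` would show.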
 
That's what I was thinking, but I don't know what would be causing that here... I guess I'll have to dig about next time it happens, but so far it has been a kind of "panic" situation whenever it happened.