I am running 2 unprivileged LXC containers on a proxmox-ve 7.1-1 (running kernel: 5.13.19-3-pve). Both lxc's are running docker. The only "exotic" about it is using Fuse for the docker filesystem. The have nesting,keyctl,fuse,mknod =1.
They run fine for hours, sometimes days, but after a while, the entire host goes "offline" (keep reading..):
I have left the node alone for the time being, so if any debug info is required please let me know, as I can still access everything via ssh.
Thank you!
Edit: I have attached dmesg log. I cannot attach journalctl because it's 1.1mb zipped and that is apparently too large a file.
				
			They run fine for hours, sometimes days, but after a while, the entire host goes "offline" (keep reading..):
- Grey question mark appears on pve node/lxc's
 - Shell access (via web UI) and ssh is still available to the PVE host
 - Shell access (via web UI) and ssh is still available to the LXC containers
 - Eventually the node turns red (sometimes takes hours)
 - The system refuses to reboot or shutdown unless I use the forceful "magic SysRq sysproc" option
 - My guess is when I do a shutdown, pve gets stuck trying to do a clean shutdown of the lxc containers
 - I have tried killing the lxc processes and daemons but nothing happens
 - I have tried restarting the various pve daemons but it does not change host state
 - Eventually something times out (hours+) and PVE host reboots, returning the node state to green
 
I have left the node alone for the time being, so if any debug info is required please let me know, as I can still access everything via ssh.
Thank you!
Edit: I have attached dmesg log. I cannot attach journalctl because it's 1.1mb zipped and that is apparently too large a file.
Attachments
			
				Last edited: