I am running 2 unprivileged LXC containers on a proxmox-ve 7.1-1 (running kernel: 5.13.19-3-pve). Both lxc's are running docker. The only "exotic" about it is using Fuse for the docker filesystem. The have nesting,keyctl,fuse,mknod =1.
They run fine for hours, sometimes days, but after a while, the entire host goes "offline" (keep reading..):
I have left the node alone for the time being, so if any debug info is required please let me know, as I can still access everything via ssh.
Thank you!
Edit: I have attached dmesg log. I cannot attach journalctl because it's 1.1mb zipped and that is apparently too large a file.
They run fine for hours, sometimes days, but after a while, the entire host goes "offline" (keep reading..):
- Grey question mark appears on pve node/lxc's
- Shell access (via web UI) and ssh is still available to the PVE host
- Shell access (via web UI) and ssh is still available to the LXC containers
- Eventually the node turns red (sometimes takes hours)
- The system refuses to reboot or shutdown unless I use the forceful "magic SysRq sysproc" option
- My guess is when I do a shutdown, pve gets stuck trying to do a clean shutdown of the lxc containers
- I have tried killing the lxc processes and daemons but nothing happens
- I have tried restarting the various pve daemons but it does not change host state
- Eventually something times out (hours+) and PVE host reboots, returning the node state to green
I have left the node alone for the time being, so if any debug info is required please let me know, as I can still access everything via ssh.
Thank you!
Edit: I have attached dmesg log. I cannot attach journalctl because it's 1.1mb zipped and that is apparently too large a file.
Attachments
Last edited: