Hi I'm experiencing a problem with one of the containers in my system. The container runs fine for couple days then suddenly systemd receives a SIGRTMIN+3 signal and the container stays in a zombie state with only /sbin/init left running. Stopping and restarting the container does not help, only restart of the host system seems to resolve the state of the container. All other containers in the system are working fine though, so this is somehow isolated to this one instance.
running processes when container gets to this state:
syslog (don't have GUI in the container so not sure what the last line is about):
host setup:
running processes when container gets to this state:
Code:
root@LXC-TimescaleDB:~# ps aux
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
root 1 0.0 0.1 45816 4416 ? Ss 19:20 0:00 /sbin/init
root 23 0.0 0.0 6268 2508 ? Ss 19:20 0:00 /bin/bash
root 39 0.0 0.0 15984 1840 ? R+ 19:24 0:00 ps aux
syslog (don't have GUI in the container so not sure what the last line is about):
Code:
root@LXC-TimescaleDB:~# cat /var/log/syslog
Feb 20 00:00:01 LXC-TimescaleDB systemd[1]: Started Rotate log files.
Feb 20 00:05:01 LXC-TimescaleDB CRON[17232]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 00:08:01 LXC-TimescaleDB CRON[17241]: (root) CMD ( test -x /etc/cron.daily/popularity-contest && /etc/cron.daily/popularity-contest --crond)
Feb 20 00:15:01 LXC-TimescaleDB CRON[17259]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 00:25:01 LXC-TimescaleDB CRON[17283]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 00:35:01 LXC-TimescaleDB CRON[17307]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 00:45:01 LXC-TimescaleDB CRON[17331]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 00:54:01 LXC-TimescaleDB CRON[17353]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Feb 20 00:55:01 LXC-TimescaleDB CRON[17358]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 01:05:01 LXC-TimescaleDB CRON[17383]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 01:15:01 LXC-TimescaleDB CRON[17407]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 20 01:21:07 LXC-TimescaleDB systemd[1]: Received SIGRTMIN+3.
Feb 20 01:21:07 LXC-TimescaleDB systemd[1]: Stopped target Graphical Interface.
host setup:
Code:
root@bear:~# pveversion --verbose
proxmox-ve: 5.3-1 (running kernel: 4.15.18-10-pve)
pve-manager: 5.3-8 (running version: 5.3-8/2929af8e)
pve-kernel-4.15: 5.3-1
pve-kernel-4.15.18-10-pve: 4.15.18-32
pve-kernel-4.15.18-9-pve: 4.15.18-30
pve-kernel-4.15.17-1-pve: 4.15.17-9
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-3
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-44
libpve-guest-common-perl: 2.0-19
libpve-http-server-perl: 2.0-11
libpve-storage-perl: 5.0-36
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.1.0-2
lxcfs: 3.0.2-2
novnc-pve: 1.0.0-2
proxmox-widget-toolkit: 1.0-22
pve-cluster: 5.0-33
pve-container: 2.0-33
pve-docs: 5.3-1
pve-edk2-firmware: 1.20181023-1
pve-firewall: 3.0-17
pve-firmware: 2.0-6
pve-ha-manager: 2.0-6
pve-i18n: 1.0-9
pve-libspice-server1: 0.14.1-2
pve-qemu-kvm: 2.12.1-1
pve-xtermjs: 3.10.1-1
qemu-server: 5.0-45
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.12-pve1~bpo1