Proxmox 5.4, fully updated, kernel: 4.15.18-12. We do not use OOB or clustering. pveversion output below.
We have a Proxmox system that hangs at the "Reached target Reboot" console message on roughly every other reboot. Left unattended during a full hang, the system finally reboots after about 30 minutes, and the system logs confirm that duration. That period hardly seems coincidental; it looks as though it is hard-coded somewhere.
The root cause of the delay is not yet known. A web search turns up many similar reports and points to two common causes: the NFS client hanging, or the swap partition not being released.
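One way I could test the "hard-coded" suspicion, assuming the 30 minutes is a systemd job timeout on reboot.target (I have not confirmed that on this system). This is only a sketch: the function name job_timeouts is my own, and the default path is where Debian-based systems usually ship the unit file.

```shell
# Print any JobTimeout* settings found in a systemd unit file.
# The default path is an assumption (typical Debian location); pass another
# file as the first argument to inspect a different unit.
job_timeouts() {
    grep -E '^JobTimeout(Sec|Action)=' "${1:-/lib/systemd/system/reboot.target}"
}
```

Running job_timeouts with no argument reads the packaged unit file only. Drop-ins under /etc/systemd/system could override it, so "systemctl cat reboot.target" would give the fuller picture.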
Yesterday I locally rebooted the server after updates. Before starting the reboot I opened a debug-shell console (systemctl start debug-shell).
The system again seemed to hang at 'Reached target Reboot'. I toggled to the debug console and found swap not yet unmounted. I don't know if that is normal. I barely had time to investigate further because at that moment, the system continued with the reboot.
This time the system did not hang for the full 30 minutes, but I was also unable to collect or view any meaningful data. I cannot say for certain whether the several-second delay I witnessed yesterday is normal.
Like many VM servers, this one is not rebooted often, so collecting data is a challenge. Without resolving this delay we cannot reboot this system remotely.
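To capture more in the short window the debug shell gives me, I am considering preparing something like the following before the next reboot. It is a rough sketch, not a tested procedure: the function names (active_swap, nfs_mounts, shutdown_snapshot) are my own, and the optional file arguments exist only so the parsing can be tried on sample data. It reads /proc/swaps and /proc/mounts and asks systemd for its pending jobs.

```shell
# List active swap areas (skips the header line of /proc/swaps).
active_swap() {
    awk 'NR > 1 { print $1 }' "${1:-/proc/swaps}"
}

# List mount points whose filesystem type is nfs or nfs4.
nfs_mounts() {
    awk '$3 == "nfs" || $3 == "nfs4" { print $2 }' "${1:-/proc/mounts}"
}

# Print one timestamped snapshot of the two suspects plus pending systemd jobs.
shutdown_snapshot() {
    echo "=== $(date) ==="
    echo "swap still active:"
    active_swap
    echo "NFS still mounted:"
    nfs_mounts
    echo "pending systemd jobs:"
    systemctl list-jobs --no-pager 2>/dev/null || echo "(systemctl unavailable)"
}
```

From the debug shell I would source the file and run shutdown_snapshot, redirecting the output to a file on local disk followed by sync, assuming the root filesystem is still writable at that point in the shutdown.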
Any ideas of the root cause of this delay?
Thanks much.
pveversion -v:
Code:
proxmox-ve: 5.4-1 (running kernel: 4.15.18-12-pve)
pve-manager: 5.4-3 (running version: 5.4-3/0a6eaa62)
pve-kernel-4.15: 5.3-3
pve-kernel-4.15.18-12-pve: 4.15.18-35
pve-kernel-4.15.18-11-pve: 4.15.18-34
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-50
libpve-guest-common-perl: 2.0-20
libpve-http-server-perl: 2.0-13
libpve-storage-perl: 5.0-41
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.1.0-3
lxcfs: 3.0.3-pve1
novnc-pve: 1.0.0-3
proxmox-widget-toolkit: 1.0-25
pve-cluster: 5.0-36
pve-container: 2.0-37
pve-docs: 5.4-2
pve-edk2-firmware: 1.20190312-1
pve-firewall: 3.0-19
pve-firmware: 2.0-6
pve-ha-manager: 2.0-9
pve-i18n: 1.1-4
pve-libspice-server1: 0.14.1-2
pve-qemu-kvm: 2.12.1-3
pve-xtermjs: 3.12.0-1
qemu-server: 5.0-50
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.13-pve1~bpo2