[SOLVED] reboots hang with "watchdog did not stop"

emmanuel

New Member
Dec 5, 2017
This is caused by a BIOS setting:
In the BIOS I changed the "OS" option from Windows 7 to Android or Linux or Windows 8x, and the reboot and shutdown issues were gone. I was able to reproduce it.

Still having problems?
Try adjusting these values:
nano /etc/systemd/system.conf
RuntimeWatchdogSec=
ShutdownWatchdogSec=
Well I'll be.......that did it! Thanks goseph. What's weird though is that this did not use to happen. :/
 

rordonez

New Member
Aug 4, 2010
I provisioned a twin server like this one (HP ProLiant DL160 G8)
and the problem did not go away.
It started after running Windows KVM virtual machines on the server (Windows Server 2008 R2 or Windows Server 2016).

So this solution is not the correct one.
I changed the settings to:
RuntimeWatchdogSec=0
ShutdownWatchdogSec=0

The problem still occurs with:
Intel CPU Hyper-Threading enabled / VT extensions enabled


The latest HP BIOS does not have an option to select the OS, so I am still stuck with this issue.

hth

Rodrigo O
Xnet


////////////
proxmox-ve: 5.2-2 (running kernel: 4.15.17-1-pve)
pve-manager: 5.2-1 (running version: 5.2-1/0fcd7879)
pve-kernel-4.15: 5.2-1
pve-kernel-4.15.17-1-pve: 4.15.17-9
corosync: 2.4.2-pve5
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.0-8
libpve-apiclient-perl: 2.0-4
libpve-common-perl: 5.0-31
libpve-guest-common-perl: 2.0-16
libpve-http-server-perl: 2.0-8
libpve-storage-perl: 5.0-23
libqb0: 1.0.1-1
lvm2: 2.02.168-pve6
lxc-pve: 3.0.0-3
lxcfs: 3.0.0-1
novnc-pve: 0.6-4
proxmox-widget-toolkit: 1.0-18
pve-cluster: 5.0-27
pve-container: 2.0-23
pve-docs: 5.2-3
pve-firewall: 3.0-8
pve-firmware: 2.0-4
pve-ha-manager: 2.0-5
pve-i18n: 1.0-5
pve-libspice-server1: 0.12.8-3
pve-qemu-kvm: 2.11.1-5
pve-xtermjs: 1.0-5
qemu-server: 5.0-26
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.8-pve1~bpo9
 

leshch

New Member
Oct 17, 2018
Try upgrading the kernel. It helped in my case.

apt-get dist-upgrade
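
To check whether the upgrade actually pulled in a newer kernel, it can help to compare the running kernel before and after. What follows is only a minimal sketch, not something from the original post: the helper name `running_kernel` is hypothetical, and it assumes the first line of `pveversion -v` has the `(running kernel: ...)` form shown elsewhere in this thread. Running `apt-get update` before the upgrade is also an assumption on my part, so that the package lists are current.

```shell
#!/bin/sh
# Hypothetical helper (not from the original post): read `pveversion -v`
# output on stdin and print only the running kernel version.
running_kernel() {
    sed -n 's/.*(running kernel: \([^)]*\)).*/\1/p'
}

# Example, to be run as root on the Proxmox host:
#   pveversion -v | running_kernel    # note the value before upgrading
#   apt-get update && apt-get dist-upgrade
#   reboot
#   pveversion -v | running_kernel    # compare with the earlier value
```

If both calls print the same version, the upgrade probably did not install a new kernel, which may point at a missing package repository.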
 

rordonez

New Member
Aug 4, 2010
I think I'm running the latest kernel from the no-subscription repo (4.15.17-1-pve). Could you share your kernel version?

proxmox-ve: 5.2-2 (running kernel: 4.15.17-1-pve)
 

rordonez

New Member
Aug 4, 2010
I found an ugly workaround.

I was able to diagnose a bit further.
The test server has only one KVM VM, running Windows Server 2008.

If you issue:
qm stop vmid

then the reboot works fine.

However, if I issue:
qm shutdown vmid

the server hangs and cannot reboot or shut down.
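
The workaround described here could be scripted roughly as follows. This is a sketch under stated assumptions, not a tested procedure: the function names `running_vmids` and `stop_vms_and_reboot` are hypothetical, and the parsing assumes that `qm list` prints a header line followed by columns in the order VMID, NAME, STATUS, with VM names that contain no spaces.

```shell
#!/bin/sh
# Hypothetical helper: read `qm list` output on stdin and print the VMID
# of every VM whose STATUS column is "running".
# Assumes a header line, then columns VMID NAME STATUS ... per VM.
running_vmids() {
    awk 'NR > 1 && $3 == "running" { print $1 }'
}

# Hypothetical wrapper: hard-stop every running VM, then reboot the host.
# `qm stop` ends the VM process immediately, like pulling the power plug,
# so the guest gets no chance to shut down cleanly.
stop_vms_and_reboot() {
    for vmid in $(qm list | running_vmids); do
        qm stop "$vmid"
    done
    reboot
}
```

Nothing runs until `stop_vms_and_reboot` is called as root on the host. Because the guests are stopped without a clean shutdown, unsaved guest data may be lost, which is why this remains an ugly workaround rather than a fix.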

It looks like the shutdown sends a message to the RAID card (a P420 in this test) that causes the hang, or makes the system wait for a message that never arrives.

If anybody has any ideas, please share them.

regards

Rodrigo O
Xnet
 

leshch

New Member
Oct 17, 2018
Before update:

Code:
pveversion -v

proxmox-ve: 5.2-2 (running kernel: 4.15.17-1-pve)
pve-manager: 5.2-6 (running version: 5.2-6/bcd5f008)
pve-kernel-4.15: 5.2-1
pve-kernel-4.15.17-1-pve: 4.15.17-9
corosync: 2.4.2-pve5
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.0-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-37
libpve-guest-common-perl: 2.0-17
libpve-http-server-perl: 2.0-9
libpve-storage-perl: 5.0-23
libqb0: 1.0.1-1
lvm2: 2.02.168-pve6
lxc-pve: 3.0.0-3
lxcfs: 3.0.0-1
novnc-pve: 1.0.0-2
proxmox-widget-toolkit: 1.0-19
pve-cluster: 5.0-29
pve-container: 2.0-24
pve-docs: 5.2-5
pve-firewall: 3.0-13
pve-firmware: 2.0-5
pve-ha-manager: 2.0-5
pve-i18n: 1.0-6
pve-libspice-server1: 0.12.8-3
pve-qemu-kvm: 2.11.2-1
pve-xtermjs: 1.0-5
qemu-server: 5.0-30
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.9-pve1~bpo9
After update:

Code:
proxmox-ve: 5.2-2 (running kernel: 4.15.18-7-pve)
pve-manager: 5.2-9 (running version: 5.2-9/4b30e8f9)
pve-kernel-4.15: 5.2-10
pve-kernel-4.15.18-7-pve: 4.15.18-27
pve-kernel-4.15.17-1-pve: 4.15.17-9
corosync: 2.4.2-pve5
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.0-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-40
libpve-guest-common-perl: 2.0-18
libpve-http-server-perl: 2.0-11
libpve-storage-perl: 5.0-30
libqb0: 1.0.1-1
lvm2: 2.02.168-pve6
lxc-pve: 3.0.2+pve1-2
lxcfs: 3.0.2-2
novnc-pve: 1.0.0-2
proxmox-widget-toolkit: 1.0-20
pve-cluster: 5.0-30
pve-container: 2.0-28
pve-docs: 5.2-8
pve-firewall: 3.0-14
pve-firmware: 2.0-5
pve-ha-manager: 2.0-5
pve-i18n: 1.0-6
pve-libspice-server1: 0.12.8-3
pve-qemu-kvm: 2.11.2-1
pve-xtermjs: 1.0-5
qemu-server: 5.0-36
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.11-pve1~bpo1
After the kernel update the message still appears, but only briefly, and then the server reboots.
To remove the message completely, I did this:

vim /etc/systemd/system.conf
RuntimeWatchdogSec=0
ShutdownWatchdogSec=0
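
For anyone who prefers not to edit the file by hand, the same change could be made with a small script. This is a minimal sketch only: the function name `disable_watchdogs` is hypothetical, it assumes GNU sed (for `-i`), and it assumes both keys already exist in the file, either active or commented out with a leading `#`.

```shell
#!/bin/sh
# Hypothetical helper: set both watchdog timeouts to 0 in a systemd
# manager config file. Matches active lines and lines commented out
# with a single leading "#". Lines for other settings are left alone.
# Assumes GNU sed and that both keys are already present in the file.
disable_watchdogs() {
    conf="$1"
    sed -i \
        -e 's/^#\{0,1\}RuntimeWatchdogSec=.*/RuntimeWatchdogSec=0/' \
        -e 's/^#\{0,1\}ShutdownWatchdogSec=.*/ShutdownWatchdogSec=0/' \
        "$conf"
}

# Example (as root; keeps a backup copy first):
#   cp /etc/systemd/system.conf /etc/systemd/system.conf.bak
#   disable_watchdogs /etc/systemd/system.conf
```

The new values should be picked up at the next boot. As far as I know, `systemctl daemon-reexec` also makes systemd re-read the file without a reboot, but treat that as an assumption and verify it on your own system.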
 

rordonez

New Member
Aug 4, 2010
Thanks for sharing.

I had forgotten to add the following lines to /etc/apt/sources.list:

# PVE pve-no-subscription repository provided by proxmox.com,
# NOT recommended for production use
deb http://download.proxmox.com/debian/pve stretch pve-no-subscription
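
Adding the repository line can also be done from a script in a way that is safe to repeat. The following is a sketch, not an official procedure: the names `PVE_REPO_LINE` and `add_pve_repo` are hypothetical, and the repository line itself is copied from the lines above (Proxmox VE 5.x on Debian stretch).

```shell
#!/bin/sh
# Repository line quoted from this post (Proxmox VE 5.x, Debian stretch).
PVE_REPO_LINE='deb http://download.proxmox.com/debian/pve stretch pve-no-subscription'

# Hypothetical helper: append the line to the given sources file unless
# an identical line is already present, so repeated runs add it only once.
add_pve_repo() {
    list_file="$1"
    grep -qxF "$PVE_REPO_LINE" "$list_file" 2>/dev/null ||
        printf '%s\n' "$PVE_REPO_LINE" >> "$list_file"
}

# Example (as root):
#   add_pve_repo /etc/apt/sources.list
#   apt-get update && apt-get dist-upgrade
```

Without this repository (or a subscription repository), `apt-get dist-upgrade` has no source for newer pve-kernel packages, which would explain why the earlier upgrade attempt left the kernel at 4.15.17-1-pve.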

Afterwards everything worked fine on the HP server with the P420.

Reboot and shutdown are both fixed. Thanks for your patience.

proxmox-ve: 5.2-2 (running kernel: 4.15.18-7-pve)
 
