echo 0 > /proc/sys/kernel/hung_task_timeout_secs disables this message

mehmethanifi

Renowned Member
Dec 15, 2016
61
4
73
39
Hello friends,

I have installed proxmox on the Dell r720 server. Installation is finished, but there is an error or warning on the screen.
I don't know what this warning causes. I am requesting your support.

root@pve1:~# pveperf
CPU BOGOMIPS: 96016.80
REGEX/SECOND: 1102614
HD SIZE: 93.99 GB (/dev/mapper/pve-root)
BUFFERED READS: 260.88 MB/sec
AVERAGE SEEK TIME: 4.81 ms
FSYNCS/SECOND: 674.49
DNS EXT: 1040.51 ms
DNS INT: 1002.70 ms (kas.local)
root@pve1:~# pveperf /srv/vms
CPU BOGOMIPS: 96016.80
REGEX/SECOND: 1213904
df: /srv/vms: No such file or directory
DNS EXT: 1003.36 ms
DNS INT: 1002.49 ms (kas.local)

root@pve1:~# pveversion -v
proxmox-ve: 5.4-1 (running kernel: 4.15.18-12-pve)
pve-manager: 5.4-3 (running version: 5.4-3/0a6eaa62)
pve-kernel-4.15: 5.3-3
pve-kernel-4.15.18-12-pve: 4.15.18-35
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-50
libpve-guest-common-perl: 2.0-20
libpve-http-server-perl: 2.0-13
libpve-storage-perl: 5.0-41
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.1.0-3
lxcfs: 3.0.3-pve1
novnc-pve: 1.0.0-3
proxmox-widget-toolkit: 1.0-25
pve-cluster: 5.0-36
pve-container: 2.0-37
pve-docs: 5.4-2
pve-edk2-firmware: 1.20190312-1
pve-firewall: 3.0-19
pve-firmware: 2.0-6
pve-ha-manager: 2.0-9
pve-i18n: 1.1-4
pve-libspice-server1: 0.14.1-2
pve-qemu-kvm: 2.12.1-3
pve-xtermjs: 3.12.0-1
qemu-server: 5.0-50
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.13-pve1~bpo2

root@pve1:~# pveversion -v
proxmox-ve: 5.4-1 (running kernel: 4.15.18-12-pve)
pve-manager: 5.4-3 (running version: 5.4-3/0a6eaa62)
pve-kernel-4.15: 5.3-3
pve-kernel-4.15.18-12-pve: 4.15.18-35
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-50
libpve-guest-common-perl: 2.0-20
libpve-http-server-perl: 2.0-13
libpve-storage-perl: 5.0-41
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.1.0-3
lxcfs: 3.0.3-pve1
novnc-pve: 1.0.0-3
proxmox-widget-toolkit: 1.0-25
pve-cluster: 5.0-36
pve-container: 2.0-37
pve-docs: 5.4-2
pve-edk2-firmware: 1.20190312-1
pve-firewall: 3.0-19
pve-firmware: 2.0-6
pve-ha-manager: 2.0-9
pve-i18n: 1.1-4
pve-libspice-server1: 0.14.1-2
pve-qemu-kvm: 2.12.1-3
pve-xtermjs: 3.12.0-1
qemu-server: 5.0-50
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.13-pve1~bpo2

There are 2 pieces of Intel (R) Xeon (R) CPU E5-2620 0 @ 2.00GHz available.

There is 80 gb ram on the device.

I installed raid10 with 4 15k 300 gb sas disks.

I installed the operating system on it.
 

Attachments

  • Ekran Görüntüsü - 2019-10-24 14-43-08.png
    Ekran Görüntüsü - 2019-10-24 14-43-08.png
    617.3 KB · Views: 94
In the screenshot you see that lzop is the binary that is blocked for more than 120 sec. Check the resources of your server. Maybe it's a backup and the storage is slow?
 
There is a backup that I started at 22:00 at night. This backup takes too long. Maybe that's what you're talking about.

Does this error damage my server during working hours? Is the VM running on it affected?

Finally, it is recommended that I do what I do in such cases.
 
Does this error damage my server during working hours?
Not likely.

Is the VM running on it affected?
Yes, the backup process (vzdump) run in qemu and intercepts writes to the storage, so it can write those blocks also to the backup. This will slow down the VM. Noticeable or not, depends on your estimate.

Finally, it is recommended that I do what I do in such cases.
You need to find out, what causes the slow down of the backup (storage, network, compression, ...).
 
That is up to you, if you believe that will resolve the issue (whichever it really is).
 
Hi, I have the same problem since I switched from a simple HP 8200 PC to an HP ML110 server. Recovery takes a long time and several messages like this appear: echo 0 > /proc/sys/kernel/hung_task_timeout_secs disables this message. After recovery it works normally, but if I try to perform another virtual machine recovery, the whole system slows down. I'm worried about the reliability of the system.
Any suggestions?
Thanks
 
@vinci, as said above, check the node resource usage. The storage seems slow.