HI!
We started to have a BIG problem with one of our proxmox sevrers. It started to hang everynight! A hardware reset is needed to re-run it. Nagios is all red for VMs, no ssh access. Just KVM and power breaker do their job. Even a shutdown from the console is not helping. FYI, we have lxc / kvm machines there -- +150 LXC / ~10 KVM.
Server has:
CPU(s): 96 x Intel(R) Xeon(R) Gold 6248R CPU @ 3.00GHz (2 Sockets)
Kernel Version: Linux 5.4.203-1-pve #1 SMP PVE 5.4.203-1 (Fri, 26 Aug 2022 14:43:35 +0200)
PVE Manager Version: pve-manager/6.4-15/af7986e6
RAM: 1.5TB
HDD: SOFT RAID-5 - 6 x 1.92TB NVME -- DATA / 2 x 960 GB SOFT RAID-1 -- SYSTEM
After a physical reset all is well during a whole day. Then at night again! A hang ! What the heck could it be ? Anyone can help here ? Thanks in advance !
I am attaching the syslog we got and our sysctl.conf, can be useful I hope... Thanks!
regards,
Grzegorz Leskiewicz
We started to have a BIG problem with one of our proxmox sevrers. It started to hang everynight! A hardware reset is needed to re-run it. Nagios is all red for VMs, no ssh access. Just KVM and power breaker do their job. Even a shutdown from the console is not helping. FYI, we have lxc / kvm machines there -- +150 LXC / ~10 KVM.
Server has:
CPU(s): 96 x Intel(R) Xeon(R) Gold 6248R CPU @ 3.00GHz (2 Sockets)
Kernel Version: Linux 5.4.203-1-pve #1 SMP PVE 5.4.203-1 (Fri, 26 Aug 2022 14:43:35 +0200)
PVE Manager Version: pve-manager/6.4-15/af7986e6
RAM: 1.5TB
HDD: SOFT RAID-5 - 6 x 1.92TB NVME -- DATA / 2 x 960 GB SOFT RAID-1 -- SYSTEM
After a physical reset all is well during a whole day. Then at night again! A hang ! What the heck could it be ? Anyone can help here ? Thanks in advance !
I am attaching the syslog we got and our sysctl.conf, can be useful I hope... Thanks!
regards,
Grzegorz Leskiewicz