Hi, I am a teacher and I work at a school. In the computer lab, I set up a server for system restoration using FOG and a file server using OpenMediaVault. Until now, they were running on separate machines. I decided to virtualize them using Proxmox. I used a computer with an MSI x370 gaming pro carbon motherboard, an AMD Ryzen 7 1700X processor, and 48GB of RAM (2x16GB + 2x8GB). I started two virtual machines, FOG and OMV (16GB RAM, 2 cores), configured them, and everything seemed to work, but from the very beginning, Proxmox occasionally "freezes." Nothing can be done, the web GUI and SSH do not work. I have to reset it. On the monitor connected to Proxmox, the following error appears:
root@pve25: # [11218.538901] igb 0000:21:00.0"eno1: NETDEV WATCHDOG: CPU: 11: transmit queue 1 timed out 5455 ms
[11260.804462] watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [kworker/3:1:153]
[11256.290168] nmi_backtrace_stall_check: CPU 0: NMIs are not reaching exc_nmi() handler, last activity: 941852 jiffies ago.
[11267.168444] watchdog: Watchdog detected hard LOCKUP on cpu 11
[11266.294210] nmi_backtrace_stall_check: CPU 4: NMIs are not reaching exc_nmi() handler, last activity: 1834999 jiffies ago.
[11286.302450] nmi_backtrace_stall_check: CPU 8: NMIs are not reaching
[11276.298415] nmi_backtrace_stall_check: CPU 6: NMIs are not reaching exc_nmi() handler, last activity: 936681 jiffies ago.
Both virtual machines do not even exceed half of the CPU and memory resources. I updated the BIOS on the motherboard, disabled the Ballooning Device option for memory, but it didn't help. I removed 2x8GB RAM, and still nothing. Recently, another error appeared, after which I also couldn't do anything but reset:
root@pve25: # [ 9734.619739] nmi_backtrace_stall_check: CPU 0: NMIs are not reaching exc_nmi() handler, last activity: 1557393 jiffies ago.
[10304.630894] nmi_backtrace_stall_check: CPU 0: NMIs are not reaching exc_nmi() handler, last activity: 2127422 jiffies ago.
I tried almost all the solutions provided on this forum for similar cases, but none of them helped.
What else can I do? Could it be hardware issues?
root@pve25: # [11218.538901] igb 0000:21:00.0"eno1: NETDEV WATCHDOG: CPU: 11: transmit queue 1 timed out 5455 ms
[11260.804462] watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [kworker/3:1:153]
[11256.290168] nmi_backtrace_stall_check: CPU 0: NMIs are not reaching exc_nmi() handler, last activity: 941852 jiffies ago.
[11267.168444] watchdog: Watchdog detected hard LOCKUP on cpu 11
[11266.294210] nmi_backtrace_stall_check: CPU 4: NMIs are not reaching exc_nmi() handler, last activity: 1834999 jiffies ago.
[11286.302450] nmi_backtrace_stall_check: CPU 8: NMIs are not reaching
[11276.298415] nmi_backtrace_stall_check: CPU 6: NMIs are not reaching exc_nmi() handler, last activity: 936681 jiffies ago.
Both virtual machines do not even exceed half of the CPU and memory resources. I updated the BIOS on the motherboard, disabled the Ballooning Device option for memory, but it didn't help. I removed 2x8GB RAM, and still nothing. Recently, another error appeared, after which I also couldn't do anything but reset:
root@pve25: # [ 9734.619739] nmi_backtrace_stall_check: CPU 0: NMIs are not reaching exc_nmi() handler, last activity: 1557393 jiffies ago.
[10304.630894] nmi_backtrace_stall_check: CPU 0: NMIs are not reaching exc_nmi() handler, last activity: 2127422 jiffies ago.
I tried almost all the solutions provided on this forum for similar cases, but none of them helped.
What else can I do? Could it be hardware issues?