Freeze = Proxmox webgui not accessible, no ping reply, no "DOS" screen with monitor attached. But the physical pc is still running (I can hear the cpu fan spinning)What does "freeze" means exactly? Anything in the logs (e.g.,journalctl -b
)
What network card is running there? Maybe one using the igb driver? (e.g., check withlspci -k
)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection (2) I219-LM (rev 31)
DeviceName: Onboard Lan
Subsystem: Hewlett-Packard Company Ethernet Connection (2) I219-LM
Kernel driver in use: e1000e
Kernel modules: e1000e
Kernel is probably easier to test by simply rebooting into the old one (select on boot or use kernel pin tool)The big question is : is it the new kernel or is it the BIOS update?
Kernel is probably easier to test by simply rebooting into the old one (select on boot or use kernel pin tool)
The VM machine version has nothing to do with the kernel, only with the installed QEMU version.I noticed with the 6.2.1-9 kernel that all my VMs could upgrade from 6.x to 7.2 (vm > Hardware > Machine > pc-i440fx-7.2).
Do I have to adjust this setting again when downgrading?
What network card is running there? Maybe one using the igb driver? (e.g., check withlspci -k
)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection I219-LM (rev 21)
Subsystem: Intel Corporation Ethernet Connection I219-LM
Kernel driver in use: e1000e
Kernel modules: e1000e
01:00.0 Ethernet controller: Intel Corporation I211 Gigabit Network Connection (rev 03)
Subsystem: Holco Enterprise Co, Ltd/Shuttle Computer I211 Gigabit Network Connection
Kernel driver in use: igb
Kernel modules: igb
We identified a potential deadlock with the igb driver in the 5.15.104-1 kernel due to a backported patch, see this thread.What exactly do you mean by that, and is the igb driver good or bad (performance wise)?
It is fixed in the latest 6.2.11-1-pve.I updated my 6.2.6-1-pve kernel to the latest 6.2.9-1-pve today and it brokeintel_gpu_top
with the error:Failed to initialize PMU! (No such file or directory)
I went back to 6.2.6-1 and it works again.
Great, thanks for letting me know.It is fixed in the latest 6.2.11-1-pve.
# uname -a
Linux accod 6.2.11-1-pve #1 SMP PREEMPT_DYNAMIC PVE 6.2.11-1 (2023-04-20T09:59Z) x86_64 GNU/Linux
# intel_gpu_top
Failed to initialize PMU! (No such file or directory)
I noticed the same behaviour, since kernel 6.2 update, the cpu temp is always about 4-10 degrees higher then before, although the power usage stayed the same. (mind you, I also updated proxmox to 7.4-3 at the same time, so not sure if it is the proxmox update or kernel update that is causing this)By the way, the overal temperature of the unit/pc is rising when it is doing almost nothing.
Now on 65.5 C degrees (was ~50).
This pc was running perfectly and cool before the kernel and BIOS updates.
The big question is : is it the new kernel or is it the BIOS update?
Today I updated the held back packages proxmox-ve and pve-kernel-helper and this resolved the error with intel_gpu_top.It's still failing for me unfortunately:
Code:# uname -a Linux accod 6.2.11-1-pve #1 SMP PREEMPT_DYNAMIC PVE 6.2.11-1 (2023-04-20T09:59Z) x86_64 GNU/Linux # intel_gpu_top Failed to initialize PMU! (No such file or directory)
However, going back to 6.2.6-1 now doesn't resolve it. I noticed pve-firmware was updated along with the kernel so I'm wondering if that's why. The gpu is working inside containers so passthrough and the device is working correctly, just not intel_gpu_top on the host.