And did you follow the guide?
https://pve.proxmox.com/mediawiki/index.php?title=Upgrade_from_6.x_to_7.0&action=history
Setting tdp_mmu=N fixes the issue with the newest kernel 5.15 and Windows 11 Insider Builds.
Bad news - over the last few days I have had 3 separate instances of Win 2022 VMs and 1 Win 11 VM shutting themselves down with tdp_mmu=N configured on my 3-node lab cluster running kernel 5.15.39-1.
Going to repin kernel 5.13 for now.
Shutting themselves down or crashing with
KVM: entry failed, hardware error 0x80000021
are two quite different things. If the PVE host's kernel log doesn't show said error, it is definitively another issue and should go in its own thread.
What type of CPU? It works on an Intel(R) Xeon(R) CPU E5-2660 v2 at least, on
Linux 5.15.35-2-pve #1 SMP PVE 5.15.35-5 (Wed, 08 Jun 2022 15:02:51 +0200) x86_64 GNU/Linux.
My lab hosts are all single socket Intel Xeon E-2186G (12) @ 4.700GHz.
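To tell the two cases apart, it may help to search the host log for the exact error string quoted above. The following is only a minimal sketch: `has_kvm_entry_failure` is a hypothetical helper name (not a Proxmox tool), and it assumes the message reaches whatever log you feed it, for example the journal of the current boot.

```shell
#!/bin/sh
# Sketch: succeed (exit 0) if a log contains the KVM entry-failure signature.
# has_kvm_entry_failure is a hypothetical helper, not part of Proxmox.
has_kvm_entry_failure() {
    # $1: optional log file; when omitted, the log is read from stdin
    pattern='KVM: entry failed, hardware error 0x80000021'
    if [ -n "$1" ]; then
        grep -qF -- "$pattern" "$1"
    else
        grep -qF -- "$pattern"
    fi
}

# Possible usage on the PVE host (assumes journalctl is available):
#   journalctl -b | has_kvm_entry_failure && echo "entry failure logged"
```

If the function reports nothing for the time of the crash, the shutdown is probably a different problem from the one discussed here.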
I came across an interesting discovery. I have 2 Proxmox servers with identical CPUs. One had Windows 2022 server VMs randomly crashing on it, sometimes a few times a day. I set tdp_mmu=N on that one and the crashes stopped. A few weeks later, I needed to migrate the VMs from that server to the other one, which has the identical CPU but did not have the tdp_mmu=N option set. What's interesting is that the VMs have now been running stable without any crashes for a few weeks on this new Proxmox server.
The only difference between the 2 servers is that one was upgraded to Proxmox v7 from v6, while the other was a brand new install of v7. The one upgraded from v6 had the random crashes. I was puzzled by the difference, but was surprised to find everything running well on the brand new install of v7 without tdp_mmu=N set. I'm not sure why, but that's what I found.
kvm tdp_mmu=N
If you add the option to the kernel commandline (as opposed to adding it in a file in /etc/modprobe.d), you need to put a dot between module name and option:
kvm.tdp_mmu=N
You can verify that the setting is set correctly with
Code:
cat /sys/module/kvm/parameters/tdp_mmu
I hope this helps!
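The check above can be wrapped in a small script that also catches the missing-dot mistake on the kernel commandline. This is a sketch under assumptions: the function names `tdp_mmu_state` and `cmdline_disables_tdp_mmu` are made up for illustration, and only the `N`, `n` and `0` spellings of the value are recognised as "off".

```shell
#!/bin/sh
# Hypothetical helper: report the current state of KVM's tdp_mmu parameter.
# $1: parameter file (defaults to the sysfs path used in the post above)
tdp_mmu_state() {
    param_file="${1:-/sys/module/kvm/parameters/tdp_mmu}"
    if [ ! -r "$param_file" ]; then
        echo "unknown"
        return 1
    fi
    case "$(cat "$param_file")" in
        Y|y|1) echo "enabled" ;;
        N|n|0) echo "disabled" ;;
        *)     echo "unknown"; return 1 ;;
    esac
}

# Hypothetical helper: succeed if the kernel commandline contains the
# dotted form kvm.tdp_mmu=N as a single word.
# $1: commandline file (defaults to /proc/cmdline)
cmdline_disables_tdp_mmu() {
    cmdline_file="${1:-/proc/cmdline}"
    # One word per line, then require an exact whole-line match
    tr ' ' '\n' < "$cmdline_file" | grep -qx 'kvm\.tdp_mmu=[Nn0]'
}

# Possible usage on the host:
#   tdp_mmu_state
#   cmdline_disables_tdp_mmu || echo "kvm.tdp_mmu=N not on the commandline"
```

Note that a file under /etc/modprobe.d uses the other spelling, `options kvm tdp_mmu=N`, so the second function only says something about the commandline route.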
Are there any changes to the issue on 5.15.39-2-pve?
No, currently you need to disable two-dimensional paging for the MMU (tdp_mmu) manually if your setup is affected, or better, first check that you have the newest BIOS/firmware and CPU microcode installed, as then you may not even require the workaround anymore.
[...] KVM: entry failed, hardware error 0x80000021 in logs, but the result is the same as people experience in this thread.
20 x Intel(R) Core(TM) i9-10900 CPU @ 2.80GHz (1 Socket), 48 GB RAM
Many thanks for such information, this can be valuable for nailing down the actual range of models possibly affected and also possibly the underlying issue, which could help in either avoiding the bug or at least automatically disabling the new feature.
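For the microcode check suggested above, and when reporting an affected CPU model, the running microcode revision can be read from /proc/cpuinfo. This is a rough sketch: `microcode_revision` is a hypothetical helper name, and the script only prints what is currently loaded. Whether that is the newest revision still has to be compared against the CPU vendor's releases by hand.

```shell
#!/bin/sh
# Hypothetical helper: print the microcode revision of the first CPU entry.
# $1: cpuinfo file (defaults to /proc/cpuinfo)
microcode_revision() {
    cpuinfo="${1:-/proc/cpuinfo}"
    # Lines look like "microcode<TAB>: 0xf0"; print the value and stop
    awk -F': *' '/^microcode/ { print $2; exit }' "$cpuinfo"
}

# Possible usage on the host, together with the kernel version:
#   echo "kernel: $(uname -r), microcode: $(microcode_revision)"
```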
I seem to have the same problem on one of 2 Windows 11 Pro VMs.
The one that has the problem has the following features enabled:
- Virtual Machine Platform
- Windows Hypervisor Platform
- Windows Subsystem for Linux
And the VM that is stable doesn't have those 3 features enabled.
I will soon uninstall those 3 features on the faulty Windows 11 Pro VM and report back in a couple of days.
I will not be disabling tdp_mmu on my Proxmox server, because I want to check whether it's really those three features that cause the problem. Right now those 3 features are the only difference between the 2 Windows 11 Pro VMs.
My Proxmox Host runs on:
- Intel(R) Core(TM) i7-10700F CPU @ 2.90GHz
- Linux 5.15.39-1-pve #1 SMP PVE 5.15.39-1
- 128 GB RAM
- 2 TB Samsung 980 Pro NVME