Tonight another Proxmox VE server, updated two days ago and running two Windows Server 2022 VMs, had the same problem: one VM shut down at 01:43 AM. I am now going back to the previous kernel (proxmox-boot-tool kernel pin 5.13.19-6-pve), which solved the problem on the other three Proxmox VE servers that I manage.
But I would really like to understand what the current official fix for this problem is, or which steps need to be taken. Since these updates are available in the enterprise repo, I expected them to be well tested, so that these things would not happen, or at least that a fix would be produced quickly. Instead we have had this problem for more than 20 days. I am waiting for a comment from the Proxmox staff, thanks.
I enclose the configuration of the host on which the VM shutdown occurs:
proxmox-ve: 7.2-1 (running kernel: 5.15.35-2-pve)
pve-manager: 7.2-4 (running version: 7.2-4 / ca9d43cc)
pve-kernel-5.15: 7.2-4
pve-kernel-helper: 7.2-4
pve-kernel-5.13: 7.1-9
pve-kernel-5.15.35-2-pve: 5.15.35-5
pve-kernel-5.13.19-6-pve: 5.13.19-15
pve-kernel-5.13.19-2-pve: 5.13.19-4
ceph-fuse: 15.2.15-pve1
corosync: 3.1.5-pve2
criu: 3.15-1+pve-1
glusterfs-client: 9.2-1
ifupdown2: 3.1.0-1+pmx3
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-1
libknet1: 1.22-pve2
libproxmox-acme-perl: 1.4.2
libproxmox-backup-qemu0: 1.3.1-1
libpve-access-control: 7.2-2
libpve-apiclient-perl: 3.2-1
libpve-common-perl: 7.2-2
libpve-guest-common-perl: 4.1-2
libpve-http-server-perl: 4.1-2
libpve-storage-perl: 7.2-4
libspice-server1: 0.14.3-2.1
lvm2: 2.03.11-2.1
lxc-pve: 4.0.12-1
lxcfs: 4.0.12-pve1
novnc-pve: 1.3.0-3
proxmox-backup-client: 2.2.1-1
proxmox-backup-file-restore: 2.2.1-1
proxmox-mini-journalreader: 1.3-1
proxmox-widget-toolkit: 3.5.1
pve-cluster: 7.2-1
pve-container: 4.2-1
pve-docs: 7.2-2
pve-edk2-firmware: 3.20210831-2
pve-firewall: 4.2-5
pve-firmware: 3.4-2
pve-ha-manager: 3.3-4
pve-i18n: 2.7-2
pve-qemu-kvm: 6.2.0-8
pve-xtermjs: 4.16.0-1
qemu-server: 7.2-3
smartmontools: 7.2-pve3
spiceterm: 3.2-2
swtpm: 0.7.1~bpo11+1
vncterm: 1.7-1
zfsutils-linux: 2.1.4-pve1
----------------------------------------------
And this is the syslog:
Jun 17 00:02:45 italprox smartd[808]: Device: /dev/bus/0 [megaraid_disk_02] [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 65 to 64
Jun 17 00:10:26 italprox rsyslogd[806]: [origin software="rsyslogd" swVersion="8.2102.0" x-pid="806" x-info="https://www.rsyslog.com"] rsyslogd was HUPed
Jun 17 00:13:41 italprox pvescheduler[233409]: INFO: Finished Backup of VM 1050 (00:26:31)
Jun 17 00:13:41 italprox pvescheduler[233409]: INFO: Backup job finished successfully
Jun 17 00:17:01 italprox CRON[241732]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)
Jun 17 00:17:01 italprox CRON[241733]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jun 17 00:17:01 italprox CRON[241732]: pam_unix(cron:session): session closed for user root
Jun 17 00:32:45 italprox smartd[808]: Device: /dev/bus/0 [megaraid_disk_02] [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 64 to 65
Jun 17 01:17:01 italprox CRON[252475]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)
Jun 17 01:17:01 italprox CRON[252476]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jun 17 01:17:01 italprox CRON[252475]: pam_unix(cron:session): session closed for user root
Jun 17 01:43:37 italprox kernel: set kvm_intel.dump_invalid_vmcs=1 to dump internal KVM state.
Jun 17 01:43:37 italprox QEMU[1187]: KVM: entry failed, hardware error 0x80000021
Jun 17 01:43:37 italprox QEMU[1187]: If you're running a guest on an Intel machine without unrestricted mode
Jun 17 01:43:37 italprox QEMU[1187]: support, the failure can be most likely due to the guest entering an invalid
Jun 17 01:43:37 italprox QEMU[1187]: state for Intel VT. For example, the guest maybe running in big real mode
Jun 17 01:43:37 italprox QEMU[1187]: which is not supported on less recent Intel processors.
Jun 17 01:43:37 italprox QEMU[1187]: EAX=001a0f30 EBX=c29a0180 ECX=00000000 EDX=00000000
Jun 17 01:43:37 italprox QEMU[1187]: ESI=c29ac440 EDI=698ea080 EBP=226b5690 ESP=226b54b0
Jun 17 01:43:37 italprox QEMU[1187]: EIP=00008000 EFL=00000002 [-------] CPL=0 II=0 A20=1 SMM=1 HLT=0
Jun 17 01:43:37 italprox QEMU[1187]: ES =0000 00000000 ffffffff 00809300
Jun 17 01:43:37 italprox QEMU[1187]: CS =ae00 7ffae000 ffffffff 00809300
Jun 17 01:43:37 italprox QEMU[1187]: SS =0000 00000000 ffffffff 00809300
Jun 17 01:43:37 italprox QEMU[1187]: DS =0000 00000000 ffffffff 00809300
Jun 17 01:43:37 italprox QEMU[1187]: FS =0000 00000000 ffffffff 00809300
Jun 17 01:43:37 italprox QEMU[1187]: GS =0000 00000000 ffffffff 00809300
Jun 17 01:43:37 italprox QEMU[1187]: LDT=0000 00000000 000fffff 00000000
Jun 17 01:43:37 italprox QEMU[1187]: TR =0040 c29b0000 00000067 00008b00
Jun 17 01:43:37 italprox QEMU[1187]: GDT= c29b1fb0 00000057
Jun 17 01:43:37 italprox QEMU[1187]: IDT= 00000000 00000000
Jun 17 01:43:37 italprox QEMU[1187]: CR0=00050032 CR2=cc7de000 CR3=001ae000 CR4=00000000
Jun 17 01:43:37 italprox QEMU[1187]: DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000
Jun 17 01:43:37 italprox QEMU[1187]: DR6=00000000ffff0ff0 DR7=0000000000000400
Jun 17 01:43:37 italprox QEMU[1187]: EFER=0000000000000000
Jun 17 01:43:37 italprox QEMU[1187]: Code=kvm: ../hw/core/cpu-sysemu.c:77: cpu_asidx_from_attrs: Assertion `ret < cpu->num_ases && ret >= 0' failed.
Jun 17 01:43:37 italprox kernel: fwbr1010i0: port 2(tap1010i0) entered disabled state
Jun 17 01:43:37 italprox kernel: fwbr1010i0: port 2(tap1010i0) entered disabled state
Jun 17 01:43:37 italprox systemd[1]: 1010.scope: Succeeded.
Jun 17 01:43:37 italprox systemd[1]: 1010.scope: Consumed 4h 49min 19.701s CPU time.
Jun 17 01:43:38 italprox qmeventd[257151]: Starting cleanup for 1010
Jun 17 01:43:38 italprox kernel: fwbr1010i0: port 1(fwln1010i0) entered disabled state
Jun 17 01:43:38 italprox kernel: vmbr0: port 2(fwpr1010p0) entered disabled state
Jun 17 01:43:38 italprox kernel: device fwln1010i0 left promiscuous mode
Jun 17 01:43:38 italprox kernel: fwbr1010i0: port 1(fwln1010i0) entered disabled state
Jun 17 01:43:38 italprox kernel: device fwpr1010p0 left promiscuous mode
Jun 17 01:43:38 italprox kernel: vmbr0: port 2(fwpr1010p0) entered disabled state
Jun 17 01:43:38 italprox qmeventd[257151]: Finished cleanup for 1010
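For anyone who wants to check their other hosts for the same crash, here is a minimal Python sketch that scans syslog text for the "KVM: entry failed, hardware error" line and reports the timestamp, host, QEMU PID and error code. The function name and the sample text are only illustrative (the sample lines are copied from the log above), and it assumes the classic syslog line format shown here; reading from /var/log/syslog is likewise an assumption about where the log lives on your system.

```python
import re

# Pattern for the crash line as it appears in the syslog excerpt above,
# e.g. "Jun 17 01:43:37 italprox QEMU[1187]: KVM: entry failed, hardware error 0x80000021"
CRASH_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s[\d:]{8})\s(?P<host>\S+)\sQEMU\[(?P<pid>\d+)\]:\s"
    r"KVM: entry failed, hardware error (?P<code>0x[0-9a-fA-F]+)"
)


def find_kvm_entry_failures(log_text):
    """Return (timestamp, host, qemu_pid, error_code) for each crash line found."""
    hits = []
    for line in log_text.splitlines():
        m = CRASH_RE.match(line.strip())
        if m:
            hits.append((m["ts"], m["host"], int(m["pid"]), m["code"]))
    return hits


# Sample lines copied from the log above (hypothetical test input).
SAMPLE = """\
Jun 17 01:17:01 italprox CRON[252475]: pam_unix(cron:session): session closed for user root
Jun 17 01:43:37 italprox kernel: set kvm_intel.dump_invalid_vmcs=1 to dump internal KVM state.
Jun 17 01:43:37 italprox QEMU[1187]: KVM: entry failed, hardware error 0x80000021
Jun 17 01:43:37 italprox systemd[1]: 1010.scope: Succeeded.
"""

for ts, host, pid, code in find_kvm_entry_failures(SAMPLE):
    print(f"{ts} {host}: QEMU pid {pid} failed VM entry with {code}")

# To scan a real log instead (path is an assumption, adjust as needed):
#   with open("/var/log/syslog", errors="replace") as fh:
#       print(find_kvm_entry_failures(fh.read()))
```

Note that the QEMU PID in the crash line is not the VM ID; in the log above the affected VM (1010) only shows up in the following systemd and qmeventd lines.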