Windows VM gets unresponsive

carlosmlribeiro

New Member
Jun 26, 2022
Hi,

Every so often my Windows VM becomes unresponsive: at best it runs for a couple of weeks, but this time it lasted just 3 days. When it happens I can't reach it at all, not even to reboot it from the Proxmox console.

Any ideas what I might have done wrong?

This is my config:
Code:
# qm config 100
agent: 1
boot: order=virtio1
cores: 4
cpu: host
hostpci0: 00:02.0,mdev=i915-GVTg_V5_4
ide2: local:iso/virtio-win-0.1.185.iso,media=cdrom,size=402812K
machine: q35
memory: 8192
name: windows10
net0: e1000=8A:B9:6E:32:64:13,bridge=vmbr0
numa: 1
onboot: 1
ostype: win10
scsihw: virtio-scsi-pci
smbios1: uuid=b9c75bd1-4b80-4d34-88ca-8b9250d2a5d9
sockets: 1
vga: memory=512
virtio0: Storage:vm-100-disk-0,size=1300G
virtio1: local-lvm:vm-100-disk-0,size=64G
vmgenid: c9d41514-4317-44cc-ba80-eb3e974a430a
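
For anyone comparing their own setup against this config, here is a minimal Python sketch that picks out the mediated (mdev) passthrough devices from `qm config` style output. The function names `parse_qm_config` and `mediated_devices` are made up for this illustration and are not part of Proxmox; the sample text is a shortened copy of the config above.

```python
def parse_qm_config(text):
    """Parse `qm config` style output ("key: value" per line) into a dict."""
    config = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and the echoed command / comment lines.
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition(":")
        if sep:
            config[key.strip()] = value.strip()
    return config


def mediated_devices(config):
    """Return {hostpci key: mdev type} for passthrough entries that use mdev."""
    found = {}
    for key, value in config.items():
        if not key.startswith("hostpci"):
            continue
        # Options are comma separated, e.g. "00:02.0,mdev=i915-GVTg_V5_4".
        for option in value.split(","):
            name, sep, setting = option.partition("=")
            if sep and name == "mdev":
                found[key] = setting
    return found


# Shortened copy of the config posted above (hypothetical helper input).
sample = """\
# qm config 100
agent: 1
cpu: host
hostpci0: 00:02.0,mdev=i915-GVTg_V5_4
machine: q35
net0: e1000=8A:B9:6E:32:64:13,bridge=vmbr0
"""

config = parse_qm_config(sample)
print(mediated_devices(config))
```

In this config the only mediated device is the Intel GVT-g vGPU on `hostpci0`, which is the device the `gvt:` kernel messages in this thread refer to.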
 

Attachments

  • proxmox frozen on windows10.jpg
Hi Moayad! Thanks for pointing me in the right direction. I have a couple of thousand log entries like this one.

Any ideas what I might have done wrong in the config?

Code:
Jun 26 10:03:33 hal9000 kernel: [342983.303305] gvt: vgpu 1: fail: shadow page 0000000000000000 guest entry 0xfbc8027b7656eb7b type 9
Jun 26 10:03:33 hal9000 kernel: [342983.303307] gvt: vgpu 1: fail: spt 00000000687862be guest entry 0xfbc8027b7656eb7b type 9
Jun 26 10:03:33 hal9000 kernel: [342983.303308] gvt: vgpu 1: fail: shadow page 00000000687862be guest entry 0xfbc8027b7656eb7b type 9.
Jun 26 10:03:33 hal9000 kernel: [342983.303309] gvt: guest page write error, gpa 20127d78
Jun 26 10:03:33 hal9000 kernel: [342983.303317] ------------[ cut here ]------------
Jun 26 10:03:33 hal9000 kernel: [342983.303318] invalid entry type
Jun 26 10:03:33 hal9000 kernel: [342983.303370] WARNING: CPU: 1 PID: 2320 at drivers/gpu/drm/i915/gvt/gtt.c:683 ppgtt_populate_spt+0x1f1/0x3a0 [i915]
Jun 26 10:03:33 hal9000 kernel: [342983.303371] Modules linked in: veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 aufs iptable_filter bpfilter overlay softdog nfnetlink_log nfnetlink intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp zfs(PO) kvm_intel crct10dif_pclmul crc32_pclmul ghash_clmulni_intel zunicode(PO) aesni_intel zlua(PO) crypto_simd zavl(PO) cryptd icp(PO) glue_helper mei_hdcp snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic snd_sof_pci snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_intel_hda snd_sof_intel_byt snd_sof_intel_ipc intel_cstate snd_sof snd_sof_xtensa_dsp snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi ledtrig_audio snd_soc_core snd_compress ac97_bus snd_pcm_dmaengine snd_hda_intel snd_intel_dspcfg pcspkr eeepc_wmi snd_hda_codec asus_wmi sparse_keymap snd_hda_core
Jun 26 10:03:33 hal9000 kernel: [342983.303383]  snd_hwdep wmi_bmof snd_pcm snd_timer snd mxm_wmi acpi_tad acpi_pad soundcore cdc_acm mei_me mei mac_hid zcommon(PO) znvpair(PO) spl(O) vhost_net vhost tap ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi kvmgt i915 vfio_mdev mdev drm_kms_helper drm i2c_algo_bit fb_sys_fops syscopyarea sysfillrect sysimgblt kvm vfio_pci nfsd vfio_virqfd auth_rpcgss irqbypass nfs_acl vfio_iommu_type1 lockd vfio grace sunrpc ip_tables x_tables autofs4 btrfs xor zstd_compress raid6_pq dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c r8169 xhci_pci ahci realtek xhci_hcd libahci wmi video
Jun 26 10:03:33 hal9000 kernel: [342983.303399] CPU: 1 PID: 2320 Comm: kvm Tainted: P           O      5.4.73-1-pve #1
Jun 26 10:03:33 hal9000 kernel: [342983.303400] Hardware name: ASUS System Product Name/PRIME B460M-K, BIOS 0402 03/13/2020
Jun 26 10:03:33 hal9000 kernel: [342983.303419] RIP: 0010:ppgtt_populate_spt+0x1f1/0x3a0 [i915]
Jun 26 10:03:33 hal9000 kernel: [342983.303420] Code: 18 49 8b 01 48 8b 88 10 4a 00 00 8b 45 b8 8d 50 fa 83 e2 fb 74 08 83 e8 01 83 f8 0b 76 13 48 c7 c7 3a cc a3 c0 e8 d0 0f 0d f3 <0f> 0b e9 8c fe ff ff 4d 8b 47 38 48 8b 41 08 48 89 da 31 c9 48 8d
Jun 26 10:03:33 hal9000 kernel: [342983.303420] RSP: 0018:ffffa93a053bf738 EFLAGS: 00010282
Jun 26 10:03:33 hal9000 kernel: [342983.303421] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000006
Jun 26 10:03:33 hal9000 kernel: [342983.303422] RDX: 0000000000000007 RSI: 0000000000000082 RDI: ffff8dc24f2578c0
Jun 26 10:03:33 hal9000 kernel: [342983.303422] RBP: ffffa93a053bf7a8 R08: 000000000000e1cf R09: 0000000000000004
Jun 26 10:03:33 hal9000 kernel: [342983.303423] R10: 0000000000000000 R11: 0000000000000001 R12: ffffffffc0a02080
Jun 26 10:03:33 hal9000 kernel: [342983.303423] R13: 0000000000001000 R14: ffffffffc0a497e0 R15: ffff8dbf2ff14840
Jun 26 10:03:33 hal9000 kernel: [342983.303424] FS:  00007f34b3bff700(0000) GS:ffff8dc24f240000(0000) knlGS:00000000002eb000
Jun 26 10:03:33 hal9000 kernel: [342983.303424] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 26 10:03:33 hal9000 kernel: [342983.303425] CR2: 0000023ed7750088 CR3: 00000003d7754002 CR4: 00000000003626e0
Jun 26 10:03:33 hal9000 kernel: [342983.303425] DR0: 000000000443f043 DR1: 0000000000000000 DR2: 0000000000000000
Jun 26 10:03:33 hal9000 kernel: [342983.303426] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Jun 26 10:03:33 hal9000 kernel: [342983.303426] Call Trace:
Jun 26 10:03:33 hal9000 kernel: [342983.303444]  ppgtt_populate_spt_by_guest_entry+0xfb/0x420 [i915]
Jun 26 10:03:33 hal9000 kernel: [342983.303459]  ppgtt_handle_guest_write_page_table+0x353/0x4a0 [i915]
Jun 26 10:03:33 hal9000 kernel: [342983.303461]  ? kvmgt_rw_gpa+0x90/0x120 [kvmgt]
Jun 26 10:03:33 hal9000 kernel: [342983.303477]  ppgtt_write_protection_handler+0x260/0x290 [i915]
Jun 26 10:03:33 hal9000 kernel: [342983.303491]  intel_vgpu_page_track_handler+0x63/0x9c [i915]
Jun 26 10:03:33 hal9000 kernel: [342983.303493]  kvmgt_page_track_write+0x6a/0x70 [kvmgt]
Jun 26 10:03:33 hal9000 kernel: [342983.303508]  kvm_page_track_write+0x82/0xc0 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303516]  emulator_write_phys+0x3b/0x50 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303524]  write_emulate+0xe/0x10 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303531]  emulator_read_write_onepage+0xfc/0x320 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303539]  emulator_read_write+0xd6/0x190 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303546]  emulator_write_emulated+0x15/0x20 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303555]  segmented_write+0x5d/0x80 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303564]  writeback+0x11e/0x2e0 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303572]  x86_emulate_insn+0x990/0x1050 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303580]  x86_emulate_instruction+0x350/0x710 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303588]  kvm_mmu_page_fault+0xeb/0x5f0 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303590]  ? __switch_to_asm+0x40/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303591]  ? __switch_to_asm+0x34/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303592]  ? __switch_to_asm+0x34/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303593]  ? __switch_to_asm+0x40/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303594]  ? __switch_to_asm+0x34/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303595]  ? __switch_to_asm+0x40/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303595]  ? __switch_to_asm+0x34/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303596]  ? __switch_to_asm+0x34/0x70
Jun 26 10:03:33 hal9000 kernel: [342983.303598]  ? __check_object_size+0x4d/0x17c
Jun 26 10:03:33 hal9000 kernel: [342983.303602]  handle_ept_violation+0xf6/0x250 [kvm_intel]
Jun 26 10:03:33 hal9000 kernel: [342983.303604]  vmx_handle_exit+0xa5/0x7d0 [kvm_intel]
Jun 26 10:03:33 hal9000 kernel: [342983.303612]  vcpu_enter_guest+0x31f/0x1600 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303620]  ? restart_apic_timer+0x72/0x190 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303628]  kvm_arch_vcpu_ioctl_run+0x100/0x570 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303634]  kvm_vcpu_ioctl+0x24b/0x610 [kvm]
Jun 26 10:03:33 hal9000 kernel: [342983.303635]  ? do_futex+0xc7/0xc50
Jun 26 10:03:33 hal9000 kernel: [342983.303638]  ? __wake_up_locked_key+0x1b/0x20
Jun 26 10:03:33 hal9000 kernel: [342983.303639]  do_vfs_ioctl+0xa9/0x640
Jun 26 10:03:33 hal9000 kernel: [342983.303640]  ksys_ioctl+0x67/0x90
Jun 26 10:03:33 hal9000 kernel: [342983.303641]  __x64_sys_ioctl+0x1a/0x20
Jun 26 10:03:33 hal9000 kernel: [342983.303642]  do_syscall_64+0x57/0x190
Jun 26 10:03:33 hal9000 kernel: [342983.303643]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Jun 26 10:03:33 hal9000 kernel: [342983.303644] RIP: 0033:0x7f36c47b5427
Jun 26 10:03:33 hal9000 kernel: [342983.303645] Code: 00 00 90 48 8b 05 69 aa 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 39 aa 0c 00 f7 d8 64 89 01 48
Jun 26 10:03:33 hal9000 kernel: [342983.303646] RSP: 002b:00007f34b3bf98b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Jun 26 10:03:33 hal9000 kernel: [342983.303647] RAX: ffffffffffffffda RBX: 000000000000ae80 RCX: 00007f36c47b5427
Jun 26 10:03:33 hal9000 kernel: [342983.303647] RDX: 0000000000000000 RSI: 000000000000ae80 RDI: 000000000000001c
Jun 26 10:03:33 hal9000 kernel: [342983.303648] RBP: 0000000000000000 R08: 000055e31e351bd0 R09: 00007f34b3bf95a0
Jun 26 10:03:33 hal9000 kernel: [342983.303648] R10: 0000000000000001 R11: 0000000000000246 R12: 00007f34b6c6e8c0
Jun 26 10:03:33 hal9000 kernel: [342983.303648] R13: 000055e31e317b60 R14: 00007f36b9881000 R15: 0000000000000000
Jun 26 10:03:33 hal9000 kernel: [342983.303650] ---[ end trace 05859726d179adc7 ]---
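
With thousands of entries like these, a count per vGPU can be easier to read than the raw log. The following is only a rough sketch, assuming the kernel log has been saved as text lines in the format shown above; `summarize_gvt` and both regular expressions are my own illustration, not an existing tool.

```python
import re
from collections import Counter

# Matches kernel log lines such as:
#   "... kernel: [342983.303305] gvt: vgpu 1: fail: shadow page ..."
#   "... kernel: [342983.303309] gvt: guest page write error, gpa 20127d78"
GVT_LINE = re.compile(r"kernel: \[\s*\d+\.\d+\] gvt: (?P<msg>.*)")
VGPU_PREFIX = re.compile(r"^vgpu (?P<id>\d+): ")


def summarize_gvt(lines):
    """Count gvt messages per vGPU id ("-" when no vGPU id is logged)."""
    per_vgpu = Counter()
    for line in lines:
        match = GVT_LINE.search(line)
        if not match:
            continue  # not a gvt message (e.g. the stack trace lines)
        vgpu = VGPU_PREFIX.match(match.group("msg"))
        per_vgpu[vgpu.group("id") if vgpu else "-"] += 1
    return per_vgpu


# A few lines copied from the log above.
sample_log = [
    "Jun 26 10:03:33 hal9000 kernel: [342983.303305] gvt: vgpu 1: fail: shadow page 0000000000000000 guest entry 0xfbc8027b7656eb7b type 9",
    "Jun 26 10:03:33 hal9000 kernel: [342983.303307] gvt: vgpu 1: fail: spt 00000000687862be guest entry 0xfbc8027b7656eb7b type 9",
    "Jun 26 10:03:33 hal9000 kernel: [342983.303308] gvt: vgpu 1: fail: shadow page 00000000687862be guest entry 0xfbc8027b7656eb7b type 9.",
    "Jun 26 10:03:33 hal9000 kernel: [342983.303309] gvt: guest page write error, gpa 20127d78",
    "Jun 26 10:03:33 hal9000 kernel: [342983.303318] invalid entry type",
]

counts = summarize_gvt(sample_log)
print(dict(counts))
```

To run it over a whole log, pass the lines of a saved log file to the same function instead of `sample_log`.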
 
Here is the output of pveversion -v:
Code:
proxmox-ve: 6.2-2 (running kernel: 5.4.73-1-pve)
pve-manager: 6.2-15 (running version: 6.2-15/48bd51b6)
pve-kernel-5.4: 6.3-1
pve-kernel-helper: 6.3-1
pve-kernel-5.4.73-1-pve: 5.4.73-1
pve-kernel-5.4.34-1-pve: 5.4.34-2
ceph: 14.2.15-pve1
ceph-fuse: 14.2.15-pve1
corosync: 3.0.4-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.16-pve1
libproxmox-acme-perl: 1.0.5
libpve-access-control: 6.1-3
libpve-apiclient-perl: 3.0-3
libpve-common-perl: 6.2-4
libpve-guest-common-perl: 3.1-3
libpve-http-server-perl: 3.0-6
libpve-storage-perl: 6.2-10
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.3-1
lxcfs: 4.0.3-pve3
novnc-pve: 1.1.0-1
proxmox-backup-client: 1.0.1-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.3-10
pve-cluster: 6.2-1
pve-container: 3.2-3
pve-docs: 6.2-6
pve-edk2-firmware: 2.20200531-1
pve-firewall: 4.1-3
pve-firmware: 3.1-3
pve-ha-manager: 3.1-1
pve-i18n: 2.2-2
pve-qemu-kvm: 5.1.0-6
pve-xtermjs: 4.7.0-2
qemu-server: 6.2-20
smartmontools: 7.1-pve2
spiceterm: 3.1-1
vncterm: 1.6-2
zfsutils-linux: 0.8.5-pve1
 
Thanks, Moayad. I've been procrastinating on that for a while; I guess it's time to prioritize it. I'll let you know the results.

Again, thanks for your help.