GPU passthrough stops rendering 3D scenes (Proxmox 5.3-11)

Tsirist

New Member
Mar 21, 2019
4
0
1
30
About a week ago I did my first tests with passthrough as I was interested in setting up a machine that my girlfriend and I could share. Last week I set up a throwaway Windows VM to test things out on, and the performance was pretty stellar.

Come this week, and I've successfully migrated our physical machines (I think), and performance seems decent until it just stops. I'm using a passthrough GPU to type this up, and outside of 3D rendering things seem to work well. 2D games like Factorio work, but after a few seconds of rendering something 3D like one of the Unigine benchmarks or DotA 2, the GPU utilization craters and the window just freezes. Sometimes the GPU utilization crawls upwards again, but this isn't reflected usefully on-screen. The game (or whatever it might be) continues to be interactive: I can perform actions, there is audio feedback, and so on, but never again shall a frame be rendered, which leads me to believe that this is an issue with the GPU passthrough somehow.

I tried setting up another throwaway VM earlier today to see if it would work again, and it had the same symptoms. If I don't pass through a card (I actually have no idea what does the rendering then), games like DotA will run, just (obviously) not well. But at least they render at that point.

There are a few differences this time around (from last week's attempt), but none that should matter as far as I know. Last week I used LVM to provide the disk to Windows. I tried that again with the throwaway Windows VM today, but no luck. Right now our machines run off of vioscsi disks (NVME and hard disks) based on /dev/by-id. The motherboard being used between this week and last week changed as well, from a Gigabyte AORUS B450 to an AORUS X470.

Booting Windows physically and performing the same tasks does not result in these issues.

I'm at a loss at this point. I've tried twiddling with more than a few things over the past two days. If this problem is familiar to anyone, I would very much appreciate any recommendations.
 
Last edited:
can you post your 'pveversion -v', the vm config and any relevant logs from the host as well as from the guest?
 
Code:
# pveversion -v
proxmox-ve: 5.3-1 (running kernel: 4.15.18-12-pve)
pve-manager: 5.3-11 (running version: 5.3-11/d4907f84)
pve-kernel-4.15: 5.3-3
pve-kernel-4.15.18-12-pve: 4.15.18-35
pve-kernel-4.15.18-11-pve: 4.15.18-34
pve-kernel-4.15.18-10-pve: 4.15.18-32
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-3
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-47
libpve-guest-common-perl: 2.0-20
libpve-http-server-perl: 2.0-12
libpve-storage-perl: 5.0-39
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.1.0-3
lxcfs: 3.0.3-pve1
novnc-pve: 1.0.0-3
proxmox-widget-toolkit: 1.0-23
pve-cluster: 5.0-33
pve-container: 2.0-35
pve-docs: 5.3-3
pve-edk2-firmware: 1.20181023-1
pve-firewall: 3.0-18
pve-firmware: 2.0-6
pve-ha-manager: 2.0-8
pve-i18n: 1.0-9
pve-libspice-server1: 0.14.1-2
pve-qemu-kvm: 2.12.1-2
pve-xtermjs: 3.10.1-2
qemu-server: 5.0-47
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.12-pve1~bpo1

Code:
# cat /etc/pve/qemu-server/101.conf
agent: 1
balloon: 0
bios: ovmf
bootdisk: scsi0
cores: 16
cpu: host,hidden=1
efidisk0: local-lvm:vm-101-disk-0,size=128K
hostpci0: 0b:00,x-vga=1,romfile=rtx_2070_d.rom,pcie=1
ide0: local:iso/Win10_1809Oct_English_x64.iso,media=cdrom,size=4956582K
ide2: local:iso/virtio-win-0.1.141.iso,media=cdrom,size=309208K
machine: q35
memory: 16384
name: radiance
net0: virtio=1E:CB:86:AA:0B:9B,bridge=vmbr0
numa: 0
ostype: win10
scsi0: /dev/disk/by-id/nvme-Samsung_SSD_970_EVO_250GB_S465NX0KB20449N,iothread=1,size=244198584K
scsi1: /dev/disk/by-id/ata-TOSHIBA_DT01ACA300_X67BJHAAS,iothread=1,size=2930266584K
scsihw: virtio-scsi-pci
smbios1: uuid=4ec8eafc-81a7-4050-a23e-0444621043f3
sockets: 1
usb0: host=1-10
usb1: host=1-9.4
vmgenid: 6667e0e0-cbd5-4275-8ba6-9f8ad3269429

Here's some of dmesg:

Code:
# dmesg -e
[Mar21 15:09] Linux version 4.15.18-12-pve (build@pve) (gcc version 6.3.0 20170516 (Debian 6.3.0-18+deb9u1)) #1 SMP PVE 4.15.18-35 (Wed, 13 Mar 2019 08:24:42 +0100) ()
[  +0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-4.15.18-12-pve root=/dev/mapper/pve-root ro quiet amd_iommu=on video=efifb:off
[  +0.000000] KERNEL supported cpus:
[  +0.000000]   Intel GenuineIntel
[  +0.000000]   AMD AuthenticAMD
[  +0.000000]   Centaur CentaurHauls
[  +0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
[  +0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
[  +0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
[  +0.000000] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256
[  +0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'compacted' format.
...
[  +0.000292] PCI host bridge to bus 0000:00
[  +0.000001] pci_bus 0000:00: root bus resource [io  0x0000-0x03af window]
[  +0.000001] pci_bus 0000:00: root bus resource [io  0x03e0-0x0cf7 window]
[  +0.000001] pci_bus 0000:00: root bus resource [io  0x03b0-0x03df window]
[  +0.000000] pci_bus 0000:00: root bus resource [io  0x0d00-0xffff window]
[  +0.000001] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
[  +0.000001] pci_bus 0000:00: root bus resource [mem 0x000c0000-0x000dffff window]
[  +0.000001] pci_bus 0000:00: root bus resource [mem 0xc0000000-0xfec2ffff window]
[  +0.000001] pci_bus 0000:00: root bus resource [mem 0xfee00000-0xffffffff window]
[  +0.000001] pci_bus 0000:00: root bus resource [bus 00-ff]
[  +0.000005] pci 0000:00:00.0: [1022:1450] type 00 class 0x060000
[  +0.000066] pci 0000:00:00.2: [1022:1451] type 00 class 0x080600
[  +0.000082] pci 0000:00:01.0: [1022:1452] type 00 class 0x060000
[  +0.000058] pci 0000:00:01.1: [1022:1453] type 01 class 0x060400
[  +0.000251] pci 0000:00:01.1: enabling Extended Tags
[  +0.000785] pci 0000:00:01.1: PME# supported from D0 D3hot D3cold
[  +0.000082] pci 0000:00:01.3: [1022:1453] type 01 class 0x060400
[  +0.000028] pci 0000:00:01.3: enabling Extended Tags
[  +0.000076] pci 0000:00:01.3: PME# supported from D0 D3hot D3cold
[  +0.000085] pci 0000:00:02.0: [1022:1452] type 00 class 0x060000
[  +0.000071] pci 0000:00:03.0: [1022:1452] type 00 class 0x060000
[  +0.000058] pci 0000:00:03.1: [1022:1453] type 01 class 0x060400
[  +0.000561] pci 0000:00:03.1: PME# supported from D0 D3hot D3cold
[  +0.000089] pci 0000:00:03.2: [1022:1453] type 01 class 0x060400
[  +0.000121] pci 0000:00:03.2: PME# supported from D0 D3hot D3cold
[  +0.000094] pci 0000:00:04.0: [1022:1452] type 00 class 0x060000
[  +0.000076] pci 0000:00:07.0: [1022:1452] type 00 class 0x060000
[  +0.000056] pci 0000:00:07.1: [1022:1454] type 01 class 0x060400
[  +0.000652] pci 0000:00:07.1: enabling Extended Tags
[  +0.000075] pci 0000:00:07.1: PME# supported from D0 D3hot D3cold
[  +0.000087] pci 0000:00:08.0: [1022:1452] type 00 class 0x060000
[  +0.000057] pci 0000:00:08.1: [1022:1454] type 01 class 0x060400
[  +0.000947] pci 0000:00:08.1: enabling Extended Tags
[  +0.000076] pci 0000:00:08.1: PME# supported from D0 D3hot D3cold
[  +0.000124] pci 0000:00:14.0: [1022:790b] type 00 class 0x0c0500
[  +0.000209] pci 0000:00:14.3: [1022:790e] type 00 class 0x060100
[  +0.000213] pci 0000:00:18.0: [1022:1460] type 00 class 0x060000
[  +0.000046] pci 0000:00:18.1: [1022:1461] type 00 class 0x060000
[  +0.000047] pci 0000:00:18.2: [1022:1462] type 00 class 0x060000
[  +0.000046] pci 0000:00:18.3: [1022:1463] type 00 class 0x060000
[  +0.000046] pci 0000:00:18.4: [1022:1464] type 00 class 0x060000
[  +0.000051] pci 0000:00:18.5: [1022:1465] type 00 class 0x060000
[  +0.000047] pci 0000:00:18.6: [1022:1466] type 00 class 0x060000
[  +0.000045] pci 0000:00:18.7: [1022:1467] type 00 class 0x060000
[  +0.000122] pci 0000:01:00.0: [144d:a808] type 00 class 0x010802
[  +0.000022] pci 0000:01:00.0: reg 0x10: [mem 0xf7a00000-0xf7a03fff 64bit]
[  +0.014928] pci 0000:00:01.1: PCI bridge to [bus 01]
[  +0.000005] pci 0000:00:01.1:   bridge window [mem 0xf7a00000-0xf7afffff]
[  +0.000097] pci 0000:02:00.0: [1022:43d0] type 00 class 0x0c0330
[  +0.000022] pci 0000:02:00.0: reg 0x10: [mem 0xf75a0000-0xf75a7fff 64bit]
[  +0.000035] pci 0000:02:00.0: enabling Extended Tags
[  +0.000047] pci 0000:02:00.0: PME# supported from D3hot D3cold
[  +0.000069] pci 0000:02:00.1: [1022:43c8] type 00 class 0x010601
[  +0.000044] pci 0000:02:00.1: reg 0x24: [mem 0xf7580000-0xf759ffff]
[  +0.000007] pci 0000:02:00.1: reg 0x30: [mem 0xf7500000-0xf757ffff pref]
[  +0.000005] pci 0000:02:00.1: enabling Extended Tags
[  +0.000035] pci 0000:02:00.1: PME# supported from D3hot D3cold
[  +0.000051] pci 0000:02:00.2: [1022:43c6] type 01 class 0x060400
[  +0.000036] pci 0000:02:00.2: enabling Extended Tags
[  +0.000038] pci 0000:02:00.2: PME# supported from D3hot D3cold
[  +0.011507] pci 0000:00:01.3: PCI bridge to [bus 02-09]
[  +0.000004] pci 0000:00:01.3:   bridge window [io  0xf000-0xffff]
[  +0.000003] pci 0000:00:01.3:   bridge window [mem 0xf7200000-0xf75fffff]
[  +0.000116] pci 0000:03:00.0: [1022:43c7] type 01 class 0x060400
[  +0.000042] pci 0000:03:00.0: enabling Extended Tags
[  +0.000047] pci 0000:03:00.0: PME# supported from D3hot D3cold
[  +0.000072] pci 0000:03:01.0: [1022:43c7] type 01 class 0x060400
[  +0.000041] pci 0000:03:01.0: enabling Extended Tags
[  +0.000045] pci 0000:03:01.0: PME# supported from D3hot D3cold
[  +0.000068] pci 0000:03:02.0: [1022:43c7] type 01 class 0x060400
[  +0.000041] pci 0000:03:02.0: enabling Extended Tags
[  +0.000045] pci 0000:03:02.0: PME# supported from D3hot D3cold
[  +0.000068] pci 0000:03:03.0: [1022:43c7] type 01 class 0x060400
[  +0.000041] pci 0000:03:03.0: enabling Extended Tags
[  +0.000045] pci 0000:03:03.0: PME# supported from D3hot D3cold
[  +0.000070] pci 0000:03:04.0: [1022:43c7] type 01 class 0x060400
[  +0.000041] pci 0000:03:04.0: enabling Extended Tags
[  +0.000044] pci 0000:03:04.0: PME# supported from D3hot D3cold
[  +0.000061] pci 0000:03:09.0: [1022:43c7] type 01 class 0x060400
[  +0.000039] pci 0000:03:09.0: enabling Extended Tags
[  +0.000041] pci 0000:03:09.0: PME# supported from D3hot D3cold
[  +0.000063] pci 0000:02:00.2: PCI bridge to [bus 03-09]
[  +0.000004] pci 0000:02:00.2:   bridge window [io  0xf000-0xffff]
[  +0.000003] pci 0000:02:00.2:   bridge window [mem 0xf7200000-0xf74fffff]
[  +0.000032] pci 0000:03:00.0: PCI bridge to [bus 04]
[  +0.000043] pci 0000:03:01.0: PCI bridge to [bus 05]
[  +0.000042] pci 0000:03:02.0: PCI bridge to [bus 06]
[  +0.000068] pci 0000:07:00.0: [8086:1539] type 00 class 0x020000
[  +0.000043] pci 0000:07:00.0: reg 0x10: [mem 0xf7400000-0xf741ffff]
[  +0.000031] pci 0000:07:00.0: reg 0x18: [io  0xf000-0xf01f]
[  +0.000016] pci 0000:07:00.0: reg 0x1c: [mem 0xf7420000-0xf7423fff]
[  +0.000161] pci 0000:07:00.0: PME# supported from D0 D3hot D3cold
[  +0.014530] pci 0000:03:03.0: PCI bridge to [bus 07]
[  +0.000006] pci 0000:03:03.0:   bridge window [io  0xf000-0xffff]
[  +0.000003] pci 0000:03:03.0:   bridge window [mem 0xf7400000-0xf74fffff]
[  +0.000079] pci 0000:08:00.0: [144d:a808] type 00 class 0x010802
[  +0.000039] pci 0000:08:00.0: reg 0x10: [mem 0xf7300000-0xf7303fff 64bit]
[  +0.011877] pci 0000:03:04.0: PCI bridge to [bus 08]
[  +0.000007] pci 0000:03:04.0:   bridge window [mem 0xf7300000-0xf73fffff]
[  +0.000064] pci 0000:09:00.0: [1b21:1343] type 00 class 0x0c0330
[  +0.000041] pci 0000:09:00.0: reg 0x10: [mem 0xf7200000-0xf7207fff 64bit]
[  +0.000067] pci 0000:09:00.0: enabling Extended Tags
[  +0.000090] pci 0000:09:00.0: PME# supported from D3hot D3cold
[  +0.011724] pci 0000:03:09.0: PCI bridge to [bus 09]
[  +0.000008] pci 0000:03:09.0:   bridge window [mem 0xf7200000-0xf72fffff]
[  +0.000122] pci 0000:0a:00.0: [10de:1f02] type 00 class 0x030000
[  +0.000030] pci 0000:0a:00.0: reg 0x10: [mem 0xf6000000-0xf6ffffff]
[  +0.000014] pci 0000:0a:00.0: reg 0x14: [mem 0xe0000000-0xefffffff 64bit pref]
[  +0.000013] pci 0000:0a:00.0: reg 0x1c: [mem 0xf0000000-0xf1ffffff 64bit pref]
[  +0.000009] pci 0000:0a:00.0: reg 0x24: [io  0xe000-0xe07f]
[  +0.000009] pci 0000:0a:00.0: reg 0x30: [mem 0xf7000000-0xf707ffff pref]
[  +0.000015] pci 0000:0a:00.0: BAR 1: assigned to efifb
[  +0.000056] pci 0000:0a:00.0: PME# supported from D0 D3hot
[  +0.000099] pci 0000:0a:00.1: [10de:10f9] type 00 class 0x040300
[  +0.000018] pci 0000:0a:00.1: reg 0x10: [mem 0xf7080000-0xf7083fff]
[  +0.000128] pci 0000:0a:00.2: [10de:1ada] type 00 class 0x0c0330
[  +0.000025] pci 0000:0a:00.2: reg 0x10: [mem 0xf2000000-0xf203ffff 64bit pref]
[  +0.000021] pci 0000:0a:00.2: reg 0x1c: [mem 0xf2040000-0xf204ffff 64bit pref]
[  +0.000065] pci 0000:0a:00.2: PME# supported from D0 D3hot
[  +0.000053] pci 0000:0a:00.3: [10de:1adb] type 00 class 0x0c8000
[  +0.000018] pci 0000:0a:00.3: reg 0x10: [mem 0xf7084000-0xf7084fff]
[  +0.000083] pci 0000:0a:00.3: PME# supported from D0 D3hot
[  +0.011222] pci 0000:00:03.1: PCI bridge to [bus 0a]
[  +0.000005] pci 0000:00:03.1:   bridge window [io  0xe000-0xefff]
[  +0.000003] pci 0000:00:03.1:   bridge window [mem 0xf6000000-0xf70fffff]
[  +0.000004] pci 0000:00:03.1:   bridge window [mem 0xe0000000-0xf20fffff 64bit pref]
[  +0.000093] pci 0000:0b:00.0: [10de:1f02] type 00 class 0x030000
[  +0.000030] pci 0000:0b:00.0: reg 0x10: [mem 0xf4000000-0xf4ffffff]
[  +0.000013] pci 0000:0b:00.0: reg 0x14: [mem 0xc0000000-0xcfffffff 64bit pref]
[  +0.000014] pci 0000:0b:00.0: reg 0x1c: [mem 0xd0000000-0xd1ffffff 64bit pref]
[  +0.000008] pci 0000:0b:00.0: reg 0x24: [io  0xd000-0xd07f]
[  +0.000009] pci 0000:0b:00.0: reg 0x30: [mem 0xf5000000-0xf507ffff pref]
[  +0.000068] pci 0000:0b:00.0: PME# supported from D0 D3hot
[  +0.000097] pci 0000:0b:00.1: [10de:10f9] type 00 class 0x040300
[  +0.000018] pci 0000:0b:00.1: reg 0x10: [mem 0xf5080000-0xf5083fff]
[  +0.000126] pci 0000:0b:00.2: [10de:1ada] type 00 class 0x0c0330
[  +0.000025] pci 0000:0b:00.2: reg 0x10: [mem 0xd2000000-0xd203ffff 64bit pref]
[  +0.000021] pci 0000:0b:00.2: reg 0x1c: [mem 0xd2040000-0xd204ffff 64bit pref]
[  +0.000064] pci 0000:0b:00.2: PME# supported from D0 D3hot
[  +0.000050] pci 0000:0b:00.3: [10de:1adb] type 00 class 0x0c8000
[  +0.000018] pci 0000:0b:00.3: reg 0x10: [mem 0xf5084000-0xf5084fff]
[  +0.000082] pci 0000:0b:00.3: PME# supported from D0 D3hot
[  +0.011252] pci 0000:00:03.2: PCI bridge to [bus 0b]
[  +0.000005] pci 0000:00:03.2:   bridge window [io  0xd000-0xdfff]
[  +0.000003] pci 0000:00:03.2:   bridge window [mem 0xf4000000-0xf50fffff]
[  +0.000004] pci 0000:00:03.2:   bridge window [mem 0xc0000000-0xd20fffff 64bit pref]
[  +0.000084] pci 0000:0c:00.0: [1022:145a] type 00 class 0x130000
[  +0.000027] pci 0000:0c:00.0: enabling Extended Tags
[  +0.000065] pci 0000:0c:00.2: [1022:1456] type 00 class 0x108000
[  +0.000015] pci 0000:0c:00.2: reg 0x18: [mem 0xf7700000-0xf77fffff]
[  +0.000008] pci 0000:0c:00.2: reg 0x24: [mem 0xf7800000-0xf7801fff]
[  +0.000005] pci 0000:0c:00.2: enabling Extended Tags
[  +0.000070] pci 0000:0c:00.3: [1022:145f] type 00 class 0x0c0330
[  +0.000012] pci 0000:0c:00.3: reg 0x10: [mem 0xf7600000-0xf76fffff 64bit]
[  +0.000019] pci 0000:0c:00.3: enabling Extended Tags
[  +0.000029] pci 0000:0c:00.3: PME# supported from D0 D3hot D3cold
[  +0.000060] pci 0000:00:07.1: PCI bridge to [bus 0c]
[  +0.000004] pci 0000:00:07.1:   bridge window [mem 0xf7600000-0xf78fffff]
[  +0.000551] pci 0000:0d:00.0: [1022:1455] type 00 class 0x130000
[  +0.000024] pci 0000:0d:00.0: enabling Extended Tags
[  +0.000055] pci 0000:0d:00.2: [1022:7901] type 00 class 0x010601
[  +0.000025] pci 0000:0d:00.2: reg 0x24: [mem 0xf7908000-0xf7908fff]
[  +0.000007] pci 0000:0d:00.2: enabling Extended Tags
[  +0.000025] pci 0000:0d:00.2: PME# supported from D3hot D3cold
[  +0.000039] pci 0000:0d:00.3: [1022:1457] type 00 class 0x040300
[  +0.000009] pci 0000:0d:00.3: reg 0x10: [mem 0xf7900000-0xf7907fff]
[  +0.000016] pci 0000:0d:00.3: enabling Extended Tags
[  +0.000024] pci 0000:0d:00.3: PME# supported from D0 D3hot D3cold
[  +0.000048] pci 0000:00:08.1: PCI bridge to [bus 0d]
[  +0.000003] pci 0000:00:08.1:   bridge window [mem 0xf7900000-0xf79fffff]
[  +0.000253] ACPI: PCI Interrupt Link [LNKA] (IRQs 4 5 7 10 11 14 15) *0
[  +0.000052] ACPI: PCI Interrupt Link [LNKB] (IRQs 4 5 7 10 11 14 15) *0
[  +0.000044] ACPI: PCI Interrupt Link [LNKC] (IRQs 4 5 7 10 11 14 15) *0
[  +0.000056] ACPI: PCI Interrupt Link [LNKD] (IRQs 4 5 7 10 11 14 15) *0
[  +0.000050] ACPI: PCI Interrupt Link [LNKE] (IRQs 4 5 7 10 11 14 15) *0
[  +0.000040] ACPI: PCI Interrupt Link [LNKF] (IRQs 4 5 7 10 11 14 15) *0
[  +0.000041] ACPI: PCI Interrupt Link [LNKG] (IRQs 4 5 7 10 11 14 15) *0
[  +0.000040] ACPI: PCI Interrupt Link [LNKH] (IRQs 4 5 7 10 11 14 15) *0
[  +0.002152] SCSI subsystem initialized
[  +0.000008] libata version 3.00 loaded.
[  +0.000004] pci 0000:0a:00.0: vgaarb: VGA device added: decodes=io+mem,owns=none,locks=none
[  +0.000000] pci 0000:0b:00.0: vgaarb: VGA device added: decodes=io+mem,owns=none,locks=none
[  +0.000000] pci 0000:0a:00.0: vgaarb: bridge control possible
[  +0.000000] pci 0000:0b:00.0: vgaarb: bridge control possible
[  +0.000000] pci 0000:0a:00.0: vgaarb: setting as boot device
...
[  +0.000018] AMD-Vi: IOMMU performance counters supported
[  +0.000239] iommu: Adding device 0000:00:01.0 to group 0
[  +0.000078] iommu: Adding device 0000:00:01.1 to group 1
[  +0.000083] iommu: Adding device 0000:00:01.3 to group 2
[  +0.000076] iommu: Adding device 0000:00:02.0 to group 3
[  +0.000093] iommu: Adding device 0000:00:03.0 to group 4
[  +0.000085] iommu: Adding device 0000:00:03.1 to group 5
[  +0.000091] iommu: Adding device 0000:00:03.2 to group 6
[  +0.000080] iommu: Adding device 0000:00:04.0 to group 7
[  +0.000092] iommu: Adding device 0000:00:07.0 to group 8
[  +0.000086] iommu: Adding device 0000:00:07.1 to group 9
[  +0.000077] iommu: Adding device 0000:00:08.0 to group 10
[  +0.000083] iommu: Adding device 0000:00:08.1 to group 11
[  +0.000081] iommu: Adding device 0000:00:14.0 to group 12
[  +0.000009] iommu: Adding device 0000:00:14.3 to group 12
[  +0.000106] iommu: Adding device 0000:00:18.0 to group 13
[  +0.000010] iommu: Adding device 0000:00:18.1 to group 13
[  +0.000009] iommu: Adding device 0000:00:18.2 to group 13
[  +0.000009] iommu: Adding device 0000:00:18.3 to group 13
[  +0.000009] iommu: Adding device 0000:00:18.4 to group 13
[  +0.000009] iommu: Adding device 0000:00:18.5 to group 13
[  +0.000009] iommu: Adding device 0000:00:18.6 to group 13
[  +0.000010] iommu: Adding device 0000:00:18.7 to group 13
[  +0.000086] iommu: Adding device 0000:01:00.0 to group 14
[  +0.000112] iommu: Adding device 0000:02:00.0 to group 15
[  +0.000021] iommu: Adding device 0000:02:00.1 to group 15
[  +0.000022] iommu: Adding device 0000:02:00.2 to group 15
[  +0.000009] iommu: Adding device 0000:03:00.0 to group 15
[  +0.000009] iommu: Adding device 0000:03:01.0 to group 15
[  +0.000010] iommu: Adding device 0000:03:02.0 to group 15
[  +0.000009] iommu: Adding device 0000:03:03.0 to group 15
[  +0.000009] iommu: Adding device 0000:03:04.0 to group 15
[  +0.000008] iommu: Adding device 0000:03:09.0 to group 15
[  +0.000017] iommu: Adding device 0000:07:00.0 to group 15
[  +0.000018] iommu: Adding device 0000:08:00.0 to group 15
[  +0.000015] iommu: Adding device 0000:09:00.0 to group 15
[  +0.000130] iommu: Adding device 0000:0a:00.0 to group 16
[  +0.000036] iommu: Adding device 0000:0a:00.1 to group 16
[  +0.000034] iommu: Adding device 0000:0a:00.2 to group 16
[  +0.000035] iommu: Adding device 0000:0a:00.3 to group 16
[  +0.000136] iommu: Adding device 0000:0b:00.0 to group 17
[  +0.000036] iommu: Adding device 0000:0b:00.1 to group 17
[  +0.000035] iommu: Adding device 0000:0b:00.2 to group 17
[  +0.000034] iommu: Adding device 0000:0b:00.3 to group 17
[  +0.000092] iommu: Adding device 0000:0c:00.0 to group 18
[  +0.000121] iommu: Adding device 0000:0c:00.2 to group 19
[  +0.000091] iommu: Adding device 0000:0c:00.3 to group 20
[  +0.000099] iommu: Adding device 0000:0d:00.0 to group 21
[  +0.000087] iommu: Adding device 0000:0d:00.2 to group 22
[  +0.000095] iommu: Adding device 0000:0d:00.3 to group 23
[  +0.000180] AMD-Vi: Found IOMMU at 0000:00:00.2 cap 0x40
...
[  +0.000002] AMD-Vi: Interrupt remapping enabled
[  +0.000000] AMD-Vi: virtual APIC enabled
[  +0.000533] AMD-Vi: Lazy IO/TLB flushing enabled
[  +0.000645] amd_uncore: AMD NB counters detected
[  +0.000003] amd_uncore: AMD LLC counters detected
...
[Mar21 15:13]  nvme0n1: p1 p2 p3
[  +0.025167]  sdc: sdc1
[  +0.070456] vmbr0: port 2(tap101i0) entered disabled state
[  +0.066489] input: Ideazon Zboard as /devices/pci0000:00/0000:00:01.3/0000:02:00.0/usb1/1-9/1-9.4/1-9.4:1.0/0003:1038:0100.0006/input/input7
[  +0.056782] hid-generic 0003:1038:0100.0006: input,hidraw0: USB HID v1.10 Keyboard [Ideazon Zboard] on usb-0000:02:00.0-9.4/input0
[  +0.017327] input: Ideazon Zboard as /devices/pci0000:00/0000:00:01.3/0000:02:00.0/usb1/1-9/1-9.4/1-9.4:1.1/0003:1038:0100.0007/input/input8
[  +0.058649] hid-generic 0003:1038:0100.0007: input,hidraw1: USB HID v1.10 Device [Ideazon Zboard] on usb-0000:02:00.0-9.4/input1
[  +0.005180] hid-generic 0003:1038:170E.0008: hiddev0,hidraw2: USB HID v1.11 Device [SteelSeries SteelSeries Rival 500 Gaming Mouse] on usb-0000:02:00.0-10/input0
[  +0.005968] input: SteelSeries SteelSeries Rival 500 Gaming Mouse as /devices/pci0000:00/0000:00:01.3/0000:02:00.0/usb1/1-10/1-10:1.1/0003:1038:170E.0009/input/input9
[  +0.000101] hid-generic 0003:1038:170E.0009: input,hidraw3: USB HID v1.11 Mouse [SteelSeries SteelSeries Rival 500 Gaming Mouse] on usb-0000:02:00.0-10/input1
[  +0.006023] input: SteelSeries SteelSeries Rival 500 Gaming Mouse as /devices/pci0000:00/0000:00:01.3/0000:02:00.0/usb1/1-10/1-10:1.2/0003:1038:170E.000A/input/input10
[  +0.058679] hid-generic 0003:1038:170E.000A: input,hidraw4: USB HID v1.11 Keyboard [SteelSeries SteelSeries Rival 500 Gaming Mouse] on usb-0000:02:00.0-10/input2
[ +14.654324] xhci_hcd 0000:0b:00.2: remove, state 4
[  +0.000009] usb usb8: USB disconnect, device number 1
[  +0.000130] xhci_hcd 0000:0b:00.2: USB bus 8 deregistered
[  +0.000005] xhci_hcd 0000:0b:00.2: remove, state 4
[  +0.000003] usb usb7: USB disconnect, device number 1
[  +0.000626] xhci_hcd 0000:0b:00.2: USB bus 7 deregistered
[  +0.042765]  nvme0n1: p1 p2 p3
[  +0.002428]  sdc: sdc1
[  +0.500182] device tap101i0 entered promiscuous mode
[  +0.006241] vmbr0: port 2(tap101i0) entered blocking state
[  +0.000002] vmbr0: port 2(tap101i0) entered disabled state
[  +0.000070] vmbr0: port 2(tap101i0) entered blocking state
[  +0.000001] vmbr0: port 2(tap101i0) entered forwarding state
[  +2.405444] vfio_ecap_init: 0000:0b:00.0 hiding ecap 0x1e@0x258
[  +0.000026] vfio_ecap_init: 0000:0b:00.0 hiding ecap 0x19@0x900
[  +0.019811] vfio-pci 0000:0b:00.1: enabling device (0000 -> 0002)
[  +0.040001] vfio-pci 0000:0b:00.3: enabling device (0000 -> 0002)
[  +2.137597] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.543863] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.756136] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.711844] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.672162] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.763852] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +4.640218] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.544662] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.571920] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.544084] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.504152] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.543849] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.555325] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.543877] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.718417] kvm_get_msr_common: 6 callbacks suppressed
[  +0.000002] kvm [2351]: vcpu0, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000006] kvm_set_msr_common: 6 callbacks suppressed
[  +0.000000] kvm [2351]: vcpu0, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000902] kvm [2351]: vcpu1, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000004] kvm [2351]: vcpu1, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000854] kvm [2351]: vcpu2, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000007] kvm [2351]: vcpu2, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000957] kvm [2351]: vcpu3, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000005] kvm [2351]: vcpu3, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000984] kvm [2351]: vcpu4, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000005] kvm [2351]: vcpu4, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000984] kvm [2351]: vcpu5, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000006] kvm [2351]: vcpu5, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000980] kvm [2351]: vcpu6, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000005] kvm [2351]: vcpu6, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000973] kvm [2351]: vcpu7, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000005] kvm [2351]: vcpu7, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000968] kvm [2351]: vcpu8, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000005] kvm [2351]: vcpu8, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +0.000996] kvm [2351]: vcpu9, guest rIP: 0xfffff8027e88d0dc ignored rdmsr: 0xc001102c
[  +0.000005] kvm [2351]: vcpu9, guest rIP: 0xfffff8027e88d0f9 ignored wrmsr: 0xc001102c data 0x8000000000000
[  +2.897101] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.547841] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.560152] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.547842] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.724177] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.543834] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +3.068220] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.755846] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.492571] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.515610] usb 1-10: reset full-speed USB device number 3 using xhci_hcd
[  +0.692183] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[  +0.723662] usb 1-9.4: reset low-speed USB device number 4 using xhci_hcd
[Mar21 15:16] watch[2660]: segfault at 7f1e254fed70 ip 00007f1e25fe6fb8 sp 00007ffc79ceec40 error 4 in libncursesw.so.5.9[7f1e25fcc000+2e000]

That segfault in libncurses is not something I've seen before, but it doesn't coincide with any of my tests. The problematic behavior is not reflected in any logs I've seen on the host's side.

On the guest, I'm not too sure where to look in Windows. The event viewer doesn't show any events proximate to the time of the problems for any of the devices I consider relevant (GPU driver, VirtIO drivers).

Hardware used:
  • CPU: Ryzen 7 2700X
  • Motherboard: Gigabyte X470
  • Memory:
    • GeIL - EVO SPEAR 16 GB (2 x 8 GB) DDR4-3000
    • G.SKILL TridentZ RGB Series 32GB (2 x 16GB) DDR4-3000
  • Storage:
    • (2x) 970 Evo 250 GB M.2-2280 Solid State Drive
    • old Patriot Pyro SSD
    • misc. hard disks
  • GPU: (2x) Gigabyte - GeForce RTX 2070 8 GB WINDFORCE
  • PSU: EVGA Supernova 750W G3
Immediately after booting:

Code:
# lspci -k
...
0a:00.0 VGA compatible controller: NVIDIA Corporation Device 1f02 (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
        Kernel modules: nvidiafb, nouveau
0a:00.1 Audio device: NVIDIA Corporation Device 10f9 (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
        Kernel modules: snd_hda_intel
0a:00.2 USB controller: NVIDIA Corporation Device 1ada (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
0a:00.3 Serial bus controller [0c80]: NVIDIA Corporation Device 1adb (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
0b:00.0 VGA compatible controller: NVIDIA Corporation Device 1f02 (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
        Kernel modules: nvidiafb, nouveau
0b:00.1 Audio device: NVIDIA Corporation Device 10f9 (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
        Kernel modules: snd_hda_intel
0b:00.2 USB controller: NVIDIA Corporation Device 1ada (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
0b:00.3 Serial bus controller [0c80]: NVIDIA Corporation Device 1adb (rev a1)
        Subsystem: Gigabyte Technology Co., Ltd Device 37c2
        Kernel driver in use: vfio-pci
...
 
Last edited:
mhmm.. there is really nothing standing our from the info you posted, except the weird segfault in libncurses (i dont believe it has anything to do with it, but i would investigate nonetheless, maybe do a memory check or use 'debsums' to verify that the files are correct)

one thing you could try is to not passthrough the usb devices of the card (so only 0b:00.0 and 0b:00.1) if you do not need them (i have such a setup running at home with a 2060 which works)
the syntax would be
Code:
hostpci0: 0b:00.0;0b:00.1,x-vga=1,romfile=rtx_2070_d.rom,pcie=1

also, try without or with a different romfile

aside from that, i would suggest the typical suggestions: update the host/guest, update the drivers,etc.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!