GPU Passthrough partially not working since kernel upgrade

Slyrack

Member
May 16, 2022
5
0
6
Hello,

I'm currently testing different GPU cards on a Dell T7920 with quite good results until about a week ago. At the moment there are a GTX1080 and a RTX4000.
I'm using the pve-no-subscription repo and since the kernel has been upgraded from 5.13.19-6-pve (last fully working version for me) I get the following error when I try to boot the VM : kvm: -device vfio-pci,host=0000:73:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: Failed to mmap 0000:73:00.0 BAR 3. Performance may be slow
This only affects the RTX4000. It is no longer detected inside the VM while the GTX1080 is still working normally.
I know that the problem is due to the fact that the kernel has been updated because when I pin the latest functional version (without any other changes), it works normally again.

Here below some tests with the two next kernel versions released (5.15.35-1-pve and 5.15.30-2-pve).
Bash:
root@projectvm:~# uname -a
Linux projectvm 5.15.35-1-pve #1 SMP PVE 5.15.35-2 (Thu, 05 May 2022 13:54:35 +0200) x86_64 GNU/Linux
root@projectvm:~# qm start 106
no efidisk configured! Using temporary efivars disk.
kvm: -device vfio-pci,host=0000:73:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: Failed to mmap 0000:73:00.0 BAR 3. Performance may be slow
root@projectvm:~# proxmox-boot-tool kernel pin 5.15.30-3-pve
Overriding previously pinned version '5.15.35-1-pve' with '5.15.30-2-pve'
Set kernel '5.15.30-2-pve' in /etc/kernel/proxmox-boot-pin.
Refresh the actual boot ESPs now? [yN] y
Running hook script 'proxmox-auto-removal'..
Running hook script 'zz-proxmox-boot'..
Re-executing '/etc/kernel/postinst.d/zz-proxmox-boot' in new private mount namespace..
Copying and configuring kernels on /dev/disk/by-uuid/6CA5-071E
        Copying kernel and creating boot-entry for 5.13.19-6-pve
        Copying kernel and creating boot-entry for 5.15.30-2-pve
        Copying kernel and creating boot-entry for 5.15.35-1-pve
Copying and configuring kernels on /dev/disk/by-uuid/6CA5-51E5
        Copying kernel and creating boot-entry for 5.13.19-6-pve
        Copying kernel and creating boot-entry for 5.15.30-2-pve
        Copying kernel and creating boot-entry for 5.15.35-1-pve
root@projectvm:~# reboot


root@projectvm:~# uname -a
Linux projectvm 5.15.30-2-pve #1 SMP PVE 5.15.30-3 (Fri, 22 Apr 2022 18:08:27 +0200) x86_64 GNU/Linux
root@projectvm:~# qm start 106
no efidisk configured! Using temporary efivars disk.
kvm: -device vfio-pci,host=0000:73:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: Failed to mmap 0000:73:00.0 BAR 3. Performance may be slow
root@projectvm:~# proxmox-boot-tool kernel pin 5.13.19-6-pve
[...]
root@projectvm:~# reboot


root@projectvm:~# uname -a
Linux projectvm 5.13.19-6-pve #1 SMP PVE 5.13.19-15 (Tue, 29 Mar 2022 15:59:50 +0200) x86_64 GNU/Linux
root@projectvm:~# qm start 106
no efidisk configured! Using temporary efivars disk.
root@projectvm:~# <-- Working again !

Some configuration files :

Bash:
root@projectvm:~# cat /proc/cmdline
initrd=\EFI\proxmox\5.13.19-6-pve\initrd.img-5.13.19-6-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs video=efifb:off intel_iommu=on irqpoll
root@projectvm:~# cat /proc/cmdline
initrd=\EFI\proxmox\5.15.30-2-pve\initrd.img-5.15.30-2-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs video=efifb:off intel_iommu=on irqpoll
root@projectvm:~# cat /proc/cmdline
initrd=\EFI\proxmox\5.15.35-1-pve\initrd.img-5.15.35-1-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs video=efifb:off intel_iommu=on irqpoll

(using 5.15.35-1-pve)
Bash:
root@projectvm:~# dmesg | grep -e DMAR -e IOMMU
[    0.020385] ACPI: DMAR 0x0000000069800DF8 000270 (v01 DELL\x CBX3     00000001 INTL 20091013)
[    0.020470] ACPI: Reserving DMAR table memory at [mem 0x69800df8-0x69801067]
[    0.160165] DMAR: IOMMU enabled
[    0.376725] DMAR: Host address width 46
[    0.376727] DMAR: DRHD base: 0x000000d37fc000 flags: 0x0
[    0.376736] DMAR: dmar0: reg_base_addr d37fc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376742] DMAR: DRHD base: 0x000000e0ffc000 flags: 0x0
[    0.376748] DMAR: dmar1: reg_base_addr e0ffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376753] DMAR: DRHD base: 0x000000ee7fc000 flags: 0x0
[    0.376758] DMAR: dmar2: reg_base_addr ee7fc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376763] DMAR: DRHD base: 0x000000fbffc000 flags: 0x0
[    0.376768] DMAR: dmar3: reg_base_addr fbffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376772] DMAR: DRHD base: 0x000000a0ffc000 flags: 0x0
[    0.376777] DMAR: dmar4: reg_base_addr a0ffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376781] DMAR: DRHD base: 0x000000a47fc000 flags: 0x0
[    0.376791] DMAR: dmar5: reg_base_addr a47fc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376796] DMAR: DRHD base: 0x000000c5ffc000 flags: 0x0
[    0.376801] DMAR: dmar6: reg_base_addr c5ffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376805] DMAR: DRHD base: 0x000000a0bfc000 flags: 0x1
[    0.376810] DMAR: dmar7: reg_base_addr a0bfc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376814] DMAR: RMRR base: 0x0000006bc56000 end: 0x0000006bc66fff
[    0.376818] DMAR: ATSR flags: 0x0
[    0.376821] DMAR: ATSR flags: 0x0
[    0.376823] DMAR: RHSA base: 0x000000a0bfc000 proximity domain: 0x0
[    0.376827] DMAR: RHSA base: 0x000000a0ffc000 proximity domain: 0x0
[    0.376829] DMAR: RHSA base: 0x000000a47fc000 proximity domain: 0x0
[    0.376832] DMAR: RHSA base: 0x000000c5ffc000 proximity domain: 0x0
[    0.376835] DMAR: RHSA base: 0x000000d37fc000 proximity domain: 0x1
[    0.376837] DMAR: RHSA base: 0x000000e0ffc000 proximity domain: 0x1
[    0.376840] DMAR: RHSA base: 0x000000ee7fc000 proximity domain: 0x1
[    0.376842] DMAR: RHSA base: 0x000000fbffc000 proximity domain: 0x1
[    0.376847] DMAR-IR: IOAPIC id 12 under DRHD base  0xc5ffc000 IOMMU 6
[    0.376851] DMAR-IR: IOAPIC id 11 under DRHD base  0xa47fc000 IOMMU 5
[    0.376854] DMAR-IR: IOAPIC id 10 under DRHD base  0xa0ffc000 IOMMU 4
[    0.376857] DMAR-IR: IOAPIC id 18 under DRHD base  0xfbffc000 IOMMU 3
[    0.376860] DMAR-IR: IOAPIC id 17 under DRHD base  0xee7fc000 IOMMU 2
[    0.376863] DMAR-IR: IOAPIC id 16 under DRHD base  0xe0ffc000 IOMMU 1
[    0.376866] DMAR-IR: IOAPIC id 15 under DRHD base  0xd37fc000 IOMMU 0
[    0.376869] DMAR-IR: IOAPIC id 8 under DRHD base  0xa0bfc000 IOMMU 7
[    0.376872] DMAR-IR: IOAPIC id 9 under DRHD base  0xa0bfc000 IOMMU 7
[    0.376875] DMAR-IR: HPET id 0 under DRHD base 0xa0bfc000
[    0.376879] DMAR-IR: Queued invalidation will be enabled to support x2apic and Intr-remapping.
[    0.379165] DMAR-IR: Enabled IRQ remapping in x2apic mode
[   34.087511] DMAR: No SATC found
[   34.087516] DMAR: dmar6: Using Queued invalidation
[   34.087522] DMAR: dmar5: Using Queued invalidation
[   34.087528] DMAR: dmar4: Using Queued invalidation
[   34.087547] DMAR: dmar3: Using Queued invalidation
[   34.087551] DMAR: dmar2: Using Queued invalidation
[   34.087555] DMAR: dmar1: Using Queued invalidation
[   34.087559] DMAR: dmar0: Using Queued invalidation
[   34.087571] DMAR: dmar7: Using Queued invalidation
[   34.180091] DMAR: Intel(R) Virtualization Technology for Directed I/O
[   41.831310] snd_emu10k1 0000:d6:00.0: non-passthrough IOMMU detected, widening DMA allocations

Bash:
root@projectvm:~# cat /etc/modules
vfio
vfio_iommu_type1
vfio_pci
vfio_virqfd

Bash:
root@projectvm:~# dmesg | grep 'remapping'
[    0.376879] DMAR-IR: Queued invalidation will be enabled to support x2apic and Intr-remapping.
[    0.379165] DMAR-IR: Enabled IRQ remapping in x2apic mode

Bash:
root@projectvm:~# cat /etc/modprobe.d/*
blacklist nouveau
blacklist nvidia

options kvm ignore_msrs=1

blacklist nvidiafb

options vfio-pci ids=10de:1b80,10de:10f0,10de:1eb0,10de:10f8,10de:1ad8,10de:1ad9,10de:1eb1,10de:10f8,10de:1ad8,10de:1ad9,10de:1cb3,10de:0fb9,10de:107d,10de:0e08 disable_vga=1


I know that using the pve-no-subscription repository is less tested and at our own risk, but I don't even know where I should start investigating to fix the problem.
At the moment I am continuing my tests on 5.13.19-6-pve but I would like to understand what is going on.
Any ideas ?

Many thanks !
 
Last edited:
Here is some additional configuration :

IOMMU groups are working :
Bash:
root@projectvm:~# find /sys/kernel/iommu_groups/ -type l
/sys/kernel/iommu_groups/108/devices/0000:a5:0f.0
/sys/kernel/iommu_groups/108/devices/0000:a5:0f.1
/sys/kernel/iommu_groups/55/devices/0000:44:0a.6
/sys/kernel/iommu_groups/136/devices/0000:d1:0c.6
/sys/kernel/iommu_groups/83/devices/0000:72:12.2
/sys/kernel/iommu_groups/83/devices/0000:72:12.1
/sys/kernel/iommu_groups/17/devices/0000:00:16.2
/sys/kernel/iommu_groups/17/devices/0000:00:16.0
/sys/kernel/iommu_groups/17/devices/0000:00:16.3
/sys/kernel/iommu_groups/45/devices/0000:44:05.4
/sys/kernel/iommu_groups/126/devices/0000:d1:0b.0
/sys/kernel/iommu_groups/73/devices/0000:45:00.1
/sys/kernel/iommu_groups/73/devices/0000:45:00.0
/sys/kernel/iommu_groups/154/devices/0000:d6:00.0
/sys/kernel/iommu_groups/154/devices/0000:d5:00.0
/sys/kernel/iommu_groups/35/devices/0000:16:08.6
/sys/kernel/iommu_groups/35/devices/0000:16:08.4
/sys/kernel/iommu_groups/35/devices/0000:16:08.2
/sys/kernel/iommu_groups/35/devices/0000:16:08.0
/sys/kernel/iommu_groups/35/devices/0000:16:08.7
/sys/kernel/iommu_groups/35/devices/0000:16:08.5
/sys/kernel/iommu_groups/35/devices/0000:16:08.3
/sys/kernel/iommu_groups/35/devices/0000:16:08.1
/sys/kernel/iommu_groups/7/devices/0000:00:04.6
/sys/kernel/iommu_groups/116/devices/0000:d1:08.0
/sys/kernel/iommu_groups/63/devices/0000:44:0c.2
/sys/kernel/iommu_groups/144/devices/0000:d4:05.2
/sys/kernel/iommu_groups/91/devices/0000:a0:04.4
/sys/kernel/iommu_groups/25/devices/0000:02:00.0
/sys/kernel/iommu_groups/25/devices/0000:02:00.1
/sys/kernel/iommu_groups/106/devices/0000:a5:09.0
/sys/kernel/iommu_groups/106/devices/0000:a5:09.1
/sys/kernel/iommu_groups/53/devices/0000:44:0a.4
/sys/kernel/iommu_groups/134/devices/0000:d1:0c.4
/sys/kernel/iommu_groups/81/devices/0000:72:0f.1
/sys/kernel/iommu_groups/15/devices/0000:00:11.5
/sys/kernel/iommu_groups/15/devices/0000:00:11.0
/sys/kernel/iommu_groups/43/devices/0000:44:05.0
/sys/kernel/iommu_groups/124/devices/0000:d1:0a.6
/sys/kernel/iommu_groups/71/devices/0000:44:0d.2
/sys/kernel/iommu_groups/152/devices/0000:d4:15.0
/sys/kernel/iommu_groups/152/devices/0000:d4:15.1
/sys/kernel/iommu_groups/33/devices/0000:16:05.2
/sys/kernel/iommu_groups/5/devices/0000:00:04.4
/sys/kernel/iommu_groups/114/devices/0000:d1:05.4
/sys/kernel/iommu_groups/61/devices/0000:44:0c.0
/sys/kernel/iommu_groups/142/devices/0000:d4:00.0
/sys/kernel/iommu_groups/23/devices/0000:00:1f.2
/sys/kernel/iommu_groups/23/devices/0000:00:1f.0
/sys/kernel/iommu_groups/23/devices/0000:00:1f.5
/sys/kernel/iommu_groups/23/devices/0000:00:1f.6
/sys/kernel/iommu_groups/23/devices/0000:00:1f.4
/sys/kernel/iommu_groups/104/devices/0000:a5:05.4
/sys/kernel/iommu_groups/51/devices/0000:44:0a.2
/sys/kernel/iommu_groups/132/devices/0000:d1:0c.2
/sys/kernel/iommu_groups/13/devices/0000:00:08.1
/sys/kernel/iommu_groups/41/devices/0000:17:00.0
/sys/kernel/iommu_groups/122/devices/0000:d1:0a.4
/sys/kernel/iommu_groups/150/devices/0000:d4:12.0
/sys/kernel/iommu_groups/98/devices/0000:a0:08.0
/sys/kernel/iommu_groups/31/devices/0000:16:00.0
/sys/kernel/iommu_groups/3/devices/0000:00:04.2
/sys/kernel/iommu_groups/112/devices/0000:d1:05.0
/sys/kernel/iommu_groups/140/devices/0000:d1:0d.2
/sys/kernel/iommu_groups/88/devices/0000:a0:04.1
/sys/kernel/iommu_groups/21/devices/0000:00:1c.7
/sys/kernel/iommu_groups/102/devices/0000:a5:05.0
/sys/kernel/iommu_groups/130/devices/0000:d1:0c.0
/sys/kernel/iommu_groups/78/devices/0000:72:0e.0
/sys/kernel/iommu_groups/11/devices/0000:00:05.4
/sys/kernel/iommu_groups/120/devices/0000:d1:0a.2
/sys/kernel/iommu_groups/68/devices/0000:44:0c.7
/sys/kernel/iommu_groups/149/devices/0000:d4:0f.1
/sys/kernel/iommu_groups/96/devices/0000:a0:05.2
/sys/kernel/iommu_groups/1/devices/0000:00:04.0
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.2
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.0
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.5
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.3
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.1
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.6
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.4
/sys/kernel/iommu_groups/58/devices/0000:44:0b.1
/sys/kernel/iommu_groups/139/devices/0000:d1:0d.1
/sys/kernel/iommu_groups/86/devices/0000:73:00.3
/sys/kernel/iommu_groups/86/devices/0000:73:00.1
/sys/kernel/iommu_groups/86/devices/0000:73:00.2
/sys/kernel/iommu_groups/86/devices/0000:73:00.0
/sys/kernel/iommu_groups/100/devices/0000:a0:08.2
/sys/kernel/iommu_groups/48/devices/0000:44:09.0
/sys/kernel/iommu_groups/129/devices/0000:d1:0b.3
/sys/kernel/iommu_groups/76/devices/0000:72:05.2
/sys/kernel/iommu_groups/38/devices/0000:16:0f.1
/sys/kernel/iommu_groups/38/devices/0000:16:0f.0
/sys/kernel/iommu_groups/119/devices/0000:d1:0a.1
/sys/kernel/iommu_groups/66/devices/0000:44:0c.5
/sys/kernel/iommu_groups/147/devices/0000:d4:0e.1
/sys/kernel/iommu_groups/94/devices/0000:a0:04.7
/sys/kernel/iommu_groups/28/devices/0000:04:00.1
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.2
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.0
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.3
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.1
/sys/kernel/iommu_groups/56/devices/0000:44:0a.7
/sys/kernel/iommu_groups/137/devices/0000:d1:0c.7
/sys/kernel/iommu_groups/84/devices/0000:72:15.0
/sys/kernel/iommu_groups/84/devices/0000:72:15.1
/sys/kernel/iommu_groups/18/devices/0000:00:17.0
/sys/kernel/iommu_groups/46/devices/10000:00:03.0
/sys/kernel/iommu_groups/46/devices/0000:44:05.5
/sys/kernel/iommu_groups/46/devices/10000:00:02.0
/sys/kernel/iommu_groups/127/devices/0000:d1:0b.1
/sys/kernel/iommu_groups/74/devices/0000:72:00.0
/sys/kernel/iommu_groups/36/devices/0000:16:09.0
/sys/kernel/iommu_groups/36/devices/0000:16:09.1
/sys/kernel/iommu_groups/8/devices/0000:00:04.7
/sys/kernel/iommu_groups/117/devices/0000:d1:09.0
/sys/kernel/iommu_groups/64/devices/0000:44:0c.3
/sys/kernel/iommu_groups/145/devices/0000:d4:05.4
/sys/kernel/iommu_groups/92/devices/0000:a0:04.5
/sys/kernel/iommu_groups/26/devices/0000:03:00.0
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.6
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.4
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.2
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.0
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.7
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.5
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.3
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.1
/sys/kernel/iommu_groups/54/devices/0000:44:0a.5
/sys/kernel/iommu_groups/135/devices/0000:d1:0c.5
/sys/kernel/iommu_groups/82/devices/0000:72:12.0
/sys/kernel/iommu_groups/16/devices/0000:00:14.2
/sys/kernel/iommu_groups/16/devices/0000:00:14.0
/sys/kernel/iommu_groups/44/devices/0000:44:05.2
/sys/kernel/iommu_groups/125/devices/0000:d1:0a.7
/sys/kernel/iommu_groups/72/devices/0000:44:0d.3
/sys/kernel/iommu_groups/153/devices/0000:d4:16.5
/sys/kernel/iommu_groups/153/devices/0000:d4:16.1
/sys/kernel/iommu_groups/153/devices/0000:d4:16.4
/sys/kernel/iommu_groups/153/devices/0000:d4:16.0
/sys/kernel/iommu_groups/34/devices/0000:16:05.4
/sys/kernel/iommu_groups/6/devices/0000:00:04.5
/sys/kernel/iommu_groups/115/devices/10001:00:00.0
/sys/kernel/iommu_groups/115/devices/0000:d1:05.5
/sys/kernel/iommu_groups/115/devices/10001:00:01.0
/sys/kernel/iommu_groups/62/devices/0000:44:0c.1
/sys/kernel/iommu_groups/143/devices/0000:d4:05.0
/sys/kernel/iommu_groups/90/devices/0000:a0:04.3
/sys/kernel/iommu_groups/24/devices/0000:00:1f.3
/sys/kernel/iommu_groups/105/devices/0000:a5:08.3
/sys/kernel/iommu_groups/105/devices/0000:a5:08.1
/sys/kernel/iommu_groups/105/devices/0000:a5:08.6
/sys/kernel/iommu_groups/105/devices/0000:a5:08.4
/sys/kernel/iommu_groups/105/devices/0000:a5:08.2
/sys/kernel/iommu_groups/105/devices/0000:a5:08.0
/sys/kernel/iommu_groups/105/devices/0000:a5:08.7
/sys/kernel/iommu_groups/105/devices/0000:a5:08.5
/sys/kernel/iommu_groups/52/devices/0000:44:0a.3
/sys/kernel/iommu_groups/133/devices/0000:d1:0c.3
/sys/kernel/iommu_groups/80/devices/0000:72:0f.0
/sys/kernel/iommu_groups/14/devices/0000:00:08.2
/sys/kernel/iommu_groups/42/devices/0000:44:00.0
/sys/kernel/iommu_groups/123/devices/0000:d1:0a.5
/sys/kernel/iommu_groups/70/devices/0000:44:0d.1
/sys/kernel/iommu_groups/151/devices/0000:d4:12.2
/sys/kernel/iommu_groups/151/devices/0000:d4:12.1
/sys/kernel/iommu_groups/99/devices/0000:a0:08.1
/sys/kernel/iommu_groups/32/devices/0000:16:05.0
/sys/kernel/iommu_groups/4/devices/0000:00:04.3
/sys/kernel/iommu_groups/113/devices/0000:d1:05.2
/sys/kernel/iommu_groups/60/devices/0000:44:0b.3
/sys/kernel/iommu_groups/141/devices/0000:d1:0d.3
/sys/kernel/iommu_groups/89/devices/0000:a0:04.2
/sys/kernel/iommu_groups/22/devices/0000:00:1d.0
/sys/kernel/iommu_groups/103/devices/0000:a5:05.2
/sys/kernel/iommu_groups/50/devices/0000:44:0a.1
/sys/kernel/iommu_groups/131/devices/0000:d1:0c.1
/sys/kernel/iommu_groups/79/devices/0000:72:0e.1
/sys/kernel/iommu_groups/12/devices/0000:00:08.0
/sys/kernel/iommu_groups/40/devices/0000:16:1e.5
/sys/kernel/iommu_groups/40/devices/0000:16:1e.3
/sys/kernel/iommu_groups/40/devices/0000:16:1e.1
/sys/kernel/iommu_groups/40/devices/0000:16:1e.6
/sys/kernel/iommu_groups/40/devices/0000:16:1e.4
/sys/kernel/iommu_groups/40/devices/0000:16:1e.2
/sys/kernel/iommu_groups/40/devices/0000:16:1e.0
/sys/kernel/iommu_groups/121/devices/0000:d1:0a.3
/sys/kernel/iommu_groups/69/devices/0000:44:0d.0
/sys/kernel/iommu_groups/97/devices/0000:a0:05.4
/sys/kernel/iommu_groups/30/devices/0000:04:00.3
/sys/kernel/iommu_groups/2/devices/0000:00:04.1
/sys/kernel/iommu_groups/111/devices/0000:a6:00.1
/sys/kernel/iommu_groups/111/devices/0000:a6:00.0
/sys/kernel/iommu_groups/59/devices/0000:44:0b.2
/sys/kernel/iommu_groups/87/devices/0000:a0:04.0
/sys/kernel/iommu_groups/20/devices/0000:00:1c.4
/sys/kernel/iommu_groups/101/devices/0000:a5:00.0
/sys/kernel/iommu_groups/49/devices/0000:44:0a.0
/sys/kernel/iommu_groups/77/devices/0000:72:05.4
/sys/kernel/iommu_groups/10/devices/0000:00:05.2
/sys/kernel/iommu_groups/39/devices/0000:16:1d.0
/sys/kernel/iommu_groups/39/devices/0000:16:1d.3
/sys/kernel/iommu_groups/39/devices/0000:16:1d.1
/sys/kernel/iommu_groups/39/devices/0000:16:1d.2
/sys/kernel/iommu_groups/67/devices/0000:44:0c.6
/sys/kernel/iommu_groups/148/devices/0000:d4:0f.0
/sys/kernel/iommu_groups/95/devices/0000:a0:05.0
/sys/kernel/iommu_groups/29/devices/0000:04:00.2
/sys/kernel/iommu_groups/0/devices/0000:00:00.0
/sys/kernel/iommu_groups/57/devices/0000:44:0b.0
/sys/kernel/iommu_groups/138/devices/0000:d1:0d.0
/sys/kernel/iommu_groups/85/devices/0000:72:16.5
/sys/kernel/iommu_groups/85/devices/0000:72:16.1
/sys/kernel/iommu_groups/85/devices/0000:72:16.4
/sys/kernel/iommu_groups/85/devices/0000:72:16.0
/sys/kernel/iommu_groups/19/devices/0000:00:1c.0
/sys/kernel/iommu_groups/47/devices/0000:44:08.0
/sys/kernel/iommu_groups/128/devices/0000:d1:0b.2
/sys/kernel/iommu_groups/75/devices/0000:72:05.0
/sys/kernel/iommu_groups/37/devices/0000:16:0e.2
/sys/kernel/iommu_groups/37/devices/0000:16:0e.0
/sys/kernel/iommu_groups/37/devices/0000:16:0e.7
/sys/kernel/iommu_groups/37/devices/0000:16:0e.5
/sys/kernel/iommu_groups/37/devices/0000:16:0e.3
/sys/kernel/iommu_groups/37/devices/0000:16:0e.1
/sys/kernel/iommu_groups/37/devices/0000:16:0e.6
/sys/kernel/iommu_groups/37/devices/0000:16:0e.4
/sys/kernel/iommu_groups/9/devices/0000:00:05.0
/sys/kernel/iommu_groups/118/devices/0000:d1:0a.0
/sys/kernel/iommu_groups/65/devices/0000:44:0c.4
/sys/kernel/iommu_groups/146/devices/0000:d4:0e.0
/sys/kernel/iommu_groups/93/devices/0000:a0:04.6
/sys/kernel/iommu_groups/27/devices/0000:04:00.0

vfio-pci driver is in use for the RTX4000 :
Bash:
root@projectvm:~# lspci -v -n -s 73:00
0000:73:00.0 0300: 10de:1eb1 (rev a1) (prog-if 00 [VGA controller])
        Subsystem: 1028:12a0
        Flags: bus master, fast devsel, latency 0, IRQ 74, NUMA node 0, IOMMU group 86
        Memory at c4000000 (32-bit, non-prefetchable) [size=16M]
        Memory at b0000000 (64-bit, prefetchable) [size=256M]
        Memory at c0000000 (64-bit, prefetchable) [size=32M]
        I/O ports at 9000 [size=128]
        Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: [60] Power Management version 3
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Legacy Endpoint, MSI 00
        Capabilities: [100] Virtual Channel
        Capabilities: [250] Latency Tolerance Reporting
        Capabilities: [258] L1 PM Substates
        Capabilities: [128] Power Budgeting <?>
        Capabilities: [420] Advanced Error Reporting
        Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
        Capabilities: [900] Secondary PCI Express
        Capabilities: [bb0] Physical Resizable BAR
        Kernel driver in use: vfio-pci
        Kernel modules: nvidiafb, nouveau

0000:73:00.1 0403: 10de:10f8 (rev a1)
        Subsystem: 1028:12a0
        Flags: bus master, fast devsel, latency 0, IRQ 75, NUMA node 0, IOMMU group 86
        Memory at c5080000 (32-bit, non-prefetchable) [size=16K]
        Capabilities: [60] Power Management version 3
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Endpoint, MSI 00
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: vfio-pci
        Kernel modules: snd_hda_intel

0000:73:00.2 0c03: 10de:1ad8 (rev a1) (prog-if 30 [XHCI])
        Subsystem: 1028:12a0
        Flags: fast devsel, IRQ 191, NUMA node 0, IOMMU group 86
        Memory at c2000000 (64-bit, prefetchable) [size=256K]
        Memory at c2040000 (64-bit, prefetchable) [size=64K]
        Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Endpoint, MSI 00
        Capabilities: [b4] Power Management version 3
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: vfio-pci
        Kernel modules: xhci_pci

0000:73:00.3 0c80: 10de:1ad9 (rev a1)
        Subsystem: 1028:12a0
        Flags: bus master, fast devsel, latency 0, IRQ 164, NUMA node 0, IOMMU group 86
        Memory at c5084000 (32-bit, non-prefetchable) [size=4K]
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Endpoint, MSI 00
        Capabilities: [b4] Power Management version 3
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: vfio-pci
        Kernel modules: i2c_nvidia_gpu

Bash:
root@projectvm:~# cat /etc/pve/qemu-server/106.conf
bios: ovmf
boot: order=net0;scsi0;ide2
cores: 4
cpu: host
hostpci0: 0000:73:00,pcie=1
hostpci1: 0000:a6:00,pcie=1
ide2: none,media=cdrom
kvm: 1
machine: q35
memory: 4096
meta: creation-qemu=6.1.1,ctime=1647391643
name: 99test-pcie-passthrough
net0: virtio=9E:69:2A:1F:A1:14,bridge=vmbr0,firewall=1
numa: 0
ostype: l26
scsi0: local-zfs:vm-106-disk-0,backup=0,discard=on,size=2007048K,ssd=1
scsihw: virtio-scsi-pci
smbios1: uuid=4c2b6f45-2e8e-42b5-8402-9b274a387f1a
sockets: 1
vmgenid: 9ba0a9d1-bb5e-49dc-b190-356b7e5bbcec
 
Last edited:
Multiple people are having issues with kernel 5.15 (since PVE 7.2 and simplefb). Please have a look at those other threads on this forum for potential work-arounds, such as virtually disconecting the boot GPU and rescanning the PCI bus. Apologies for not collecting links to those threads for you,.