GPU Passthrough partially not working since kernel upgrade

Slyrack

Member
May 16, 2022
5
0
6
Hello,

I'm currently testing different GPU cards on a Dell T7920 with quite good results until about a week ago. At the moment there are a GTX1080 and a RTX4000.
I'm using the pve-no-subscription repo and since the kernel has been upgraded from 5.13.19-6-pve (last fully working version for me) I get the following error when I try to boot the VM : kvm: -device vfio-pci,host=0000:73:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: Failed to mmap 0000:73:00.0 BAR 3. Performance may be slow
This only affects the RTX4000. It is no longer detected inside the VM while the GTX1080 is still working normally.
I know that the problem is due to the fact that the kernel has been updated because when I pin the latest functional version (without any other changes), it works normally again.

Here below some tests with the two next kernel versions released (5.15.35-1-pve and 5.15.30-2-pve).
Bash:
root@projectvm:~# uname -a
Linux projectvm 5.15.35-1-pve #1 SMP PVE 5.15.35-2 (Thu, 05 May 2022 13:54:35 +0200) x86_64 GNU/Linux
root@projectvm:~# qm start 106
no efidisk configured! Using temporary efivars disk.
kvm: -device vfio-pci,host=0000:73:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: Failed to mmap 0000:73:00.0 BAR 3. Performance may be slow
root@projectvm:~# proxmox-boot-tool kernel pin 5.15.30-3-pve
Overriding previously pinned version '5.15.35-1-pve' with '5.15.30-2-pve'
Set kernel '5.15.30-2-pve' in /etc/kernel/proxmox-boot-pin.
Refresh the actual boot ESPs now? [yN] y
Running hook script 'proxmox-auto-removal'..
Running hook script 'zz-proxmox-boot'..
Re-executing '/etc/kernel/postinst.d/zz-proxmox-boot' in new private mount namespace..
Copying and configuring kernels on /dev/disk/by-uuid/6CA5-071E
        Copying kernel and creating boot-entry for 5.13.19-6-pve
        Copying kernel and creating boot-entry for 5.15.30-2-pve
        Copying kernel and creating boot-entry for 5.15.35-1-pve
Copying and configuring kernels on /dev/disk/by-uuid/6CA5-51E5
        Copying kernel and creating boot-entry for 5.13.19-6-pve
        Copying kernel and creating boot-entry for 5.15.30-2-pve
        Copying kernel and creating boot-entry for 5.15.35-1-pve
root@projectvm:~# reboot


root@projectvm:~# uname -a
Linux projectvm 5.15.30-2-pve #1 SMP PVE 5.15.30-3 (Fri, 22 Apr 2022 18:08:27 +0200) x86_64 GNU/Linux
root@projectvm:~# qm start 106
no efidisk configured! Using temporary efivars disk.
kvm: -device vfio-pci,host=0000:73:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: Failed to mmap 0000:73:00.0 BAR 3. Performance may be slow
root@projectvm:~# proxmox-boot-tool kernel pin 5.13.19-6-pve
[...]
root@projectvm:~# reboot


root@projectvm:~# uname -a
Linux projectvm 5.13.19-6-pve #1 SMP PVE 5.13.19-15 (Tue, 29 Mar 2022 15:59:50 +0200) x86_64 GNU/Linux
root@projectvm:~# qm start 106
no efidisk configured! Using temporary efivars disk.
root@projectvm:~# <-- Working again !

Some configuration files :

Bash:
root@projectvm:~# cat /proc/cmdline
initrd=\EFI\proxmox\5.13.19-6-pve\initrd.img-5.13.19-6-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs video=efifb:off intel_iommu=on irqpoll
root@projectvm:~# cat /proc/cmdline
initrd=\EFI\proxmox\5.15.30-2-pve\initrd.img-5.15.30-2-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs video=efifb:off intel_iommu=on irqpoll
root@projectvm:~# cat /proc/cmdline
initrd=\EFI\proxmox\5.15.35-1-pve\initrd.img-5.15.35-1-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs video=efifb:off intel_iommu=on irqpoll

(using 5.15.35-1-pve)
Bash:
root@projectvm:~# dmesg | grep -e DMAR -e IOMMU
[    0.020385] ACPI: DMAR 0x0000000069800DF8 000270 (v01 DELL\x CBX3     00000001 INTL 20091013)
[    0.020470] ACPI: Reserving DMAR table memory at [mem 0x69800df8-0x69801067]
[    0.160165] DMAR: IOMMU enabled
[    0.376725] DMAR: Host address width 46
[    0.376727] DMAR: DRHD base: 0x000000d37fc000 flags: 0x0
[    0.376736] DMAR: dmar0: reg_base_addr d37fc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376742] DMAR: DRHD base: 0x000000e0ffc000 flags: 0x0
[    0.376748] DMAR: dmar1: reg_base_addr e0ffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376753] DMAR: DRHD base: 0x000000ee7fc000 flags: 0x0
[    0.376758] DMAR: dmar2: reg_base_addr ee7fc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376763] DMAR: DRHD base: 0x000000fbffc000 flags: 0x0
[    0.376768] DMAR: dmar3: reg_base_addr fbffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376772] DMAR: DRHD base: 0x000000a0ffc000 flags: 0x0
[    0.376777] DMAR: dmar4: reg_base_addr a0ffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376781] DMAR: DRHD base: 0x000000a47fc000 flags: 0x0
[    0.376791] DMAR: dmar5: reg_base_addr a47fc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376796] DMAR: DRHD base: 0x000000c5ffc000 flags: 0x0
[    0.376801] DMAR: dmar6: reg_base_addr c5ffc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376805] DMAR: DRHD base: 0x000000a0bfc000 flags: 0x1
[    0.376810] DMAR: dmar7: reg_base_addr a0bfc000 ver 1:0 cap 8d2078c106f0466 ecap f020df
[    0.376814] DMAR: RMRR base: 0x0000006bc56000 end: 0x0000006bc66fff
[    0.376818] DMAR: ATSR flags: 0x0
[    0.376821] DMAR: ATSR flags: 0x0
[    0.376823] DMAR: RHSA base: 0x000000a0bfc000 proximity domain: 0x0
[    0.376827] DMAR: RHSA base: 0x000000a0ffc000 proximity domain: 0x0
[    0.376829] DMAR: RHSA base: 0x000000a47fc000 proximity domain: 0x0
[    0.376832] DMAR: RHSA base: 0x000000c5ffc000 proximity domain: 0x0
[    0.376835] DMAR: RHSA base: 0x000000d37fc000 proximity domain: 0x1
[    0.376837] DMAR: RHSA base: 0x000000e0ffc000 proximity domain: 0x1
[    0.376840] DMAR: RHSA base: 0x000000ee7fc000 proximity domain: 0x1
[    0.376842] DMAR: RHSA base: 0x000000fbffc000 proximity domain: 0x1
[    0.376847] DMAR-IR: IOAPIC id 12 under DRHD base  0xc5ffc000 IOMMU 6
[    0.376851] DMAR-IR: IOAPIC id 11 under DRHD base  0xa47fc000 IOMMU 5
[    0.376854] DMAR-IR: IOAPIC id 10 under DRHD base  0xa0ffc000 IOMMU 4
[    0.376857] DMAR-IR: IOAPIC id 18 under DRHD base  0xfbffc000 IOMMU 3
[    0.376860] DMAR-IR: IOAPIC id 17 under DRHD base  0xee7fc000 IOMMU 2
[    0.376863] DMAR-IR: IOAPIC id 16 under DRHD base  0xe0ffc000 IOMMU 1
[    0.376866] DMAR-IR: IOAPIC id 15 under DRHD base  0xd37fc000 IOMMU 0
[    0.376869] DMAR-IR: IOAPIC id 8 under DRHD base  0xa0bfc000 IOMMU 7
[    0.376872] DMAR-IR: IOAPIC id 9 under DRHD base  0xa0bfc000 IOMMU 7
[    0.376875] DMAR-IR: HPET id 0 under DRHD base 0xa0bfc000
[    0.376879] DMAR-IR: Queued invalidation will be enabled to support x2apic and Intr-remapping.
[    0.379165] DMAR-IR: Enabled IRQ remapping in x2apic mode
[   34.087511] DMAR: No SATC found
[   34.087516] DMAR: dmar6: Using Queued invalidation
[   34.087522] DMAR: dmar5: Using Queued invalidation
[   34.087528] DMAR: dmar4: Using Queued invalidation
[   34.087547] DMAR: dmar3: Using Queued invalidation
[   34.087551] DMAR: dmar2: Using Queued invalidation
[   34.087555] DMAR: dmar1: Using Queued invalidation
[   34.087559] DMAR: dmar0: Using Queued invalidation
[   34.087571] DMAR: dmar7: Using Queued invalidation
[   34.180091] DMAR: Intel(R) Virtualization Technology for Directed I/O
[   41.831310] snd_emu10k1 0000:d6:00.0: non-passthrough IOMMU detected, widening DMA allocations

Bash:
root@projectvm:~# cat /etc/modules
vfio
vfio_iommu_type1
vfio_pci
vfio_virqfd

Bash:
root@projectvm:~# dmesg | grep 'remapping'
[    0.376879] DMAR-IR: Queued invalidation will be enabled to support x2apic and Intr-remapping.
[    0.379165] DMAR-IR: Enabled IRQ remapping in x2apic mode

Bash:
root@projectvm:~# cat /etc/modprobe.d/*
blacklist nouveau
blacklist nvidia

options kvm ignore_msrs=1

blacklist nvidiafb

options vfio-pci ids=10de:1b80,10de:10f0,10de:1eb0,10de:10f8,10de:1ad8,10de:1ad9,10de:1eb1,10de:10f8,10de:1ad8,10de:1ad9,10de:1cb3,10de:0fb9,10de:107d,10de:0e08 disable_vga=1


I know that using the pve-no-subscription repository is less tested and at our own risk, but I don't even know where I should start investigating to fix the problem.
At the moment I am continuing my tests on 5.13.19-6-pve but I would like to understand what is going on.
Any ideas ?

Many thanks !
 
Last edited:
Here is some additional configuration :

IOMMU groups are working :
Bash:
root@projectvm:~# find /sys/kernel/iommu_groups/ -type l
/sys/kernel/iommu_groups/108/devices/0000:a5:0f.0
/sys/kernel/iommu_groups/108/devices/0000:a5:0f.1
/sys/kernel/iommu_groups/55/devices/0000:44:0a.6
/sys/kernel/iommu_groups/136/devices/0000:d1:0c.6
/sys/kernel/iommu_groups/83/devices/0000:72:12.2
/sys/kernel/iommu_groups/83/devices/0000:72:12.1
/sys/kernel/iommu_groups/17/devices/0000:00:16.2
/sys/kernel/iommu_groups/17/devices/0000:00:16.0
/sys/kernel/iommu_groups/17/devices/0000:00:16.3
/sys/kernel/iommu_groups/45/devices/0000:44:05.4
/sys/kernel/iommu_groups/126/devices/0000:d1:0b.0
/sys/kernel/iommu_groups/73/devices/0000:45:00.1
/sys/kernel/iommu_groups/73/devices/0000:45:00.0
/sys/kernel/iommu_groups/154/devices/0000:d6:00.0
/sys/kernel/iommu_groups/154/devices/0000:d5:00.0
/sys/kernel/iommu_groups/35/devices/0000:16:08.6
/sys/kernel/iommu_groups/35/devices/0000:16:08.4
/sys/kernel/iommu_groups/35/devices/0000:16:08.2
/sys/kernel/iommu_groups/35/devices/0000:16:08.0
/sys/kernel/iommu_groups/35/devices/0000:16:08.7
/sys/kernel/iommu_groups/35/devices/0000:16:08.5
/sys/kernel/iommu_groups/35/devices/0000:16:08.3
/sys/kernel/iommu_groups/35/devices/0000:16:08.1
/sys/kernel/iommu_groups/7/devices/0000:00:04.6
/sys/kernel/iommu_groups/116/devices/0000:d1:08.0
/sys/kernel/iommu_groups/63/devices/0000:44:0c.2
/sys/kernel/iommu_groups/144/devices/0000:d4:05.2
/sys/kernel/iommu_groups/91/devices/0000:a0:04.4
/sys/kernel/iommu_groups/25/devices/0000:02:00.0
/sys/kernel/iommu_groups/25/devices/0000:02:00.1
/sys/kernel/iommu_groups/106/devices/0000:a5:09.0
/sys/kernel/iommu_groups/106/devices/0000:a5:09.1
/sys/kernel/iommu_groups/53/devices/0000:44:0a.4
/sys/kernel/iommu_groups/134/devices/0000:d1:0c.4
/sys/kernel/iommu_groups/81/devices/0000:72:0f.1
/sys/kernel/iommu_groups/15/devices/0000:00:11.5
/sys/kernel/iommu_groups/15/devices/0000:00:11.0
/sys/kernel/iommu_groups/43/devices/0000:44:05.0
/sys/kernel/iommu_groups/124/devices/0000:d1:0a.6
/sys/kernel/iommu_groups/71/devices/0000:44:0d.2
/sys/kernel/iommu_groups/152/devices/0000:d4:15.0
/sys/kernel/iommu_groups/152/devices/0000:d4:15.1
/sys/kernel/iommu_groups/33/devices/0000:16:05.2
/sys/kernel/iommu_groups/5/devices/0000:00:04.4
/sys/kernel/iommu_groups/114/devices/0000:d1:05.4
/sys/kernel/iommu_groups/61/devices/0000:44:0c.0
/sys/kernel/iommu_groups/142/devices/0000:d4:00.0
/sys/kernel/iommu_groups/23/devices/0000:00:1f.2
/sys/kernel/iommu_groups/23/devices/0000:00:1f.0
/sys/kernel/iommu_groups/23/devices/0000:00:1f.5
/sys/kernel/iommu_groups/23/devices/0000:00:1f.6
/sys/kernel/iommu_groups/23/devices/0000:00:1f.4
/sys/kernel/iommu_groups/104/devices/0000:a5:05.4
/sys/kernel/iommu_groups/51/devices/0000:44:0a.2
/sys/kernel/iommu_groups/132/devices/0000:d1:0c.2
/sys/kernel/iommu_groups/13/devices/0000:00:08.1
/sys/kernel/iommu_groups/41/devices/0000:17:00.0
/sys/kernel/iommu_groups/122/devices/0000:d1:0a.4
/sys/kernel/iommu_groups/150/devices/0000:d4:12.0
/sys/kernel/iommu_groups/98/devices/0000:a0:08.0
/sys/kernel/iommu_groups/31/devices/0000:16:00.0
/sys/kernel/iommu_groups/3/devices/0000:00:04.2
/sys/kernel/iommu_groups/112/devices/0000:d1:05.0
/sys/kernel/iommu_groups/140/devices/0000:d1:0d.2
/sys/kernel/iommu_groups/88/devices/0000:a0:04.1
/sys/kernel/iommu_groups/21/devices/0000:00:1c.7
/sys/kernel/iommu_groups/102/devices/0000:a5:05.0
/sys/kernel/iommu_groups/130/devices/0000:d1:0c.0
/sys/kernel/iommu_groups/78/devices/0000:72:0e.0
/sys/kernel/iommu_groups/11/devices/0000:00:05.4
/sys/kernel/iommu_groups/120/devices/0000:d1:0a.2
/sys/kernel/iommu_groups/68/devices/0000:44:0c.7
/sys/kernel/iommu_groups/149/devices/0000:d4:0f.1
/sys/kernel/iommu_groups/96/devices/0000:a0:05.2
/sys/kernel/iommu_groups/1/devices/0000:00:04.0
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.2
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.0
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.5
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.3
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.1
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.6
/sys/kernel/iommu_groups/110/devices/0000:a5:1e.4
/sys/kernel/iommu_groups/58/devices/0000:44:0b.1
/sys/kernel/iommu_groups/139/devices/0000:d1:0d.1
/sys/kernel/iommu_groups/86/devices/0000:73:00.3
/sys/kernel/iommu_groups/86/devices/0000:73:00.1
/sys/kernel/iommu_groups/86/devices/0000:73:00.2
/sys/kernel/iommu_groups/86/devices/0000:73:00.0
/sys/kernel/iommu_groups/100/devices/0000:a0:08.2
/sys/kernel/iommu_groups/48/devices/0000:44:09.0
/sys/kernel/iommu_groups/129/devices/0000:d1:0b.3
/sys/kernel/iommu_groups/76/devices/0000:72:05.2
/sys/kernel/iommu_groups/38/devices/0000:16:0f.1
/sys/kernel/iommu_groups/38/devices/0000:16:0f.0
/sys/kernel/iommu_groups/119/devices/0000:d1:0a.1
/sys/kernel/iommu_groups/66/devices/0000:44:0c.5
/sys/kernel/iommu_groups/147/devices/0000:d4:0e.1
/sys/kernel/iommu_groups/94/devices/0000:a0:04.7
/sys/kernel/iommu_groups/28/devices/0000:04:00.1
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.2
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.0
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.3
/sys/kernel/iommu_groups/109/devices/0000:a5:1d.1
/sys/kernel/iommu_groups/56/devices/0000:44:0a.7
/sys/kernel/iommu_groups/137/devices/0000:d1:0c.7
/sys/kernel/iommu_groups/84/devices/0000:72:15.0
/sys/kernel/iommu_groups/84/devices/0000:72:15.1
/sys/kernel/iommu_groups/18/devices/0000:00:17.0
/sys/kernel/iommu_groups/46/devices/10000:00:03.0
/sys/kernel/iommu_groups/46/devices/0000:44:05.5
/sys/kernel/iommu_groups/46/devices/10000:00:02.0
/sys/kernel/iommu_groups/127/devices/0000:d1:0b.1
/sys/kernel/iommu_groups/74/devices/0000:72:00.0
/sys/kernel/iommu_groups/36/devices/0000:16:09.0
/sys/kernel/iommu_groups/36/devices/0000:16:09.1
/sys/kernel/iommu_groups/8/devices/0000:00:04.7
/sys/kernel/iommu_groups/117/devices/0000:d1:09.0
/sys/kernel/iommu_groups/64/devices/0000:44:0c.3
/sys/kernel/iommu_groups/145/devices/0000:d4:05.4
/sys/kernel/iommu_groups/92/devices/0000:a0:04.5
/sys/kernel/iommu_groups/26/devices/0000:03:00.0
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.6
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.4
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.2
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.0
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.7
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.5
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.3
/sys/kernel/iommu_groups/107/devices/0000:a5:0e.1
/sys/kernel/iommu_groups/54/devices/0000:44:0a.5
/sys/kernel/iommu_groups/135/devices/0000:d1:0c.5
/sys/kernel/iommu_groups/82/devices/0000:72:12.0
/sys/kernel/iommu_groups/16/devices/0000:00:14.2
/sys/kernel/iommu_groups/16/devices/0000:00:14.0
/sys/kernel/iommu_groups/44/devices/0000:44:05.2
/sys/kernel/iommu_groups/125/devices/0000:d1:0a.7
/sys/kernel/iommu_groups/72/devices/0000:44:0d.3
/sys/kernel/iommu_groups/153/devices/0000:d4:16.5
/sys/kernel/iommu_groups/153/devices/0000:d4:16.1
/sys/kernel/iommu_groups/153/devices/0000:d4:16.4
/sys/kernel/iommu_groups/153/devices/0000:d4:16.0
/sys/kernel/iommu_groups/34/devices/0000:16:05.4
/sys/kernel/iommu_groups/6/devices/0000:00:04.5
/sys/kernel/iommu_groups/115/devices/10001:00:00.0
/sys/kernel/iommu_groups/115/devices/0000:d1:05.5
/sys/kernel/iommu_groups/115/devices/10001:00:01.0
/sys/kernel/iommu_groups/62/devices/0000:44:0c.1
/sys/kernel/iommu_groups/143/devices/0000:d4:05.0
/sys/kernel/iommu_groups/90/devices/0000:a0:04.3
/sys/kernel/iommu_groups/24/devices/0000:00:1f.3
/sys/kernel/iommu_groups/105/devices/0000:a5:08.3
/sys/kernel/iommu_groups/105/devices/0000:a5:08.1
/sys/kernel/iommu_groups/105/devices/0000:a5:08.6
/sys/kernel/iommu_groups/105/devices/0000:a5:08.4
/sys/kernel/iommu_groups/105/devices/0000:a5:08.2
/sys/kernel/iommu_groups/105/devices/0000:a5:08.0
/sys/kernel/iommu_groups/105/devices/0000:a5:08.7
/sys/kernel/iommu_groups/105/devices/0000:a5:08.5
/sys/kernel/iommu_groups/52/devices/0000:44:0a.3
/sys/kernel/iommu_groups/133/devices/0000:d1:0c.3
/sys/kernel/iommu_groups/80/devices/0000:72:0f.0
/sys/kernel/iommu_groups/14/devices/0000:00:08.2
/sys/kernel/iommu_groups/42/devices/0000:44:00.0
/sys/kernel/iommu_groups/123/devices/0000:d1:0a.5
/sys/kernel/iommu_groups/70/devices/0000:44:0d.1
/sys/kernel/iommu_groups/151/devices/0000:d4:12.2
/sys/kernel/iommu_groups/151/devices/0000:d4:12.1
/sys/kernel/iommu_groups/99/devices/0000:a0:08.1
/sys/kernel/iommu_groups/32/devices/0000:16:05.0
/sys/kernel/iommu_groups/4/devices/0000:00:04.3
/sys/kernel/iommu_groups/113/devices/0000:d1:05.2
/sys/kernel/iommu_groups/60/devices/0000:44:0b.3
/sys/kernel/iommu_groups/141/devices/0000:d1:0d.3
/sys/kernel/iommu_groups/89/devices/0000:a0:04.2
/sys/kernel/iommu_groups/22/devices/0000:00:1d.0
/sys/kernel/iommu_groups/103/devices/0000:a5:05.2
/sys/kernel/iommu_groups/50/devices/0000:44:0a.1
/sys/kernel/iommu_groups/131/devices/0000:d1:0c.1
/sys/kernel/iommu_groups/79/devices/0000:72:0e.1
/sys/kernel/iommu_groups/12/devices/0000:00:08.0
/sys/kernel/iommu_groups/40/devices/0000:16:1e.5
/sys/kernel/iommu_groups/40/devices/0000:16:1e.3
/sys/kernel/iommu_groups/40/devices/0000:16:1e.1
/sys/kernel/iommu_groups/40/devices/0000:16:1e.6
/sys/kernel/iommu_groups/40/devices/0000:16:1e.4
/sys/kernel/iommu_groups/40/devices/0000:16:1e.2
/sys/kernel/iommu_groups/40/devices/0000:16:1e.0
/sys/kernel/iommu_groups/121/devices/0000:d1:0a.3
/sys/kernel/iommu_groups/69/devices/0000:44:0d.0
/sys/kernel/iommu_groups/97/devices/0000:a0:05.4
/sys/kernel/iommu_groups/30/devices/0000:04:00.3
/sys/kernel/iommu_groups/2/devices/0000:00:04.1
/sys/kernel/iommu_groups/111/devices/0000:a6:00.1
/sys/kernel/iommu_groups/111/devices/0000:a6:00.0
/sys/kernel/iommu_groups/59/devices/0000:44:0b.2
/sys/kernel/iommu_groups/87/devices/0000:a0:04.0
/sys/kernel/iommu_groups/20/devices/0000:00:1c.4
/sys/kernel/iommu_groups/101/devices/0000:a5:00.0
/sys/kernel/iommu_groups/49/devices/0000:44:0a.0
/sys/kernel/iommu_groups/77/devices/0000:72:05.4
/sys/kernel/iommu_groups/10/devices/0000:00:05.2
/sys/kernel/iommu_groups/39/devices/0000:16:1d.0
/sys/kernel/iommu_groups/39/devices/0000:16:1d.3
/sys/kernel/iommu_groups/39/devices/0000:16:1d.1
/sys/kernel/iommu_groups/39/devices/0000:16:1d.2
/sys/kernel/iommu_groups/67/devices/0000:44:0c.6
/sys/kernel/iommu_groups/148/devices/0000:d4:0f.0
/sys/kernel/iommu_groups/95/devices/0000:a0:05.0
/sys/kernel/iommu_groups/29/devices/0000:04:00.2
/sys/kernel/iommu_groups/0/devices/0000:00:00.0
/sys/kernel/iommu_groups/57/devices/0000:44:0b.0
/sys/kernel/iommu_groups/138/devices/0000:d1:0d.0
/sys/kernel/iommu_groups/85/devices/0000:72:16.5
/sys/kernel/iommu_groups/85/devices/0000:72:16.1
/sys/kernel/iommu_groups/85/devices/0000:72:16.4
/sys/kernel/iommu_groups/85/devices/0000:72:16.0
/sys/kernel/iommu_groups/19/devices/0000:00:1c.0
/sys/kernel/iommu_groups/47/devices/0000:44:08.0
/sys/kernel/iommu_groups/128/devices/0000:d1:0b.2
/sys/kernel/iommu_groups/75/devices/0000:72:05.0
/sys/kernel/iommu_groups/37/devices/0000:16:0e.2
/sys/kernel/iommu_groups/37/devices/0000:16:0e.0
/sys/kernel/iommu_groups/37/devices/0000:16:0e.7
/sys/kernel/iommu_groups/37/devices/0000:16:0e.5
/sys/kernel/iommu_groups/37/devices/0000:16:0e.3
/sys/kernel/iommu_groups/37/devices/0000:16:0e.1
/sys/kernel/iommu_groups/37/devices/0000:16:0e.6
/sys/kernel/iommu_groups/37/devices/0000:16:0e.4
/sys/kernel/iommu_groups/9/devices/0000:00:05.0
/sys/kernel/iommu_groups/118/devices/0000:d1:0a.0
/sys/kernel/iommu_groups/65/devices/0000:44:0c.4
/sys/kernel/iommu_groups/146/devices/0000:d4:0e.0
/sys/kernel/iommu_groups/93/devices/0000:a0:04.6
/sys/kernel/iommu_groups/27/devices/0000:04:00.0

vfio-pci driver is in use for the RTX4000 :
Bash:
root@projectvm:~# lspci -v -n -s 73:00
0000:73:00.0 0300: 10de:1eb1 (rev a1) (prog-if 00 [VGA controller])
        Subsystem: 1028:12a0
        Flags: bus master, fast devsel, latency 0, IRQ 74, NUMA node 0, IOMMU group 86
        Memory at c4000000 (32-bit, non-prefetchable) [size=16M]
        Memory at b0000000 (64-bit, prefetchable) [size=256M]
        Memory at c0000000 (64-bit, prefetchable) [size=32M]
        I/O ports at 9000 [size=128]
        Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: [60] Power Management version 3
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Legacy Endpoint, MSI 00
        Capabilities: [100] Virtual Channel
        Capabilities: [250] Latency Tolerance Reporting
        Capabilities: [258] L1 PM Substates
        Capabilities: [128] Power Budgeting <?>
        Capabilities: [420] Advanced Error Reporting
        Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
        Capabilities: [900] Secondary PCI Express
        Capabilities: [bb0] Physical Resizable BAR
        Kernel driver in use: vfio-pci
        Kernel modules: nvidiafb, nouveau

0000:73:00.1 0403: 10de:10f8 (rev a1)
        Subsystem: 1028:12a0
        Flags: bus master, fast devsel, latency 0, IRQ 75, NUMA node 0, IOMMU group 86
        Memory at c5080000 (32-bit, non-prefetchable) [size=16K]
        Capabilities: [60] Power Management version 3
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Endpoint, MSI 00
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: vfio-pci
        Kernel modules: snd_hda_intel

0000:73:00.2 0c03: 10de:1ad8 (rev a1) (prog-if 30 [XHCI])
        Subsystem: 1028:12a0
        Flags: fast devsel, IRQ 191, NUMA node 0, IOMMU group 86
        Memory at c2000000 (64-bit, prefetchable) [size=256K]
        Memory at c2040000 (64-bit, prefetchable) [size=64K]
        Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Endpoint, MSI 00
        Capabilities: [b4] Power Management version 3
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: vfio-pci
        Kernel modules: xhci_pci

0000:73:00.3 0c80: 10de:1ad9 (rev a1)
        Subsystem: 1028:12a0
        Flags: bus master, fast devsel, latency 0, IRQ 164, NUMA node 0, IOMMU group 86
        Memory at c5084000 (32-bit, non-prefetchable) [size=4K]
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Endpoint, MSI 00
        Capabilities: [b4] Power Management version 3
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: vfio-pci
        Kernel modules: i2c_nvidia_gpu

Bash:
root@projectvm:~# cat /etc/pve/qemu-server/106.conf
bios: ovmf
boot: order=net0;scsi0;ide2
cores: 4
cpu: host
hostpci0: 0000:73:00,pcie=1
hostpci1: 0000:a6:00,pcie=1
ide2: none,media=cdrom
kvm: 1
machine: q35
memory: 4096
meta: creation-qemu=6.1.1,ctime=1647391643
name: 99test-pcie-passthrough
net0: virtio=9E:69:2A:1F:A1:14,bridge=vmbr0,firewall=1
numa: 0
ostype: l26
scsi0: local-zfs:vm-106-disk-0,backup=0,discard=on,size=2007048K,ssd=1
scsihw: virtio-scsi-pci
smbios1: uuid=4c2b6f45-2e8e-42b5-8402-9b274a387f1a
sockets: 1
vmgenid: 9ba0a9d1-bb5e-49dc-b190-356b7e5bbcec
 
Last edited:
Multiple people are having issues with kernel 5.15 (since PVE 7.2 and simplefb). Please have a look at those other threads on this forum for potential work-arounds, such as virtually disconecting the boot GPU and rescanning the PCI bus. Apologies for not collecting links to those threads for you,.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!