I am at a loss on this one. HELP!
I have 2 GPU in the system, an AMD RX580 that passes through no problem to Win 10 and a second Nvidia 1660Ti that I can not get to work in a separate Win 11 VM. I use proxmox-boot-tool and have tried pinning kernel 5.15.39-4-pve in addition to using the 5.18.19 edge kernel - same result.
The REALLY weird part is I cannot even RDP into the VM (114) as it does not present any IP to my network after the drivers are installed - I have connection right up until the Nvidia USB drivers install - then it goes out. Here are my settings. Let me know if there is anything else I can supply - this SHOULD be working...
cat /etc/kernel/cmdline
root=ZFS=rpool/ROOT/pve-1 boot=zfs video=vesafb
ff video=efifb
ff video=simplefb
ff initcall_blacklist=sysfb_init quiet intel_iommu=on iommu=pt
cat /proc/cmdline
initrd=\EFI\proxmox\5.15.39-4-pve\initrd.img-5.15.39-4-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs video=vesafb
ff video=efifb
ff video=simplefb
ff initcall_blacklist=sysfb_init quiet intel_iommu=on iommu=pt
cat /etc/modprobe.d/vfio.conf
options vfio-pci ids=1002:67df,1002:aaf0,10de:2182,10de:1aeb,10de:1aec,10de:1aed disable_vga=1
/etc/modprobe.d/blacklist.conf
blacklist amdgpu
blacklist nouveau
blacklist radeon
blacklist nvidia
blacklist i2c_nvidia_gpu
blacklist nvidiafb
cat /etc/modules
coretemp
nct6775
vfio
vfio_iommu_type1
vfio_pci
vfio_virqfd
vhost-net
/etc/pve/qemu-server/114.conf
agent: 1
bios: ovmf
boot: order=scsi0;ide2
cores: 4
cpu: host,hidden=1,flags=+pcid
efidisk0: vm-disks1:vm-114-disk-0,efitype=4m,pre-enrolled-keys=1,size=1M
hostpci0: 03:00,pcie=1,x-vga=1
ide0: none,media=cdrom
ide2: none,media=cdrom
machine: pc-q35-7.0
memory: 4096
meta: creation-qemu=7.0.0,ctime=1663733622
name: Win11-1660
net0: virtio=56:60:BC:EF:C5:16,bridge=vmbr0,firewall=1
numa: 0
ostype: win11
scsi0: vm-disks1:vm-114-disk-1,cache=unsafe,discard=on,iothread=1,size=128G,ssd=1
scsihw: virtio-scsi-single
smbios1: uuid=ada83453-f7a1-40fd-aac0-4f8fde22c2fa
sockets: 1
tpmstate0: vm-disks1:vm-114-disk-2,size=4M,version=v2.0
vga: none
vmgenid: 7d5f2532-1049-4fdd-a1f8-1e7a99c82458
grep vfio /var/log/syslog (since last reboot and VM start)
Sep 21 17:19:03 pve kernel: [ 231.775896] vfio-pci 0000:03:00.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=io+mem
wns=none
Sep 21 17:19:03 pve kernel: [ 231.855280] vfio-pci 0000:03:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none
wns=none
Sep 21 17:20:44 pve kernel: [ 333.263601] vfio-pci 0000:81:00.0: enabling device (0000 -> 0003)
Sep 21 17:20:44 pve kernel: [ 333.263950] vfio-pci 0000:81:00.0: vfio_ecap_init: hiding ecap 0x19@0x270
Sep 21 17:20:44 pve kernel: [ 333.263960] vfio-pci 0000:81:00.0: vfio_ecap_init: hiding ecap 0x1b@0x2d0
Sep 21 17:20:44 pve kernel: [ 333.263966] vfio-pci 0000:81:00.0: vfio_ecap_init: hiding ecap 0x1e@0x370
Sep 21 17:20:44 pve kernel: [ 333.282623] vfio-pci 0000:81:00.1: enabling device (0000 -> 0002)
Sep 21 17:20:50 pve kernel: [ 338.798159] vfio-pci 0000:03:00.0: enabling device (0000 -> 0003)
Sep 21 17:20:50 pve kernel: [ 338.902762] vfio-pci 0000:03:00.0: vfio_ecap_init: hiding ecap 0x1e@0x258
Sep 21 17:20:50 pve kernel: [ 338.902781] vfio-pci 0000:03:00.0: vfio_ecap_init: hiding ecap 0x19@0x900
Sep 21 17:27:35 pve kernel: [ 744.054038] vfio-pci 0000:03:00.0: vfio_ecap_init: hiding ecap 0x1e@0x258
Sep 21 17:27:35 pve kernel: [ 744.054058] vfio-pci 0000:03:00.0: vfio_ecap_init: hiding ecap 0x19@0x900
Sep 21 17:30:40 pve kernel: [ 928.926154] vfio-pci 0000:03:00.0: vfio_ecap_init: hiding ecap 0x1e@0x258
Sep 21 17:30:40 pve kernel: [ 928.926174] vfio-pci 0000:03:00.0: vfio_ecap_init: hiding ecap 0x19@0x900
lspci -nnk (problem GPU)
03:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU116 [GeForce GTX 1660 Ti] [10de:2182] (rev a1)
Subsystem: Gigabyte Technology Co., Ltd TU116 [GeForce GTX 1660 Ti] [1458:3fbe]
Kernel driver in use: vfio-pci
Kernel modules: nvidiafb, nouveau
03:00.1 Audio device [0403]: NVIDIA Corporation TU116 High Definition Audio Controller [10de:1aeb] (rev a1)
Subsystem: Gigabyte Technology Co., Ltd TU116 High Definition Audio Controller [1458:3fbe]
Kernel driver in use: vfio-pci
Kernel modules: snd_hda_intel
03:00.2 USB controller [0c03]: NVIDIA Corporation TU116 USB 3.1 Host Controller [10de:1aec] (rev a1)
Subsystem: Gigabyte Technology Co., Ltd TU116 USB 3.1 Host Controller [1458:3fbe]
Kernel driver in use: vfio-pci
Kernel modules: xhci_pci
03:00.3 Serial bus controller [0c80]: NVIDIA Corporation TU116 USB Type-C UCSI Controller [10de:1aed] (rev a1)
Subsystem: Gigabyte Technology Co., Ltd TU116 USB Type-C UCSI Controller [1458:3fbe]
Kernel driver in use: vfio-pci
Kernel modules: i2c_nvidia_gpu
(good GPU)
81:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X/590] [1002:67df] (rev e7)
Subsystem: Sapphire Technology Limited Radeon RX 570 Pulse 4GB [1da2:e353]
Kernel driver in use: vfio-pci
Kernel modules: amdgpu
81:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590] [1002:aaf0]
Subsystem: Sapphire Technology Limited Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590] [1da2:aaf0]
Kernel driver in use: vfio-pci
Kernel modules: snd_hda_intel
dmesg | grep -e DMAR -e IOMMU
[ 0.015740] ACPI: DMAR 0x000000007DDEA580 000140 (v01 A M I OEMDMAR 00000001 INTL 00000001)
[ 0.015763] ACPI: Reserving DMAR table memory at [mem 0x7ddea580-0x7ddea6bf]
[ 0.197897] DMAR: IOMMU enabled
[ 0.470748] DMAR: Host address width 46
[ 0.470750] DMAR: DRHD base: 0x000000fbffe000 flags: 0x0
[ 0.470757] DMAR: dmar0: reg_base_addr fbffe000 ver 1:0 cap d2078c106f0466 ecap f020df
[ 0.470761] DMAR: DRHD base: 0x000000dfffc000 flags: 0x1
[ 0.470765] DMAR: dmar1: reg_base_addr dfffc000 ver 1:0 cap d2078c106f0466 ecap f020df
[ 0.470768] DMAR: RMRR base: 0x0000007f231000 end: 0x0000007f23efff
[ 0.470771] DMAR: ATSR flags: 0x0
[ 0.470772] DMAR: RHSA base: 0x000000fbffe000 proximity domain: 0x1
[ 0.470774] DMAR: RHSA base: 0x000000dfffc000 proximity domain: 0x0
[ 0.470778] DMAR-IR: IOAPIC id 3 under DRHD base 0xfbffe000 IOMMU 0
[ 0.470780] DMAR-IR: IOAPIC id 0 under DRHD base 0xdfffc000 IOMMU 1
[ 0.470782] DMAR-IR: IOAPIC id 2 under DRHD base 0xdfffc000 IOMMU 1
[ 0.470783] DMAR-IR: HPET id 0 under DRHD base 0xdfffc000
[ 0.470785] DMAR-IR: Queued invalidation will be enabled to support x2apic and Intr-remapping.
[ 0.471562] DMAR-IR: Enabled IRQ remapping in x2apic mode
[ 1.091764] DMAR: [Firmware Bug]: RMRR entry for device 03:00.2 is broken - applying workaround
[ 1.091802] DMAR: No SATC found
[ 1.091807] DMAR: dmar0: Using Queued invalidation
[ 1.091813] DMAR: dmar1: Using Queued invalidation
[ 1.096685] DMAR: Intel(R) Virtualization Technology for Directed I/O
cat /proc/iomem
00000000-00000fff : Reserved
00001000-0009ffff : System RAM
00000000-00000000 : PCI Bus 0000:00
000a0000-000dffff : PCI Bus 0000:00
000c0000-000c7fff : Video ROM
000f0000-000fffff : System ROM
00100000-733c4017 : System RAM
733c4018-733e0e57 : System RAM
733e0e58-733e1017 : System RAM
733e1018-733f1857 : System RAM
733f1858-733f2017 : System RAM
733f2018-73402857 : System RAM
73402858-73403017 : System RAM
73403018-7340b457 : System RAM
7340b458-7340c017 : System RAM
7340c018-7341c857 : System RAM
7341c858-7341d017 : System RAM
7341d018-7342d857 : System RAM
7342d858-7342e017 : System RAM
7342e018-7344d857 : System RAM
7344d858-7dca2fff : System RAM
7dca3000-7dcd2fff : Reserved
7dcd3000-7ddeafff : ACPI Tables
7ddeb000-7e012fff : ACPI Non-volatile Storage
7e013000-7f350fff : Reserved
7f351000-7f351fff : System RAM
7f352000-7f3d7fff : ACPI Non-volatile Storage
7f3d8000-7f7fffff : System RAM
7f800000-7fffffff : RAM buffer
80000000-dfffffff : PCI Bus 0000:00
80000000-8fffffff : PCI MMCONFIG 0000 [bus 00-ff]
80000000-8fffffff : Reserved
c0000000-d20fffff : PCI Bus 0000:03
c0000000-cfffffff : 0000:03:00.0
c0000000-cfffffff : vfio-pci
d0000000-d1ffffff : 0000:03:00.0
d0000000-d1ffffff : vfio-pci
d2000000-d203ffff : 0000:03:00.2
d2000000-d203ffff : vfio-pci
d2040000-d204ffff : 0000:03:00.2
d2040000-d204ffff : vfio-pci
dc000000-dd0fffff : PCI Bus 0000:08
dc000000-dd0fffff : PCI Bus 0000:09
dc000000-dcffffff : 0000:09:00.0
dd000000-dd01ffff : 0000:09:00.0
dd100000-dd103fff : 0000:00:04.7
dd100000-dd103fff : ioatdma
dd104000-dd107fff : 0000:00:04.6
dd104000-dd107fff : ioatdma
dd108000-dd10bfff : 0000:00:04.5
dd108000-dd10bfff : ioatdma
dd10c000-dd10ffff : 0000:00:04.4
dd10c000-dd10ffff : ioatdma
dd110000-dd113fff : 0000:00:04.3
dd110000-dd113fff : ioatdma
dd114000-dd117fff : 0000:00:04.2
dd114000-dd117fff : ioatdma
dd118000-dd11bfff : 0000:00:04.1
dd118000-dd11bfff : ioatdma
dd11c000-dd11ffff : 0000:00:04.0
dd11c000-dd11ffff : ioatdma
dd120000-dd1200ff : 0000:00:1f.3
dd121000-dd1217ff : 0000:00:1f.2
dd121000-dd1217ff : ahci
dd122000-dd1223ff : 0000:00:1d.0
dd122000-dd1223ff : ehci_hcd
dd123000-dd1233ff : 0000:00:1a.0
dd123000-dd1233ff : ehci_hcd
dd125000-dd12500f : 0000:00:16.1
dd126000-dd12600f : 0000:00:16.0
dd127000-dd127fff : 0000:00:05.4
de000000-df0fffff : PCI Bus 0000:03
de000000-deffffff : 0000:03:00.0
de000000-deffffff : vfio-pci
df000000-df07ffff : 0000:03:00.0
df080000-df083fff : 0000:03:00.1
df080000-df083fff : vfio-pci
df084000-df084fff : 0000:03:00.3
df084000-df084fff : vfio-pci
df200000-df3fffff : PCI Bus 0000:0c
df200000-df2fffff : 0000:0c:00.0
df200000-df2fffff : e1000e
df300000-df33ffff : 0000:0c:00.0
df340000-df35ffff : 0000:0c:00.0
df340000-df35ffff : e1000e
df360000-df363fff : 0000:0c:00.0
df360000-df363fff : e1000e
df400000-df5fffff : PCI Bus 0000:0b
df400000-df4fffff : 0000:0b:00.0
df400000-df4fffff : e1000e
df500000-df53ffff : 0000:0b:00.0
df540000-df55ffff : 0000:0b:00.0
df540000-df55ffff : e1000e
df560000-df563fff : 0000:0b:00.0
df560000-df563fff : e1000e
df600000-df7fffff : PCI Bus 0000:07
df600000-df6fffff : 0000:07:00.0
df600000-df6fffff : e1000e
df700000-df73ffff : 0000:07:00.0
df740000-df75ffff : 0000:07:00.0
df740000-df75ffff : e1000e
df760000-df763fff : 0000:07:00.0
df760000-df763fff : e1000e
df800000-df9fffff : PCI Bus 0000:06
df800000-df8fffff : 0000:06:00.0
df800000-df8fffff : e1000e
df900000-df93ffff : 0000:06:00.0
df940000-df95ffff : 0000:06:00.0
df940000-df95ffff : e1000e
df960000-df963fff : 0000:06:00.0
df960000-df963fff : e1000e
dfa00000-dfafffff : PCI Bus 0000:0a
dfa00000-dfa0ffff : 0000:0a:00.0
dfa10000-dfa107ff : 0000:0a:00.0
dfa10000-dfa107ff : ahci
dfb00000-dfbfffff : PCI Bus 0000:02
dfb00000-dfb7ffff : 0000:02:00.0
dfb80000-dfb81fff : 0000:02:00.0
dfb80000-dfb81fff : ahci
dfb82000-dfb83fff : 0000:02:00.0
dfb82000-dfb83fff : ahci
dfffc000-dfffcfff : dmar1
e0000000-fbffffff : PCI Bus 0000:80
e0000000-f01fffff : PCI Bus 0000:81
e0000000-efffffff : 0000:81:00.0
e0000000-efffffff : vfio-pci
f0000000-f01fffff : 0000:81:00.0
f0000000-f01fffff : vfio-pci
fbe00000-fbefffff : PCI Bus 0000:81
fbe00000-fbe3ffff : 0000:81:00.0
fbe00000-fbe3ffff : vfio-pci
fbe40000-fbe5ffff : 0000:81:00.0
fbe60000-fbe63fff : 0000:81:00.1
fbe60000-fbe63fff : vfio-pci
fbf00000-fbf03fff : 0000:80:04.7
fbf00000-fbf03fff : ioatdma
fbf04000-fbf07fff : 0000:80:04.6
fbf04000-fbf07fff : ioatdma
fbf08000-fbf0bfff : 0000:80:04.5
fbf08000-fbf0bfff : ioatdma
fbf0c000-fbf0ffff : 0000:80:04.4
fbf0c000-fbf0ffff : ioatdma
fbf10000-fbf13fff : 0000:80:04.3
fbf10000-fbf13fff : ioatdma
fbf14000-fbf17fff : 0000:80:04.2
fbf14000-fbf17fff : ioatdma
fbf18000-fbf1bfff : 0000:80:04.1
fbf18000-fbf1bfff : ioatdma
fbf1c000-fbf1ffff : 0000:80:04.0
fbf1c000-fbf1ffff : ioatdma
fbf20000-fbf20fff : 0000:80:05.4
fbffe000-fbffefff : dmar0
fc000000-fcffffff : pnp 00:00
fd000000-fdffffff : pnp 00:00
fe000000-feafffff : pnp 00:00
feb00000-febfffff : pnp 00:00
fec00000-fec003ff : IOAPIC 0
fec01000-fec013ff : IOAPIC 1
fec40000-fec403ff : IOAPIC 2
fed00000-fed003ff : HPET 0
fed00000-fed003ff : PNP0103:00
fed08000-fed08fff : pnp 00:07
fed1c000-fed3ffff : Reserved
fed1c000-fed1ffff : pnp 00:07
fed1f410-fed1f414 : iTCO_wdt.1.auto
fed45000-fedfffff : pnp 00:00
fee00000-fee00fff : Local APIC
ff000000-ffffffff : Reserved
ff000000-ffffffff : pnp 00:07
100000000-107fffffff : System RAM
e72600000-e736025c7 : Kernel code
e73800000-e741f4fff : Kernel rodata
e74200000-e7463da7f : Kernel data
e74982000-e74ffffff : Kernel bss