Problems with GPU Passthrough since 8.2

merasil

Member
Mar 9, 2020
Hi,

GPU passthrough has stopped working for me since updating to 8.2. The VM that the GPU is passed through to no longer starts and fails with the following error message:

Code:
kvm: -device vfio-pci,host=0000:06:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,x-vga=on,multifunction=on: vfio 0000:06:00.0: failed to setup container for group 20: Failed to set group container: Invalid argument
TASK ERROR: start failed: QEMU exited with code 1

I have these logs on the host:

Code:
root@hvs:~# dmesg | grep -e DMAR -e IOMMU -e AMD-Vi
[    0.512344] AMD-Vi: Using global IVHD EFR:0xf77ef22294ada, EFR2:0x0
[    1.044799] pci 0000:00:00.2: AMD-Vi: IOMMU performance counters supported
[    1.046587] AMD-Vi: Extended features (0xf77ef22294ada, 0x0): PPR NX GT IA GA PC GA_vAPIC
[    1.046600] AMD-Vi: Interrupt remapping enabled
[    1.046759] AMD-Vi: Virtual APIC enabled
[    1.046912] perf/amd_iommu: Detected AMD IOMMU #0 (2 banks, 4 counters/bank).
[   59.575729] vfio-pci 0000:06:00.0: Firmware has requested this device have a 1:1 IOMMU mapping, rejecting configuring the device without a 1:1 mapping. Contact your platform vendor.

Code:
root@hvs:~# lsmod | grep vfio
vfio_pci               16384  0
vfio_pci_core          86016  1 vfio_pci
irqbypass              12288  2 vfio_pci_core,kvm
vfio_iommu_type1       49152  0
vfio                   69632  3 vfio_pci_core,vfio_iommu_type1,vfio_pci
iommufd                98304  1 vfio

Code:
root@hvs:~# lspci -k
06:00.0 VGA compatible controller: NVIDIA Corporation TU117GL [T400 4GB] (rev a1)
        Subsystem: Lenovo TU117GL [T400 4GB]
        Kernel driver in use: vfio-pci
        Kernel modules: nvidiafb, nouveau
06:00.1 Audio device: NVIDIA Corporation Device 10fa (rev a1)
        Subsystem: Lenovo Device 1613
        Kernel driver in use: vfio-pci
        Kernel modules: snd_hda_intel

I have not changed anything in the BIOS of the host. IOMMU also seems to be enabled. How can I debug the whole thing further?
 
As this does not seem to be widespread, I guess it might be because we are running AMD CPUs instead of Intel.
Perhaps the newer kernel code is not compatible with AMD.
 
I have the same problem. I was trying to pass through my NVIDIA GTX 1060 6GB to my Windows 11 VM and got the following error message:

Code:
swtpm_setup: Not overwriting existing state file.
kvm: -device vfio-pci,host=0000:0b:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: vfio 0000:0b:00.0: failed to setup container for group 32: Failed to set group container: Invalid argument
stopping swtpm instance (pid 3367) due to QEMU startup error
TASK ERROR: start failed: QEMU exited with code 1

CPU: Intel(R) Xeon(R) CPU E5530
GPU: NVIDIA GeForce GTX 1060 6GB
RAM: 16 GB ECC RDIMM
Boot Mode: Legacy BIOS (maybe that's the problem?)
Proxmox version: 8.2.2

Any help would be appreciated. I'm also quite new to Proxmox and Linux in general, so maybe I just misconfigured something?
 
I'm using a 5700X. After enabling the IOMMU-related settings in the BIOS (IOMMU, AES and ACR) and checking that my GPU is in a separate group, the command dmesg | grep -e DMAR -e IOMMU -e AMD-Vi just returns nothing. Any idea?
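
The errors in this thread name specific IOMMU groups (group 20, group 32), so it may help to see exactly which devices share a group with the GPU. A minimal sketch, assuming the standard Linux sysfs layout under /sys/kernel/iommu_groups; the function name list_iommu_groups and its optional root-directory argument are only illustrative and not part of any Proxmox tool:

```shell
#!/usr/bin/env bash
# Print one line per PCI device together with its IOMMU group number.
# The optional first argument (an assumption, added for testing) overrides
# the sysfs root; on a real host the default path is used.
list_iommu_groups() {
    local root="${1:-/sys/kernel/iommu_groups}"
    local dev group
    for dev in "$root"/*/devices/*; do
        # If the glob matched nothing, the IOMMU is likely disabled or unsupported.
        [ -e "$dev" ] || continue
        group="${dev%/devices/*}"   # strip the trailing "/devices/<address>"
        group="${group##*/}"        # keep only the group number
        printf 'group %s: %s\n' "$group" "${dev##*/}"
    done
}

list_iommu_groups
```

If this prints nothing at all, the kernel has not created any IOMMU groups, which would be consistent with dmesg showing no DMAR or AMD-Vi lines. If the GPU appears in a group together with unrelated devices, that is worth mentioning when asking for help.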
 
