[SOLVED] Kernel 5.15 bricht GPU passthrough

dekoration

Renowned Member
Apr 16, 2016
9
3
68
34
Hallo Zusammen,

teste momentan etwas mit dem pve-kernel-5.15.30-1-pve herum.

Momentan scheint dieser ein Problem bei meinem passthrough Setup zu machen.
Unter den pve-kernel-5.13.19-6-pve: 5.13.19-15 und älter ist alles schickt nach dem ich ein unbind auf den Framebuffer gemacht habe.

echo efi-framebuffer.0 > /sys/bus/platform/drivers/efi-framebuffer/unbind

Bin aktuell etwas ratlos.


Fehler

kvm: -device vfio-pci,host=0000:08:00.0,id=hostpci0.0,bus=ich9-pcie-port-1,addr=0x0.0,multifunction=on: Failed to mmap 0000:08:00.0 BAR 1. Performance may be slow

pveversion -v

proxmox-ve: 7.1-1 (running kernel: 5.15.30-1-pve)
pve-manager: 7.1-12 (running version: 7.1-12/b3c09de3)
pve-kernel-helper: 7.1-14
pve-kernel-5.13: 7.1-9
pve-kernel-5.11: 7.0-10
pve-kernel-5.15.30-1-pve: 5.15.30-1
pve-kernel-5.13.19-6-pve: 5.13.19-15
pve-kernel-5.13.19-2-pve: 5.13.19-4
pve-kernel-5.11.22-7-pve: 5.11.22-12
pve-kernel-5.11.22-1-pve: 5.11.22-2
ceph-fuse: 15.2.13-pve1
corosync: 3.1.5-pve2
criu: 3.15-1+pve-1
glusterfs-client: 9.2-1
ifupdown2: 3.1.0-1+pmx3
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-1
libknet1: 1.22-pve2
libproxmox-acme-perl: 1.4.1
libproxmox-backup-qemu0: 1.2.0-1
libpve-access-control: 7.1-7
libpve-apiclient-perl: 3.2-1
libpve-common-perl: 7.1-5
libpve-guest-common-perl: 4.1-1
libpve-http-server-perl: 4.1-1
libpve-storage-perl: 7.1-1
libspice-server1: 0.14.3-2.1
lvm2: 2.03.11-2.1
lxc-pve: 4.0.11-1
lxcfs: 4.0.11-pve1
novnc-pve: 1.3.0-2
proxmox-backup-client: 2.1.5-1
proxmox-backup-file-restore: 2.1.5-1
proxmox-mini-journalreader: 1.3-1
proxmox-widget-toolkit: 3.4-7
pve-cluster: 7.1-3
pve-container: 4.1-4
pve-docs: 7.1-2
pve-edk2-firmware: 3.20210831-2
pve-firewall: 4.2-5
pve-firmware: 3.3-6
pve-ha-manager: 3.3-3
pve-i18n: 2.6-2
pve-qemu-kvm: 6.2.0-2
pve-xtermjs: 4.16.0-1
qemu-server: 7.1-4
smartmontools: 7.2-1
spiceterm: 3.2-2
swtpm: 0.7.1~bpo11+1
vncterm: 1.7-1
zfsutils-linux: 2.1.4-pve1

cat /etc/modprobe.d/*

# The microcode module attempts to apply a microcode update when
# it autoloads. This is not always safe, so we block it by default.
blacklist microcode
blacklist radeon
blacklist nouveau
blacklist nvidia
blacklist amdgpu
options vfio_iommu_type1 allow_unsafe_interrupts=1
options kvm-amd nested=1
# This file contains a list of modules which are not supported by Proxmox VE

# nidiafb see bugreport https://bugzilla.proxmox.com/show_bug.cgi?id=701
blacklist nvidiafb
options vfio-pci ids=10de:1f07,10de:10f9,10de:1ada,10de:1adb

cat /etc/default/grub

hierbei habe ich es auch schon mit dem video=simplefb:eek:ff herum getestet aber alles ohne Erfolg.

# If you change this file, run 'update-grub' afterwards to update
# /boot/grub/grub.cfg.
# For full documentation of the options in this file, see:
# info -f grub -n 'Simple configuration'

GRUB_DEFAULT=0
GRUB_TIMEOUT=5
GRUB_DISTRIBUTOR="Proxmox VE"
GRUB_CMDLINE_LINUX_DEFAULT="quiet amd_iommu=on iommu=pt video=simplefb:eek:ff video=efifb:eek:ff pcie_acs_override=downstream video=vesafb:eek:ff vga=off nofb nomodeset video=off"
GRUB_CMDLINE_LINUX="root=ZFS=rpool/ROOT/pve-1 boot=zfs"

# Disable os-prober, it might add menu entries for each guest
GRUB_DISABLE_OS_PROBER=true

# Uncomment to enable BadRAM filtering, modify to suit your needs
# This works with Linux (no patch required) and with any kernel that obtains
# the memory map information from GRUB (GNU Mach, kernel of FreeBSD ...)
#GRUB_BADRAM="0x01234567,0xfefefefe,0x89abcdef,0xefefefef"

# Uncomment to disable graphical terminal (grub-pc only)
#GRUB_TERMINAL=console

# The resolution used on graphical terminal
# note that you can use only modes which your graphic card supports via VBE
# you can see them in real GRUB with the command `vbeinfo'
#GRUB_GFXMODE=640x480

# Uncomment if you don't want GRUB to pass "root=UUID=xxx" parameter to Linux
#GRUB_DISABLE_LINUX_UUID=true

# Disable generation of recovery mode menu entries
GRUB_DISABLE_RECOVERY="true"

# Uncomment to get a beep at grub start
#GRUB_INIT_TUNE="480 440 1"

lspci (hab das auf die entsprechende Komponente gekürzt)

lspci -n
08:00.0 0300: 10de:1f07 (rev a1)
08:00.1 0403: 10de:10f9 (rev a1)
08:00.2 0c03: 10de:1ada (rev a1)
08:00.3 0c80: 10de:1adb (rev a1)

lspci -nv

08:00.0 0300: 10de:1f07 (rev a1) (prog-if 00 [VGA controller])
Subsystem: 1462:3732
Flags: fast devsel, IRQ 53, IOMMU group 14
Memory at d8000000 (32-bit, non-prefetchable) [size=16M]

Memory at c0000000 (64-bit, prefetchable) [size=256M]
Memory at d0000000 (64-bit, prefetchable) [size=32M]
I/O ports at 3000
Expansion ROM at d9000000 [disabled] [size=512K]

Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Legacy Endpoint, MSI 00
Capabilities: [100] Virtual Channel
Capabilities: [250] Latency Tolerance Reporting
Capabilities: [258] L1 PM Substates
Capabilities: [128] Power Budgeting <?>
Capabilities: [420] Advanced Error Reporting
Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
Capabilities: [900] Secondary PCI Express
Capabilities: [bb0] Physical Resizable BAR
Kernel driver in use: vfio-pci
Kernel modules: nvidiafb, nouveau

08:00.1 0403: 10de:10f9 (rev a1)
Subsystem: 1462:3732
Flags: fast devsel, IRQ 54, IOMMU group 14
Memory at d9080000 (32-bit, non-prefetchable) [size=16K]

Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Endpoint, MSI 00
Capabilities: [100] Advanced Error Reporting
Kernel driver in use: vfio-pci
Kernel modules: snd_hda_intel

08:00.2 0c03: 10de:1ada (rev a1) (prog-if 30 [XHCI])
Subsystem: 1462:3732
Flags: fast devsel, IRQ 63, IOMMU group 14
Memory at d2000000 (64-bit, prefetchable) [size=256K]

Memory at d2040000 (64-bit, prefetchable) [size=64K]
Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Endpoint, MSI 00
Capabilities: [b4] Power Management version 3
Capabilities: [100] Advanced Error Reporting
Kernel driver in use: vfio-pci
Kernel modules: xhci_pci

08:00.3 0c80: 10de:1adb (rev a1)
Subsystem: 1462:3732
Flags: fast devsel, IRQ 64, IOMMU group 14
Memory at d9084000 (32-bit, non-prefetchable) [size=4K]

Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Endpoint, MSI 00
Capabilities: [b4] Power Management version 3
Capabilities: [100] Advanced Error Reporting
Kernel driver in use: vfio-pci
Kernel modules: i2c_nvidia_gpu


find /sys/kernel/iommu_groups/ -type l

/sys/kernel/iommu_groups/17/devices/0000:09:00.3

/sys/kernel/iommu_groups/35/devices/0000:43:00.3

/sys/kernel/iommu_groups/7/devices/0000:00:07.1

/sys/kernel/iommu_groups/25/devices/0000:40:03.0

/sys/kernel/iommu_groups/15/devices/0000:09:00.0

/sys/kernel/iommu_groups/33/devices/0000:43:00.0

/sys/kernel/iommu_groups/5/devices/0000:00:04.0

/sys/kernel/iommu_groups/23/devices/0000:40:01.3

/sys/kernel/iommu_groups/13/devices/0000:02:07.0

/sys/kernel/iommu_groups/13/devices/0000:02:00.0

/sys/kernel/iommu_groups/13/devices/0000:01:00.2

/sys/kernel/iommu_groups/13/devices/0000:01:00.0

/sys/kernel/iommu_groups/13/devices/0000:02:06.0

/sys/kernel/iommu_groups/13/devices/0000:06:00.0

/sys/kernel/iommu_groups/13/devices/0000:02:05.0

/sys/kernel/iommu_groups/13/devices/0000:05:00.0

/sys/kernel/iommu_groups/13/devices/0000:01:00.1

/sys/kernel/iommu_groups/13/devices/0000:04:00.0

/sys/kernel/iommu_groups/13/devices/0000:02:04.0

/sys/kernel/iommu_groups/31/devices/0000:41:00.0

/sys/kernel/iommu_groups/3/devices/0000:00:03.0

/sys/kernel/iommu_groups/21/devices/0000:40:01.0

/sys/kernel/iommu_groups/11/devices/0000:00:18.3

/sys/kernel/iommu_groups/11/devices/0000:00:18.1

/sys/kernel/iommu_groups/11/devices/0000:00:18.6

/sys/kernel/iommu_groups/11/devices/0000:00:18.4

/sys/kernel/iommu_groups/11/devices/0000:00:18.2

/sys/kernel/iommu_groups/11/devices/0000:00:18.0

/sys/kernel/iommu_groups/11/devices/0000:00:18.7

/sys/kernel/iommu_groups/11/devices/0000:00:18.5

/sys/kernel/iommu_groups/1/devices/0000:00:01.1

/sys/kernel/iommu_groups/28/devices/0000:40:07.1

/sys/kernel/iommu_groups/18/devices/0000:0a:00.0

/sys/kernel/iommu_groups/36/devices/0000:44:00.0

/sys/kernel/iommu_groups/8/devices/0000:00:08.0

/sys/kernel/iommu_groups/26/devices/0000:40:04.0

/sys/kernel/iommu_groups/16/devices/0000:09:00.2

/sys/kernel/iommu_groups/34/devices/0000:43:00.2

/sys/kernel/iommu_groups/6/devices/0000:00:07.0

/sys/kernel/iommu_groups/24/devices/0000:40:02.0

/sys/kernel/iommu_groups/14/devices/0000:08:00.2

/sys/kernel/iommu_groups/14/devices/0000:08:00.0

/sys/kernel/iommu_groups/14/devices/0000:08:00.3

/sys/kernel/iommu_groups/14/devices/0000:08:00.1

/sys/kernel/iommu_groups/32/devices/0000:42:00.0

/sys/kernel/iommu_groups/32/devices/0000:42:00.1

/sys/kernel/iommu_groups/4/devices/0000:00:03.1

/sys/kernel/iommu_groups/22/devices/0000:40:01.2

/sys/kernel/iommu_groups/12/devices/0000:00:19.6

/sys/kernel/iommu_groups/12/devices/0000:00:19.4

/sys/kernel/iommu_groups/12/devices/0000:00:19.2

/sys/kernel/iommu_groups/12/devices/0000:00:19.0

/sys/kernel/iommu_groups/12/devices/0000:00:19.7

/sys/kernel/iommu_groups/12/devices/0000:00:19.5

/sys/kernel/iommu_groups/12/devices/0000:00:19.3

/sys/kernel/iommu_groups/12/devices/0000:00:19.1

/sys/kernel/iommu_groups/30/devices/0000:40:08.1

/sys/kernel/iommu_groups/2/devices/0000:00:02.0

/sys/kernel/iommu_groups/20/devices/0000:0a:00.3

/sys/kernel/iommu_groups/10/devices/0000:00:14.3

/sys/kernel/iommu_groups/10/devices/0000:00:14.0

/sys/kernel/iommu_groups/29/devices/0000:40:08.0

/sys/kernel/iommu_groups/0/devices/0000:00:01.0

/sys/kernel/iommu_groups/19/devices/0000:0a:00.2

/sys/kernel/iommu_groups/37/devices/0000:44:00.2

/sys/kernel/iommu_groups/9/devices/0000:00:08.1

/sys/kernel/iommu_groups/27/devices/0000:40:07.0

Hardware

CPU: AMD Ryzen Threadripper 2920X 12-Core Processor
GPU: NVIDIA GeForce RTX 2070
MB: ASRock X399 Taichi

VM Config

agent: 1
balloon: 0
bios: ovmf
boot: order=virtio0;ide2;net0;ide0
cores: 8
efidisk0: pve-storage-01:vm-100-disk-0,efitype=4m,pre-enrolled-keys=1,size=1M
hostpci0: 0000:08:00,pcie=1,x-vga=1
ide0: none,media=cdrom
ide2: none,media=cdrom
machine: pc-q35-6.2
memory: 16384
meta: creation-qemu=6.1.0,ctime=1637849064
name: gvm01
net0: virtio=xxxxxx,bridge=vmbr0,firewall=1
numa: 0
ostype: win11
scsihw: virtio-scsi-pci
smbios1: uuid=fff4d682-9a50-4abd-9b77-00150de125e4
sockets: 1
tpmstate0: pve-storage-01:vm-100-disk-1,size=4M,version=v2.0
vga: none
virtio0: pve-storage-01:vm-100-disk-2,discard=on,size=800G
vmgenid: xxxxx
 
Last edited:
Als work-a-round wurde die Grafikkarte mit einer Grafikkarte in einem anderen Slot getauscht. Nun liegt der BOOTFB auf der anderen Karte. Dies löst das Problem nicht direkt, aber die Funktionalität ist hergestellt.