Guys, i have no idea how to solve this. Already lost like 2 weeks.
Already fixed one side of issues by using ovmf bios but here is the second half mostly when sometimes guest shuts down.
I am passthrough RTX5090 and RTX PRO 6000 to linux VMs. And this is working fine (that would suggest no riser issues).
Then sometimes on guest shutdown i am getting those errors below.
Then only reboot of the whole host helps. Its like GPU goes into sleep or some other state and cant wake up or cant be seen.
Or probably there is some issue with vfio ?
I am getting two errors when this happens:
Errors:
"Unable to change power state from D3cold to D0, device inaccessible"
and
CPU BUG soft lockup
Software:
Proxmox 8.4
Motherboard GENOA2D24G-2L+
CPU: 2x AMD EPYC 9654 96-Core Processor
GPU: 5x RTX PRO 6000 blackwell and 6x RTX 5090
I really need to get this sorted and if anyone will help, i can pay for your time guys.
Beside configs below i also tried:
Setting 0 into each GPU at host boot: /sys/bus/pci/devices/0000:$id.0/d3cold_allowed
Disabled ESPM pci-e in bios so that it always stays on PCI-E 5.0 x16
Full errors:
[490431.407576] tap12850079i0: entered promiscuous mode
[490431.440282] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440291] wanbr: port 4(tap12850079i0) entered disabled state
[490431.440354] tap12850079i0: entered allmulticast mode
[490431.440608] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440611] wanbr: port 4(tap12850079i0) entered forwarding state
[490431.805032] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.818384] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.821151] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490432.869619] pcieport 0000:80:01.1: Data Link Layer Link Active not set in 1000 msec
[490432.874478] wanbr: port 4(tap12850079i0) entered disabled state
[490432.875371] tap12850079i0 (unregistering): left allmulticast mode


And then i see also CPU softlockup:

pveversion
uname
cat /etc/default/grub | grep LINUX_DEF
cat /etc/modprobe.d/vfio.conf
cat /etc/modprobe.d/blacklist-gpu.conf
lspci -vv | grep NVID
lspci -nnk | grep -iA3 nvidia
iommu
4G support: I do not have directly in bios that setting but i guess it works (it has 64bit so i guess it has that support)
My special command for checking link state of each GPU via riser and its speed and PCI-E gen
result:
BUS ID: 01:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 24:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 41:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 61:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 81:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
After this issue happens, i no longer can read device info:

Already fixed one side of issues by using ovmf bios but here is the second half mostly when sometimes guest shuts down.
I am passthrough RTX5090 and RTX PRO 6000 to linux VMs. And this is working fine (that would suggest no riser issues).
Then sometimes on guest shutdown i am getting those errors below.
Then only reboot of the whole host helps. Its like GPU goes into sleep or some other state and cant wake up or cant be seen.
Or probably there is some issue with vfio ?
I am getting two errors when this happens:
Errors:
"Unable to change power state from D3cold to D0, device inaccessible"
and
CPU BUG soft lockup
Software:
Proxmox 8.4
Motherboard GENOA2D24G-2L+
CPU: 2x AMD EPYC 9654 96-Core Processor
GPU: 5x RTX PRO 6000 blackwell and 6x RTX 5090
I really need to get this sorted and if anyone will help, i can pay for your time guys.
Beside configs below i also tried:
Setting 0 into each GPU at host boot: /sys/bus/pci/devices/0000:$id.0/d3cold_allowed
Disabled ESPM pci-e in bios so that it always stays on PCI-E 5.0 x16
Full errors:
[490431.407576] tap12850079i0: entered promiscuous mode
[490431.440282] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440291] wanbr: port 4(tap12850079i0) entered disabled state
[490431.440354] tap12850079i0: entered allmulticast mode
[490431.440608] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440611] wanbr: port 4(tap12850079i0) entered forwarding state
[490431.805032] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.818384] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.821151] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490432.869619] pcieport 0000:80:01.1: Data Link Layer Link Active not set in 1000 msec
[490432.874478] wanbr: port 4(tap12850079i0) entered disabled state
[490432.875371] tap12850079i0 (unregistering): left allmulticast mode


And then i see also CPU softlockup:

pveversion
Code:
pve-manager/8.4.1/2a5fa54a8503f96d (running kernel: 6.8.12-11-pve)
uname
Code:
Linux cee93dbc-5f39-4690-bc2c-ec4d4cbf4ea2 6.8.12-11-pve #1 SMP PREEMPT_DYNAMIC PMX 6.8.12-11 (2025-05-22T09:39Z) x86_64 GNU/Linux
cat /etc/default/grub | grep LINUX_DEF
Code:
GRUB_CMDLINE_LINUX_DEFAULT="quiet amd_iommu=on iommu=pt vfio_iommu_type1.allow_unsafe_interrupts=1 vfio-pci.ids=10de:22e8,10de:2bb1 hugepagesz=1G hugepages=1300 default_hugepagesz=1G initcall_blacklist=sysfb_init"
cat /etc/modprobe.d/vfio.conf
Code:
options vfio_iommu_type1 allow_unsafe_interrupts=1
options kvm ignore_msrs=1 report_ignored_msrs=0
options vfio-pci ids=10de:22e8,10de:2bb1 disable_vga=1 disable_idle_d3=1
cat /etc/modprobe.d/blacklist-gpu.conf
Code:
blacklist radeon
blacklist nouveau
blacklist nvidia
# Additional NVIDIA related blacklists
blacklist snd_hda_intel
blacklist amd76x_edac
blacklist vga16fb
blacklist rivafb
blacklist nvidiafb
blacklist rivatv
blacklist microcode
lspci -vv | grep NVID
Code:
01:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
Subsystem: NVIDIA Corporation Device 204b
01:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
Subsystem: NVIDIA Corporation Device 0000
24:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
Subsystem: NVIDIA Corporation Device 204b
24:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
Subsystem: NVIDIA Corporation Device 0000
41:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
Subsystem: NVIDIA Corporation Device 204b
41:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
Subsystem: NVIDIA Corporation Device 0000
61:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
Subsystem: NVIDIA Corporation Device 204b
61:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
Subsystem: NVIDIA Corporation Device 0000
81:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
Subsystem: NVIDIA Corporation Device 204b
81:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
Subsystem: NVIDIA Corporation Device 0000
lspci -nnk | grep -iA3 nvidia
Code:
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:2bb1] (rev a1)
Subsystem: NVIDIA Corporation Device [10de:204b]
Kernel driver in use: vfio-pci
Kernel modules: nvidiafb, nouveau
01:00.1 Audio device [0403]: NVIDIA Corporation Device [10de:22e8] (rev a1)
Subsystem: NVIDIA Corporation Device [10de:0000]
Kernel driver in use: vfio-pci
Kernel modules: snd_hda_intel
iommu
Code:
root@cee93dbc-5f39-4690-bc2c-ec4d4cbf4ea2:/# dmesg | grep -i iommu
[ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-6.8.12-11-pve root=UUID=072efd2c-22ba-443d-b869-2606bf91cea7 ro quiet quiet amd_iommu=on iommu=pt vfio_iommu_type1.allow_unsafe_interrupts=1 vfio-pci.ids=10de:22e8,10de:2bb1 skew_tick=1 hugepagesz=1G hugepages=1495 default_hugepagesz=1G hugepagesz=1G hugepages=1495 default_hugepagesz=1G
[ 1.236207] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-6.8.12-11-pve root=UUID=072efd2c-22ba-443d-b869-2606bf91cea7 ro quiet quiet amd_iommu=on iommu=pt vfio_iommu_type1.allow_unsafe_interrupts=1 vfio-pci.ids=10de:22e8,10de:2bb1 skew_tick=1 hugepagesz=1G hugepages=1495 default_hugepagesz=1G hugepagesz=1G hugepages=1495 default_hugepagesz=1G
[ 6.949405] iommu: Default domain type: Passthrough (set via kernel command line)
[ 7.042236] pci 0000:60:00.2: AMD-Vi: IOMMU performance counters supported
[ 7.051849] pci 0000:60:00.3: Adding to iommu group 0
[ 7.051923] pci 0000:60:01.0: Adding to iommu group 1
[ 7.051965] pci 0000:60:01.1: Adding to iommu group 2
[ 7.052036] pci 0000:60:02.0: Adding to iommu group 3
[ 7.052106] pci 0000:60:03.0: Adding to iommu group 4
[ 7.052177] pci 0000:60:04.0: Adding to iommu group 5
[ 7.052252] pci 0000:60:05.0: Adding to iommu group 6
[ 7.052296] pci 0000:60:05.3: Adding to iommu group 7
[ 7.052339] pci 0000:60:05.4: Adding to iommu group 8
[ 7.052476] pci 0000:60:07.0: Adding to iommu group 9
[ 7.052517] pci 0000:60:07.1: Adding to iommu group 9
[ 7.052557] pci 0000:60:07.2: Adding to iommu group 9
...
4G support: I do not have directly in bios that setting but i guess it works (it has 64bit so i guess it has that support)
Code:
root@cee93dbc-5f39-4690-bc2c-ec4d4cbf4ea2:/# lspci -vvv | grep -i prefetchable
Prefetchable memory behind bridge: 18c000000000-18e001ffffff [size=131104M] [32-bit]
Prefetchable memory behind bridge: 18e002100000-18e0021fffff [size=1M] [32-bit]
Prefetchable memory behind bridge: 00000000fff00000-00000000000fffff [disabled] [64-bit]
Region 0: Memory at f0000000 (32-bit, non-prefetchable) [size=64M]
Region 1: Memory at 18c000000000 (64-bit, prefetchable) [size=128G]
Region 3: Memory at 18e000000000 (64-bit, prefetchable) [size=32M]
Region 0: Memory at f4080000 (32-bit, non-prefetchable) [disabled] [size=16K]
Region 0: Memory at 18e002180000 (64-bit, prefetchable) [disabled] [size=512K]
Region 2: Memory at 18e002100000 (64-bit, prefetchable) [disabled] [size=512K]
Region 0: Memory at f4300000 (64-bit, non-prefetchable) [size=1M]
Region 2: Memory at f4200000 (32-bit, non-prefetchable) [size=1M]
Region 5: Memory at f4400000 (32-bit, non-prefetchable) [size=8K]
Region 5: Memory at f4501000 (32-bit, non-prefetchable) [size=2K]
My special command for checking link state of each GPU via riser and its speed and PCI-E gen
Code:
lspci | grep VGA | awk '{print $1}' | while read id; do info=$(lspci -vvv -s "$id"); lnkcap=$(echo "$info" | grep -m1 "LnkCap:" | grep -oP 'Speed \K[0-9]+'); capw=$(echo "$info" | grep -m1 "LnkCap:" | grep -oP 'Width x\K[0-9]+'); lnksta=$(echo "$info" | grep -m1 "LnkSta:" | grep -oP 'Speed \K[0-9]+'); staw=$(echo "$info" | grep -m1 "LnkSta:" | grep -oP 'Width x\K[0-9]+'); case $lnksta in 2) gen="PCIe 1.0" ;; 5) gen="PCIe 2.0" ;; 8) gen="PCIe 3.0" ;; 16) gen="PCIe 4.0" ;; 32) gen="PCIe 5.0" ;; 64) gen="PCIe 6.0" ;; *) gen="" ;; esac; [ -z "$gen" ] || [ -z "$lnksta" ] || [ -z "$lnkcap" ] && continue; color="\033[32m"; [ "$lnksta" -lt "$lnkcap" ] || [ "$staw" -lt "$capw" ] && color="\033[31m"; echo -e "\033[36mBUS ID: $id\033[0m"; echo -e "${color}$gen (${lnksta} GT/s)\033[0m"; echo -e "LnkSta/LnkCap: ${color}Speed ${lnksta}GT/s, Width x${staw} / Speed ${lnkcap}GT/s, Width x${capw}\033[0m"; echo ""; done
BUS ID: 01:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 24:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 41:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 61:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 81:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
After this issue happens, i no longer can read device info:

Last edited: