EDIT2: I think this solved the issue. Still confirming.
	
	
	
	
EDIT1: Got response from nvidia that they were able to reproduce this issue and they are thinking about fix.
Also i have installed apt install proxmox-kernel-6.14.8-2-bpo12-pve/stable and i see that RTX6000 boots super fast now vs very slow when i had older 6.8 and 6.11 kernels. In 6.14 they added some support for blackwell so worth to try it out.
https://www.phoronix.com/news/Linux-6.14-VFIO
Anyway the crash on shutdown is caused by either specific training itself or/and some module options for nvidia.
The training that caused issues afte applying options nvidia-drm modeset=0 and /etc/X11/xorg.conf.d now it does not crash gpu anymore.
But since client can do any stuff in VM, this is not good solution.
Filled bug report here: https://bugzilla.proxmox.com/show_bug.cgi?id=6603
Guys, i have no idea how to solve this. Already lost like 2 weeks.
Already fixed one side of issues by using ovmf bios but here is the second half mostly when sometimes guest shuts down.
I am passthrough RTX5090 and RTX PRO 6000 to linux VMs. And this is working fine (that would suggest no riser issues).
Then sometimes on guest shutdown i am getting those errors below.
Then only reboot of the whole host helps. Its like GPU goes into sleep or some other state and cant wake up or cant be seen.
Or probably there is some issue with vfio ?
I am getting two errors when this happens:
Errors:
"Unable to change power state from D3cold to D0, device inaccessible"
and
CPU BUG soft lockup
Software:
Proxmox 8.4
Motherboard GENOA2D24G-2L+
CPU: 2x AMD EPYC 9654 96-Core Processor
GPU: 5x RTX PRO 6000 blackwell and 6x RTX 5090
I really need to get this sorted and if anyone will help, i can pay for your time guys.
Beside configs below i also tried:
Setting 0 into each GPU at host boot: /sys/bus/pci/devices/0000:$id.0/d3cold_allowed
Disabled ESPM pci-e in bios so that it always stays on PCI-E 5.0 x16
Full errors:
[490431.407576] tap12850079i0: entered promiscuous mode
[490431.440282] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440291] wanbr: port 4(tap12850079i0) entered disabled state
[490431.440354] tap12850079i0: entered allmulticast mode
[490431.440608] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440611] wanbr: port 4(tap12850079i0) entered forwarding state
[490431.805032] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.818384] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.821151] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490432.869619] pcieport 0000:80:01.1: Data Link Layer Link Active not set in 1000 msec
[490432.874478] wanbr: port 4(tap12850079i0) entered disabled state
[490432.875371] tap12850079i0 (unregistering): left allmulticast mode


And then i see also CPU softlockup:

pveversion
	
	
	
		
uname
	
	
	
		
cat /etc/default/grub | grep LINUX_DEF
	
	
	
		
cat /etc/modprobe.d/vfio.conf
	
	
	
		
cat /etc/modprobe.d/blacklist-gpu.conf
	
	
	
		
lspci -vv | grep NVID
	
	
	
		
 
lspci -nnk | grep -iA3 nvidia
	
	
	
		
 
iommu
	
	
	
		
4G support: I do not have directly in bios that setting but i guess it works (it has 64bit so i guess it has that support)
	
	
	
		
 
My special command for checking link state of each GPU via riser and its speed and PCI-E gen
	
	
	
		
result:
BUS ID: 01:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 24:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 41:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 61:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 81:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
After this issue happens, i no longer can read device info:

My main thread
https://forum.proxmox.com/threads/p...kup-d3cold-to-d0-after-guest-shutdown.168424/
Reddit questions:
https://www.reddit.com/r/VFIO/comments/1lzx4hc/gpu_passthrough_cpu_bug_soft_lockup/
https://www.reddit.com/r/VFIO/comments/1m1nna2/do_you_have_stable_passthrough_on_rtx5090_rtx/
Guy who found solution in guest VM:
https://forum.level1techs.com/t/do-...ies-has-reset-bug-in-vm-passthrough/228549/34
My bug report in proxmox (probably its nvidia issue):
https://bugzilla.proxmox.com/show_bug.cgi?id=6603
				
			These messages are related to the GPU being put into a power sleep state, and not being able to be activated in time for the VM.
vfio-pci 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
vfio-pci 0000:02:00.1: Unable to change power state from D3cold to D0, device inaccessible
Try on the Host VM:
1. Kernel module
In /etc/default/grub add to options:
disable_idle_d3=1
2. Bind the GPU early. and stop idle 3D
Create /etc/modprobe.d/vfio-pci.conf:
# RTX 5090 GPU (10de:2b85) + Audio (10de:22e8)
# Bind both functions to vfio-pci and prevent runtime D3...
vfio-pci 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
vfio-pci 0000:02:00.1: Unable to change power state from D3cold to D0, device inaccessible
Try on the Host VM:
1. Kernel module
In /etc/default/grub add to options:
disable_idle_d3=1
2. Bind the GPU early. and stop idle 3D
Create /etc/modprobe.d/vfio-pci.conf:
# RTX 5090 GPU (10de:2b85) + Audio (10de:22e8)
# Bind both functions to vfio-pci and prevent runtime D3...
EDIT1: Got response from nvidia that they were able to reproduce this issue and they are thinking about fix.
Also i have installed apt install proxmox-kernel-6.14.8-2-bpo12-pve/stable and i see that RTX6000 boots super fast now vs very slow when i had older 6.8 and 6.11 kernels. In 6.14 they added some support for blackwell so worth to try it out.
https://www.phoronix.com/news/Linux-6.14-VFIO
Anyway the crash on shutdown is caused by either specific training itself or/and some module options for nvidia.
The training that caused issues afte applying options nvidia-drm modeset=0 and /etc/X11/xorg.conf.d now it does not crash gpu anymore.
But since client can do any stuff in VM, this is not good solution.
Filled bug report here: https://bugzilla.proxmox.com/show_bug.cgi?id=6603
Guys, i have no idea how to solve this. Already lost like 2 weeks.
Already fixed one side of issues by using ovmf bios but here is the second half mostly when sometimes guest shuts down.
I am passthrough RTX5090 and RTX PRO 6000 to linux VMs. And this is working fine (that would suggest no riser issues).
Then sometimes on guest shutdown i am getting those errors below.
Then only reboot of the whole host helps. Its like GPU goes into sleep or some other state and cant wake up or cant be seen.
Or probably there is some issue with vfio ?
I am getting two errors when this happens:
Errors:
"Unable to change power state from D3cold to D0, device inaccessible"
and
CPU BUG soft lockup
Software:
Proxmox 8.4
Motherboard GENOA2D24G-2L+
CPU: 2x AMD EPYC 9654 96-Core Processor
GPU: 5x RTX PRO 6000 blackwell and 6x RTX 5090
I really need to get this sorted and if anyone will help, i can pay for your time guys.
Beside configs below i also tried:
Setting 0 into each GPU at host boot: /sys/bus/pci/devices/0000:$id.0/d3cold_allowed
Disabled ESPM pci-e in bios so that it always stays on PCI-E 5.0 x16
Full errors:
[490431.407576] tap12850079i0: entered promiscuous mode
[490431.440282] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440291] wanbr: port 4(tap12850079i0) entered disabled state
[490431.440354] tap12850079i0: entered allmulticast mode
[490431.440608] wanbr: port 4(tap12850079i0) entered blocking state
[490431.440611] wanbr: port 4(tap12850079i0) entered forwarding state
[490431.805032] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.818384] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490431.821151] vfio-pci 0000:81:00.0: Unable to change power state from D3cold to D0, device inaccessible
[490432.869619] pcieport 0000:80:01.1: Data Link Layer Link Active not set in 1000 msec
[490432.874478] wanbr: port 4(tap12850079i0) entered disabled state
[490432.875371] tap12850079i0 (unregistering): left allmulticast mode


And then i see also CPU softlockup:

pveversion
		Code:
	
	pve-manager/8.4.1/2a5fa54a8503f96d (running kernel: 6.8.12-11-pve)uname
		Code:
	
	Linux cee93dbc-5f39-4690-bc2c-ec4d4cbf4ea2 6.8.12-11-pve #1 SMP PREEMPT_DYNAMIC PMX 6.8.12-11 (2025-05-22T09:39Z) x86_64 GNU/Linuxcat /etc/default/grub | grep LINUX_DEF
		Code:
	
	GRUB_CMDLINE_LINUX_DEFAULT="quiet amd_iommu=on iommu=pt vfio_iommu_type1.allow_unsafe_interrupts=1 vfio-pci.ids=10de:22e8,10de:2bb1 hugepagesz=1G hugepages=1300 default_hugepagesz=1G initcall_blacklist=sysfb_init"cat /etc/modprobe.d/vfio.conf
		Code:
	
	options vfio_iommu_type1 allow_unsafe_interrupts=1
options kvm ignore_msrs=1 report_ignored_msrs=0
options vfio-pci ids=10de:22e8,10de:2bb1 disable_vga=1 disable_idle_d3=1cat /etc/modprobe.d/blacklist-gpu.conf
		Code:
	
	blacklist radeon
blacklist nouveau
blacklist nvidia
# Additional NVIDIA related blacklists
blacklist snd_hda_intel
blacklist amd76x_edac
blacklist vga16fb
blacklist rivafb
blacklist nvidiafb
blacklist rivatv
blacklist microcodelspci -vv | grep NVID
		Code:
	
	01:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
    Subsystem: NVIDIA Corporation Device 204b
01:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
    Subsystem: NVIDIA Corporation Device 0000
24:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
    Subsystem: NVIDIA Corporation Device 204b
24:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
    Subsystem: NVIDIA Corporation Device 0000
41:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
    Subsystem: NVIDIA Corporation Device 204b
41:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
    Subsystem: NVIDIA Corporation Device 0000
61:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
    Subsystem: NVIDIA Corporation Device 204b
61:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
    Subsystem: NVIDIA Corporation Device 0000
81:00.0 VGA compatible controller: NVIDIA Corporation Device 2bb1 (rev a1) (prog-if 00 [VGA controller])
    Subsystem: NVIDIA Corporation Device 204b
81:00.1 Audio device: NVIDIA Corporation Device 22e8 (rev a1)
    Subsystem: NVIDIA Corporation Device 0000lspci -nnk | grep -iA3 nvidia
		Code:
	
	    01:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:2bb1] (rev a1)
    Subsystem: NVIDIA Corporation Device [10de:204b]
    Kernel driver in use: vfio-pci
    Kernel modules: nvidiafb, nouveau
01:00.1 Audio device [0403]: NVIDIA Corporation Device [10de:22e8] (rev a1)
    Subsystem: NVIDIA Corporation Device [10de:0000]
    Kernel driver in use: vfio-pci
    Kernel modules: snd_hda_inteliommu
		Code:
	
	root@cee93dbc-5f39-4690-bc2c-ec4d4cbf4ea2:/# dmesg | grep -i iommu
[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-6.8.12-11-pve root=UUID=072efd2c-22ba-443d-b869-2606bf91cea7 ro quiet quiet amd_iommu=on iommu=pt vfio_iommu_type1.allow_unsafe_interrupts=1 vfio-pci.ids=10de:22e8,10de:2bb1 skew_tick=1 hugepagesz=1G hugepages=1495 default_hugepagesz=1G hugepagesz=1G hugepages=1495 default_hugepagesz=1G
[    1.236207] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-6.8.12-11-pve root=UUID=072efd2c-22ba-443d-b869-2606bf91cea7 ro quiet quiet amd_iommu=on iommu=pt vfio_iommu_type1.allow_unsafe_interrupts=1 vfio-pci.ids=10de:22e8,10de:2bb1 skew_tick=1 hugepagesz=1G hugepages=1495 default_hugepagesz=1G hugepagesz=1G hugepages=1495 default_hugepagesz=1G
[    6.949405] iommu: Default domain type: Passthrough (set via kernel command line)
[    7.042236] pci 0000:60:00.2: AMD-Vi: IOMMU performance counters supported
[    7.051849] pci 0000:60:00.3: Adding to iommu group 0
[    7.051923] pci 0000:60:01.0: Adding to iommu group 1
[    7.051965] pci 0000:60:01.1: Adding to iommu group 2
[    7.052036] pci 0000:60:02.0: Adding to iommu group 3
[    7.052106] pci 0000:60:03.0: Adding to iommu group 4
[    7.052177] pci 0000:60:04.0: Adding to iommu group 5
[    7.052252] pci 0000:60:05.0: Adding to iommu group 6
[    7.052296] pci 0000:60:05.3: Adding to iommu group 7
[    7.052339] pci 0000:60:05.4: Adding to iommu group 8
[    7.052476] pci 0000:60:07.0: Adding to iommu group 9
[    7.052517] pci 0000:60:07.1: Adding to iommu group 9
[    7.052557] pci 0000:60:07.2: Adding to iommu group 9
...4G support: I do not have directly in bios that setting but i guess it works (it has 64bit so i guess it has that support)
		Code:
	
	root@cee93dbc-5f39-4690-bc2c-ec4d4cbf4ea2:/# lspci -vvv | grep -i prefetchable
    Prefetchable memory behind bridge: 18c000000000-18e001ffffff [size=131104M] [32-bit]
    Prefetchable memory behind bridge: 18e002100000-18e0021fffff [size=1M] [32-bit]
    Prefetchable memory behind bridge: 00000000fff00000-00000000000fffff [disabled] [64-bit]
    Region 0: Memory at f0000000 (32-bit, non-prefetchable) [size=64M]
    Region 1: Memory at 18c000000000 (64-bit, prefetchable) [size=128G]
    Region 3: Memory at 18e000000000 (64-bit, prefetchable) [size=32M]
    Region 0: Memory at f4080000 (32-bit, non-prefetchable) [disabled] [size=16K]
    Region 0: Memory at 18e002180000 (64-bit, prefetchable) [disabled] [size=512K]
    Region 2: Memory at 18e002100000 (64-bit, prefetchable) [disabled] [size=512K]
    Region 0: Memory at f4300000 (64-bit, non-prefetchable) [size=1M]
    Region 2: Memory at f4200000 (32-bit, non-prefetchable) [size=1M]
    Region 5: Memory at f4400000 (32-bit, non-prefetchable) [size=8K]
    Region 5: Memory at f4501000 (32-bit, non-prefetchable) [size=2K]My special command for checking link state of each GPU via riser and its speed and PCI-E gen
		Code:
	
	lspci | grep VGA | awk '{print $1}' | while read id; do info=$(lspci -vvv -s "$id"); lnkcap=$(echo "$info" | grep -m1 "LnkCap:" | grep -oP 'Speed \K[0-9]+'); capw=$(echo "$info" | grep -m1 "LnkCap:" | grep -oP 'Width x\K[0-9]+'); lnksta=$(echo "$info" | grep -m1 "LnkSta:" | grep -oP 'Speed \K[0-9]+'); staw=$(echo "$info" | grep -m1 "LnkSta:" | grep -oP 'Width x\K[0-9]+'); case $lnksta in 2) gen="PCIe 1.0" ;; 5) gen="PCIe 2.0" ;; 8) gen="PCIe 3.0" ;; 16) gen="PCIe 4.0" ;; 32) gen="PCIe 5.0" ;; 64) gen="PCIe 6.0" ;; *) gen="" ;; esac; [ -z "$gen" ] || [ -z "$lnksta" ] || [ -z "$lnkcap" ] && continue; color="\033[32m"; [ "$lnksta" -lt "$lnkcap" ] || [ "$staw" -lt "$capw" ] && color="\033[31m"; echo -e "\033[36mBUS ID: $id\033[0m"; echo -e "${color}$gen (${lnksta} GT/s)\033[0m"; echo -e "LnkSta/LnkCap: ${color}Speed ${lnksta}GT/s, Width x${staw} / Speed ${lnkcap}GT/s, Width x${capw}\033[0m"; echo ""; doneBUS ID: 01:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 24:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 41:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 61:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
BUS ID: 81:00.0
PCIe 5.0 (32 GT/s)
LnkSta/LnkCap: Speed 32GT/s, Width x16 / Speed 32GT/s, Width x16
After this issue happens, i no longer can read device info:

My main thread
https://forum.proxmox.com/threads/p...kup-d3cold-to-d0-after-guest-shutdown.168424/
Reddit questions:
https://www.reddit.com/r/VFIO/comments/1lzx4hc/gpu_passthrough_cpu_bug_soft_lockup/
https://www.reddit.com/r/VFIO/comments/1m1nna2/do_you_have_stable_passthrough_on_rtx5090_rtx/
Guy who found solution in guest VM:
https://forum.level1techs.com/t/do-...ies-has-reset-bug-in-vm-passthrough/228549/34
My bug report in proxmox (probably its nvidia issue):
https://bugzilla.proxmox.com/show_bug.cgi?id=6603
			
				Last edited: 
				
		
	
										
										
											
	
										
									
								 
	 
	
 on a 5090 lol ? Nope
 on a 5090 lol ? Nope 