I have implemented this also and i also see issue.Which device is reporting the D3cold to D0 message?
If you have different hardware you can find using: lspci -nn
e.g
02:00.0 VGA compatible controller [0300]: NVIDIA Corporation GB202 [GeForce RTX 5090] [10de:2b85] (rev a1)
02:00.1 Audio device [0403]: NVIDIA Corporation Device [10de:22e8] (rev a1)
The appropriate vendor and device id's for your system should be used when binding to vfio-pic and setting pm rules.
For the interrupt, in the guest check dmesg and lspci to determine which device is having the issue, then follow through to the host to see what's next.
dmesg -T
lspci -vv |grep -i 'interrupt:'
But here is the thing.
My RTX PRO 6000 blackwell 96GB 600w i think is working fine now
But RTX PRO 6000 blackwell 96GB 300w max-q is having this issue regardless.
Also did upgrade to 6.17 kernel and flashed that uefi firmware fix but still no luck with max-q version.
https://www.nvidia.com/content/Driv...ter_2.0-x64.exe&firmware=1&lang=us&type=Other