pcieport Non-Fatal error

jzness

New Member
May 7, 2023
2
1
3
Hello,

I have been dealing with an issue recently ever since I added a secondary video card to my machine. I am not sure if it has to due with it being an identical card or what but I continuously get this error non stop 24/7. It makes it difficult to see the actual syslog events because its just over run with these errors. The video cards are GForce GT 730's

May 03 16:15:06 redactprox kernel: pcieport 0000:00:02.0: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:02:00.0

May 03 16:15:06 redactprox kernel: pcieport 0000:00:02.0: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:02:00.0

May 03 16:15:06 redactprox kernel: pcieport 0000:00:02.0: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:02:00.0

May 03 16:15:07 redactprox kernel: pcieport 0000:00:02.0: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:02:00.0

I have seen some reports online of similar pciport errors that just consistently flood the syslog, but nothing quite like what I am seeing. I have seen some ways to hide these error alerts but am more curious what is causing this/how I can resolve it.

Here is my lspci list -> lspci - proxmox issue - Pastebin.com

Has anyone experienced this before? Could it be due to the identical GPU's or something different? One GPU is passed through to a Windows 11 VM, the other is just being used by Proxmox I assume. I had to get the additional card because I was having trouble passing through the 1 gpu without proxmox taking it for itself.

Thank you ahead of time for any help!

r/Proxmox - pcieport Non-Fatal error
 
I had to get the additional card because I was having trouble passing through the 1 gpu without proxmox taking it for itself.
Have you tried this work-around for that?

I have no idea how to fix the AER errors, sorry. Maybe you can find some more informative errors than "Multiple Uncorrected" early after a reboot? When do those errors start; at boot or when starting the VM with passthrough? Which GPU of the two is passed through and which is used by Proxmox? What do your IOMMU groups look like and what is the VM configuration file?
 
Have you tried this work-around for that?

I have no idea how to fix the AER errors, sorry. Maybe you can find some more informative errors than "Multiple Uncorrected" early after a reboot? When do those errors start; at boot or when starting the VM with passthrough? Which GPU of the two is passed through and which is used by Proxmox? What do your IOMMU groups look like and what is the VM configuration file?
Ah ha! So I actually did resolve it I believe, but it wasn't due to the duplicate vendor/device ID's. I initially went ahead and assigned the second card to a dummy VM, which seemed to resolve it for a few minutes, but then it started again.

Turns out -- it appears to have been due to me having the nvidia drivers still on blacklist when I did an initial passthrough config previously (when I just had 1 card). Once I removed those drivers (I assume the nvidia driver specifically) and rebooted, I am no longer getting those errors.

Thanks for those resources, they will def may be useful for me in the future depending on what I end up doing with the second card.
 
  • Like
Reactions: leesteken

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!