DELL fatal error was detected after Proxmox install

Upgrade your firmware.
I did: I upgraded the iDRAC, BIOS, Lifecycle Controller, and RAID controller firmware, but it didn't solve the problem. In my case the HPE network cards could also be the cause (they are not properly recognized by the BIOS), so I'm going to replace them with Dell ones. I hope that makes it work.
 
Me too :) No luck at the moment; I just hope it gets fixed in time.
 
I had issues with the onboard Broadcom NICs on my Dell PowerEdge T440 (driver instability). The solution that worked for me was:
  1. In BIOS → System BIOS / Integrated Devices → Embedded NIC1/2, I disabled both onboard Broadcom ports.
  2. Installed an Intel PCIe NIC in one of the available slots.
  3. Verified that the new card was detected in BIOS and the OS, then installed the Intel drivers.

After this change, the system became stable and networking works without problems. The Broadcom interfaces are no longer visible to the OS, and the Intel card is handling all network traffic.
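For anyone who wants to double-check step 3 from the OS side, here is a minimal Python sketch that lists which kernel driver is bound to each network interface. It relies on the standard Linux sysfs layout (`/sys/class/net/<iface>/device/driver`); the function name `nic_drivers` is just an illustrative helper, not something from Proxmox or Dell.

```python
import os

def nic_drivers(sysfs_net="/sys/class/net"):
    """Map each physical network interface to the kernel driver bound to it."""
    result = {}
    for iface in sorted(os.listdir(sysfs_net)):
        driver_link = os.path.join(sysfs_net, iface, "device", "driver")
        # Virtual interfaces (lo, bridges, bonds) have no device/driver link.
        if os.path.islink(driver_link):
            # The link target ends in the driver name, e.g. ".../drivers/tg3".
            result[iface] = os.path.basename(os.readlink(driver_link))
    return result
```

Calling `nic_drivers()` on the host should show no interface bound to `tg3` once the onboard Broadcom ports are disabled, and the Intel card should appear with an Intel driver (typically something like `igb` or `ixgbe`, depending on the model).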

cheers

 
Same problem here, with a Dell T440.
I really hope for a patch.
Everything is up to date, including the firmware of every hardware component and the latest updates from the Proxmox side.
The problem is with the onboard NICs, Broadcom NetXtreme BCM5720.
The issue only occurs during reboot: the system freezes and doesn't come back up on its own.
I never had this issue while ESXi 7.0 was installed.
 
I have the same issue. I'll try disabling the onboard NICs to see if that sorts things out.
Yes. Disabling my onboard NICs solves it. This was tricky because the error message,
A fatal error was detected on a component at bus X device Y function Z
suggested that my HBA hardware was the issue here. I almost bought a new one before I found this thread!
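Since the message only gives bus/device/function numbers, it is easy to blame the wrong card. A small Python sketch like the one below turns the message into the `bb:dd.f` form that `lspci` prints, so the address can be matched to an actual device. One assumption to verify on your own system: the numbers in the log are treated as decimal here, while `lspci` shows hexadecimal. The helper name `sel_to_pci_address` is made up for this example.

```python
import re

# Matches the "bus X device Y function Z" part of the event log message.
_SEL_PATTERN = re.compile(
    r"bus\s+(\d+)\s+device\s+(\d+)\s+function\s+(\d+)", re.IGNORECASE
)

def sel_to_pci_address(message):
    """Convert 'bus X device Y function Z' into an lspci-style 'bb:dd.f' string.

    Assumption: the numbers in the message are decimal; lspci prints hex.
    """
    match = _SEL_PATTERN.search(message)
    if match is None:
        raise ValueError("no bus/device/function triple found in message")
    bus, device, function = (int(g) for g in match.groups())
    return f"{bus:02x}:{device:02x}.{function:x}"
```

The resulting address can then be passed to `lspci -s` to see which device actually sits at that location before buying replacement hardware.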

Even blacklisting the tg3 driver used by the onboard NICs does not solve it; only disabling the NICs in the BIOS did. This suggests to me that this is actually a bug in Dell's or Broadcom's firmware, not in Proxmox or any other Linux software.
 
Still running into what seems to be the same issue: Dell R940, Broadcom BCM5720 2-port PCI card, driver tg3 7.0.2-4-pe, PVE 9.1.14, NIC upgraded to bootcode 5720-v1.43. It still happens during reboot. We are not seeing any kernel panic messages, but the iDRAC system event log shows "A fatal error was detected on a component at bus 1 device 0 function 1." So far our only option has been to swap in Intel NICs.
 
Didn't have much time to read the full thread, but from what I saw, the issue occurs on reboot, right?
If that is the case, then maybe try the kernel command-line parameter "reboot=pci", just in case the NICs have issues with the standard Linux reboot method. There are several other options for the "reboot=" parameter as well, in case this one does not work. Just look up "Linux Kernel Parameters", and then search for "reboot=".
Good luck!
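If you try this, it is worth confirming after the reboot that the parameter really reached the kernel. On Proxmox the command line is usually edited in `/etc/default/grub` (GRUB) or `/etc/kernel/cmdline` (systemd-boot), depending on how the host boots, so treat those paths as assumptions for your setup. The Python sketch below only reads the running kernel's command line from `/proc/cmdline`; the helper names are illustrative.

```python
def reboot_values(cmdline):
    """Return every value passed via reboot= on a kernel command line."""
    return [
        token.split("=", 1)[1]
        for token in cmdline.split()
        if token.startswith("reboot=")
    ]

def current_reboot_values(path="/proc/cmdline"):
    """Read the running kernel's command line and extract reboot= values."""
    with open(path) as f:
        return reboot_values(f.read())
```

An empty list from `current_reboot_values()` means the parameter was not applied, for example because the bootloader configuration was not regenerated after the edit.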
 
It was an interesting thought, but no dice. I tried all of bios, acpi, kbd, triple, efi, and pci.