PVE crashes because of BCM57416

SheridansNL

Jul 30, 2023
Hey everybody,
I have a problem and hope someone can help me solve it or point me in the right direction.
My setup:
Mobo: ASRock Rack X570D4U-2L2T/BCM
-- BCM and BIOS updated to the latest versions.
CPU: AMD 4750G
GPU: Nvidia GTX 1050 Ti

I'm running PVE 8.0. After the host has been up for some time, I get the following error on my 10G Broadcom NICs (BCM57416).
After a reboot, the NICs are down and are not detected by lspci or ip link show.
If I disconnect the power from the host, wait a few minutes, reconnect it, and start the host, the NICs come up again.
What do you think?

Error Message:
mb85t0jldyeb1.jpg


Ethtool output:
hwnur4pldyeb1.jpg


Thank you all for the help.
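
For anyone who wants to log the moment the NICs drop out, here is a minimal Python sketch that checks which network interfaces are still bound to the Broadcom driver. It assumes the BCM57416 ports use the bnxt_en kernel driver and that the usual sysfs layout under /sys/class/net is available; the function name bnxt_interfaces is made up for this example.

```python
import os


def bnxt_interfaces(net_root="/sys/class/net", driver="bnxt_en"):
    """Return {interface name: operstate} for interfaces bound to `driver`.

    Assumption: the BCM57416 ports are handled by the bnxt_en driver.
    """
    found = {}
    if not os.path.isdir(net_root):
        return found
    for ifname in sorted(os.listdir(net_root)):
        driver_link = os.path.join(net_root, ifname, "device", "driver")
        # Virtual interfaces (lo, bridges, taps) have no device/driver link.
        if not os.path.islink(driver_link):
            continue
        if os.path.basename(os.readlink(driver_link)) != driver:
            continue
        try:
            with open(os.path.join(net_root, ifname, "operstate")) as fh:
                state = fh.read().strip()
        except OSError:
            state = "unknown"
        found[ifname] = state
    return found


if __name__ == "__main__":
    nics = bnxt_interfaces()
    if not nics:
        print("No interfaces bound to bnxt_en were found")
    for name, state in nics.items():
        print(f"{name}: {state}")
```

Run from cron or a systemd timer, an empty result would match the state described above where ip link show no longer lists the ports.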

 
Hey Moayad,

BIOS: is the latest version.
The latest firmware version for the NIC that I could find is installed.

Could this error be related to overheating?
 
I have the same issue, on the same board. It seems to be an issue with the ASRock Rack boards too, or at least more prevalent on them. I thought I was on the right track by turning on the SR-IOV option in the BIOS, and the error changed a bit for me. I was originally getting what the OP posted. I have an open ticket with ASRock too. I would love to hear whether anyone resolved this issue or just returned the boards.
 

Attachments

  • 2024-12-27 09_13_24-Remote KVM [192.168.1.39] - [1920 x 1200 ].png (409.1 KB)
  • 2024-12-27 09_46_53-Remote KVM [192.168.1.39] - [1024 x 768 ].png (97.9 KB)
Hey Moayad,

Bios: is the latest
The latest firmware version for the Nic that I could find is installed.

Could this error be related to overheating?
Which firmware package did you use? The one I found gives me a "Package PCI Id Mismatch" error. I have the same board, and the PCI ID is 14e4:16d8.
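
In case it helps to compare boards, here is a rough Python sketch that reads the vendor and device IDs straight from sysfs and lists every PCI address that matches 14e4:16d8. It assumes the standard /sys/bus/pci/devices layout; find_pci_devices and read_hex are names made up for this example, and it only looks at the vendor and device IDs, so it cannot tell whether the firmware package is checking some other ID.

```python
import os


def read_hex(path):
    """Read a sysfs ID file such as 'vendor' (content like '0x14e4')."""
    with open(path) as fh:
        return int(fh.read().strip(), 16)


def find_pci_devices(vendor=0x14E4, device=0x16D8,
                     pci_root="/sys/bus/pci/devices"):
    """Return the PCI addresses whose vendor:device ID matches."""
    matches = []
    if not os.path.isdir(pci_root):
        return matches
    for addr in sorted(os.listdir(pci_root)):
        base = os.path.join(pci_root, addr)
        try:
            dev_vendor = read_hex(os.path.join(base, "vendor"))
            dev_device = read_hex(os.path.join(base, "device"))
        except (OSError, ValueError):
            continue  # skip entries that cannot be read or parsed
        if dev_vendor == vendor and dev_device == device:
            matches.append(addr)
    return matches


if __name__ == "__main__":
    found = find_pci_devices()
    if found:
        for addr in found:
            print(f"14e4:16d8 present at {addr}")
    else:
        print("No 14e4:16d8 device is enumerated on the PCI bus")
```

An empty result after a warm reboot would line up with the ports no longer showing in lspci, as described in the first post.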