network hung !!

Dhia

New Member
Dec 30, 2024
4
0
1
hello everyone,
I have proxmox 8.4.1 installed on dell PowerEdge t310 but sometime I got disconnected and I can not even access the web page of the proxmox
because of that I have to restart the proxmox server to solve the problem .
note: the NIC is NetXtreme

Can anyone help me with this problem?
 
It seems that you are experiencing network disconnection issues with your Proxmox 8.4.1 server running on a Dell PowerEdge T310, using a NetXtreme NIC. This is a common issue, especially with older Broadcom NICs (like the NetXtreme), and can often be traced to driver or power management problems.


Here are troubleshooting steps and solutions you can try:




1.​


Run the following command after a disconnection or after reboot:

journalctl -xe



Or check kernel logs:

dmesg | grep -i eth



Look for lines that mention:


  • eth0: Link is Down
  • bnx2, tg3, or bnx2x errors (Broadcom driver names)
  • IRQ or DMA issues



2.​


Run:


lspci -nnk | grep -A3 -i net



This shows:


  • The exact model of the NIC
  • The driver in use (tg3, bnx2, etc.)



3.​


Sometimes the NIC goes to sleep and fails to wake.


Add this to your boot kernel parameters:

pcie_aspm=off



Edit /etc/default/grub:

GRUB_CMDLINE_LINUX_DEFAULT="quiet pcie_aspm=off"


Then update GRUB and reboot:

update-grub
reboot


4.​


Sometimes the NIC driver needs proprietary firmware to function correctly.


Run:

apt install firmware-bnx2 firmware-linux-nonfree



Then reboot.




5.​


The NIC might be losing link due to negotiation failure.


Try forcing speed/duplex:

ethtool -s eth0 speed 1000 duplex full autoneg off



(Replace eth0 with your actual interface name, e.g., enp3s0)


To make permanent, create a systemd .link or use ifupdown configs.




6.​


Broadcom NetXtreme cards are known for these issues. Consider:


  • Adding a cheap Intel PRO/1000 PCIe NIC
  • Disabling onboard Broadcom in BIOS



7.​


Boot Ubuntu live or Debian live ISO.


  • If the same NIC loses connection, it’s likely hardware or firmware.
  • If stable, it may be a Proxmox driver or config issue.



8.​


Enable persistent journaling to catch logs even after reboot:


mkdir -p /var/log/journal
systemd-tmpfiles --create --prefix /var/log/journal
systemctl restart systemd-journald




Would you like to share:


  • The output of lspci -nnk | grep -A3 -i net?
  • The output of dmesg | grep -i bnx or tg3?

That will help identify if it's driver-related.
 
It seems that you are experiencing network disconnection issues with your Proxmox 8.4.1 server running on a Dell PowerEdge T310, using a NetXtreme NIC. This is a common issue, especially with older Broadcom NICs (like the NetXtreme), and can often be traced to driver or power management problems.


Here are troubleshooting steps and solutions you can try:




1.​


Run the following command after a disconnection or after reboot:

journalctl -xe



Or check kernel logs:

dmesg | grep -i eth



Look for lines that mention:


  • eth0: Link is Down
  • bnx2, tg3, or bnx2x errors (Broadcom driver names)
  • IRQ or DMA issues



2.​


Run:


lspci -nnk | grep -A3 -i net



This shows:


  • The exact model of the NIC
  • The driver in use (tg3, bnx2, etc.)



3.​


Sometimes the NIC goes to sleep and fails to wake.


Add this to your boot kernel parameters:

pcie_aspm=off



Edit /etc/default/grub:

GRUB_CMDLINE_LINUX_DEFAULT="quiet pcie_aspm=off"


Then update GRUB and reboot:

update-grub
reboot


4.​


Sometimes the NIC driver needs proprietary firmware to function correctly.


Run:

apt install firmware-bnx2 firmware-linux-nonfree



Then reboot.




5.​


The NIC might be losing link due to negotiation failure.


Try forcing speed/duplex:

ethtool -s eth0 speed 1000 duplex full autoneg off



(Replace eth0 with your actual interface name, e.g., enp3s0)


To make permanent, create a systemd .link or use ifupdown configs.




6.​


Broadcom NetXtreme cards are known for these issues. Consider:


  • Adding a cheap Intel PRO/1000 PCIe NIC
  • Disabling onboard Broadcom in BIOS



7.​


Boot Ubuntu live or Debian live ISO.


  • If the same NIC loses connection, it’s likely hardware or firmware.
  • If stable, it may be a Proxmox driver or config issue.



8.​


Enable persistent journaling to catch logs even after reboot:


mkdir -p /var/log/journal
systemd-tmpfiles --create --prefix /var/log/journal
systemctl restart systemd-journald




Would you like to share:


  • The output of lspci -nnk | grep -A3 -i net?
  • The output of dmesg | grep -i bnx or tg3?

That will help identify if it's driver-related.
thank you so much for your fast and detailed support

here's the output of the above commands:
02:00.0 Ethernet controller [0200]: Broadcom Inc. and subsidiaries NetXtreme II BCM5716 Gigabit Ethernet [14e4:163b] (rev 20)
DeviceName: Embedded NIC 1
Subsystem: Dell NetXtreme II BCM5716 Gigabit Ethernet [1028:02a4]
Kernel driver in use: bnx2
Kernel modules: bnx2
02:00.1 Ethernet controller [0200]: Broadcom Inc. and subsidiaries NetXtreme II BCM5716 Gigabit Ethernet [14e4:163b] (rev 20)
DeviceName: Embedded NIC 2
Subsystem: Dell NetXtreme II BCM5716 Gigabit Ethernet [1028:02a4]
Kernel driver in use: bnx2
Kernel modules: bnx2
03:00.0 SCSI storage controller [0100]: Broadcom / LSI SAS1068E PCI-Express Fusion-MPT SAS [1000:0058] (rev 08)
dmesg | grep -i bnx

[ 2.707894] bnx2 0000:02:00.0 eth0: Broadcom NetXtreme II BCM5716 1000Base-T (C0) PCI Express found at mem da000000, IRQ 16, node addr 78:2b:cb:09:98:3f
[ 2.746387] bnx2 0000:02:00.1 eth1: Broadcom NetXtreme II BCM5716 1000Base-T (C0) PCI Express found at mem dc000000, IRQ 17, node addr 78:2b:cb:09:98:40
[ 2.778572] bnx2 0000:02:00.0 eno1: renamed from eth0
[ 2.804796] bnx2 0000:02:00.1 eno2: renamed from eth1
[ 31.436541] bnx2 0000:02:00.0 eno1: entered allmulticast mode
[ 31.594273] bnx2 0000:02:00.0 eno1: using MSIX
[ 35.286972] bnx2 0000:02:00.0 eno1: NIC Copper Link is Up, 1000 Mbps full duplex
[ 44.785347] bnx2 0000:02:00.0 eno1: entered promiscuous mode
dmesg | grep -i eth
[ 2.707894] bnx2 0000:02:00.0 eth0: Broadcom NetXtreme II BCM5716 1000Base-T (C0) PCI Express found at mem da000000, IRQ 16, node addr 78:2b:cb:09:98:3f
[ 2.746387] bnx2 0000:02:00.1 eth1: Broadcom NetXtreme II BCM5716 1000Base-T (C0) PCI Express found at mem dc000000, IRQ 17, node addr 78:2b:cb:09:98:40
[ 2.778572] bnx2 0000:02:00.0 eno1: renamed from eth0
[ 2.804796] bnx2 0000:02:00.1 eno2: renamed from eth1
[ 29.438037] No Local Variables are initialized for Method [_GHL]
[ 29.438065] No Arguments are initialized for method [_GHL]
[ 29.438093] ACPI Error: Aborting method \_SB.PMI0._GHL due to previous error (AE_NOT_EXIST) (20230628/psparse-529)
[ 29.438139] ACPI Error: Aborting method \_SB.PMI0._PMC due to previous error (AE_NOT_EXIST) (20230628/psparse-529)
 
Last edited: