network card drop, igc 0000:09:00.0 eno1: PCIE link lost

Hello, have you found a solution to this problem? I have the same problem and I can't solve it. I'm using proxmox 8.03 on a bee-link SER 6 MAX mini pc. Anyone can help me please ?
I have the exact same computer, the Beelink SER 6 Max and am having the same issue. While I'm glad it's not a hardware issue with the computer itself, it's obviously an issue with the driver or something like that. I haven't tried any of the above solutions yet. The odd part is that this was working for about 3 weeks with no issues. The problem started yesterday after a reboot. I'm going to try a reinstall for starters.
 
Supermicro H13SAE-MF with 2 x i210 onboard. I have 4 servers with this board, all with the same problem. Disabled all Power Management.
Since this is a verry old problem, maybe no fix in the near future will add some i350 network cards to fix the random loss of network.

root@PVE-01:~# cat /proc/cmdline
initrd=\EFI\proxmox\6.5.11-7-pve\initrd.img-6.5.11-7-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs pcie_aspm=off pcie_port_pm=off
 
I have the exact same computer, the Beelink SER 6 Max and am having the same issue. While I'm glad it's not a hardware issue with the computer itself, it's obviously an issue with the driver or something like that. I haven't tried any of the above solutions yet. The odd part is that this was working for about 3 weeks with no issues. The problem started yesterday after a reboot. I'm going to try a reinstall for starters.
The solution was a BIOS update for the computer provided by the manufacturer, no issues after that. I found out when I tried to reinstall Proxmox and had the network drop during the install. This also happened with Ubuntu.
 
Trying smth now, I had in BIOS ASPM Off and kernel parameters "pcie_aspm=off pcie_port_pm=off"

root@PVE-01:~# lspci -vvv | grep LnkCtl:
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk-
LnkCtl: ASPM L1 Enabled; Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk-
LnkCtl: ASPM L1 Enabled; Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM L1 Enabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+

Removed "pcie_aspm=off pcie_port_pm=off" and added only "pcie_aspm.policy=performance"

root@PVE-01:~# lspci -vvv | grep LnkCtl:
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk-
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk-
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+


And yes the Intel i210 was "ASPM L1 Enabled"
 
try to disable network card offloading
ethtool must be installed


iface eno1 inet manual
pre-up ethtool --offload $IFACE rx off tx off sg off tso off gso off gro off lro off rxvlan off txvlan off rxhash off

iface eno2 inet manual
pre-up ethtool --offload $IFACE rx off tx off sg off tso off gso off gro off lro off rxvlan off txvlan off rxhash off
 
Supermicro H13SAE-MF with 2 x i210 onboard. I have 4 servers with this board, all with the same problem. Disabled all Power Management.
Since this is a verry old problem, maybe no fix in the near future will add some i350 network cards to fix the random loss of network.

root@PVE-01:~# cat /proc/cmdline
initrd=\EFI\proxmox\6.5.11-7-pve\initrd.img-6.5.11-7-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs pcie_aspm=off pcie_port_pm=off
Same here, 1 Server with Supermicro H13SAE-MF with 2 x i210. One of the nics keeps crashing. Now trying with
pcie_port_pm=off pcie_aspm.policy=performance
I hope that will get solved...
 
You can check with

lspci -vvv | grep LnkCtl:

The output should be smth like this:

LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk-
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk-
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
 
  • Like
Reactions: henkesn
Warning for ZFS users: if you are booting to ZFS disks, updating /etc/default/grub and update-grub will not help.

If you want to add pcie_port_pm=off pcie_aspm.policy=performance to the kernel even when booting on ZFS, you must add these parameters to /etc/kernel/cmdline and then run proxmox-boot-tool refresh. After the reboot, you should already find these parameters by cat /proc/cmdline.

I also use a SuperMicro H13SAE-MF motherboard with Ryzen 7950X3D and the latest Proxmox VE 8.2, and after a few days the server always disconnected from the network. I also tried different kernel versions from 6.2 to 6.8 and only this helped.
 
Warning for ZFS users: if you are booting to ZFS disks, updating /etc/default/grub and update-grub will not help.

If you want to add pcie_port_pm=off pcie_aspm.policy=performance to the kernel even when booting on ZFS, you must add these parameters to /etc/kernel/cmdline and then run proxmox-boot-tool refresh. After the reboot, you should already find these parameters by cat /proc/cmdline.

I also use a SuperMicro H13SAE-MF motherboard with Ryzen 7950X3D and the latest Proxmox VE 8.2, and after a few days the server always disconnected from the network. I also tried different kernel versions from 6.2 to 6.8 and only this helped.
I think this is not related to ZFS, it depends if Proxmox is installed with GRUB or Systemd-boot.

For more details:
Link
 
Last edited:
  • Like
Reactions: jan.reges

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!