e1000e eno1: Detected Hardware Unit Hang:

I just started looking into this issue and found this thread. I'm testing the TSO workaround now.

I have three Dell Precision 3431s that use the Q246 chipset. The first one I updated from PVE 8 to 9 started exhibiting this issue, usually under network load such as file transfers, but sometimes when traffic wasn't significant. The other two have not shown the problem... yet.

All are on BIOS 1.32.0 (the latest when first deployed). 1.36.0 is out now, but the release notes are pretty terse about what it fixes.
 
There is no "Q246" chipset, only a "C246". And as mentioned, the original bug should only affect older chipsets/systems (see the list in my previous comment). So if the issue persists and was not fixed by a BIOS update, contact Dell. They should also be aware of this i219 erratum.
 

Oops, yes, that was a typo.
 
I have a few NUCs showing this behaviour; on two of them I have blacklisted the e1000e driver and use a different NIC, which seems to work fine.

What I have noticed, though, is that this only seems to happen on servers hosting VMs. I have one node that only hosts LXCs and had been running forever. I migrated a VM onto it because I needed to perform some upgrades, and a few seconds/minutes after the VM started, the server crashed, with the notorious log entry there when I looked for a reason. When the server came back, the upgrades had already completed, so I migrated the VM back to the original node, which hosts LXCs and VMs with the module blacklisted. No issues whatsoever since.
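For anyone wanting to try the same thing, blacklisting a module on Debian/Proxmox is a standard modprobe.d change. A minimal sketch (the filename is arbitrary; this assumes another NIC carries your traffic, since the onboard e1000e port will stop working entirely):

```shell
# Tell modprobe never to load the e1000e module
echo "blacklist e1000e" > /etc/modprobe.d/blacklist-e1000e.conf

# Rebuild the initramfs so the blacklist also applies at early boot
update-initramfs -u

# Reboot, then confirm the module is no longer loaded
# lsmod | grep e1000e   (should print nothing)
```

Only do this on a box where you have confirmed a second NIC (or out-of-band access) works, or you will lock yourself out.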
 
Does anyone know if the card below is affected by this bug? I have one lying around but I'm not sure about the model; it says Intel PRO.

[Attached: two photos of the card]
 
I have just asked Claude

The reply was
Yes, almost certainly. Based on the Intel PRO/1000 PT Desktop Adapter identification, the chip under that heatsink is the Intel 82572GI controller, which uses the e1000e driver on Linux.
To confirm, if you have this card installed in a Linux system, you can run:

```
lspci | grep -i ethernet
```

Honestly, unless you have high-throughput requirements on the card, I have seen no obvious impact from the workaround and would just apply it.
 
See Erratum Nr. 7: https://www.intel.com/content/dam/d...2571eb-82572ei-gbe-controller-spec-update.pdf
Just because it uses the "e1000e" driver does not necessarily mean that it is affected by this bug.
You have to specifically check the Intel documentation on related topics. The problem might manifest slightly differently on those older chipsets. From what I saw, most people here saw it with i218/i219.
 
One of the reasons why my setup is still pinned to kernel 6.8.12-8. I've had no issues since rolling back. Honestly have no desire to upgrade until this gets fixed.

If I pin my kernel to 6.8.12-8 can I upgrade all the other packages that need upgrading? I've been holding off
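Yes, pinning the boot kernel and upgrading the rest of the system are independent. A sketch using Proxmox's own tooling (the version string is the one mentioned above; adjust to what `kernel list` shows on your machine):

```shell
# Show installed kernels and which one is currently pinned/selected
proxmox-boot-tool kernel list

# Pin the known-good kernel so it remains the default at every boot,
# even after newer kernel packages are installed
proxmox-boot-tool kernel pin 6.8.12-8-pve

# Regular package upgrades can then proceed as usual; newer kernels
# get installed but are not booted until you unpin
apt update && apt dist-upgrade
```

`proxmox-boot-tool kernel unpin` reverts to booting the newest kernel once the issue is fixed.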
 
Here’s what I did, which has been working for me so far — no network hangs yet.

First, I completely removed the community script (https://community-scripts.github.io/ProxmoxVE/scripts?id=nic-offloading-fix). This simply involved disabling the service and removing the service unit file. The script had actually been working perfectly fine for me, and I never experienced a hang while using it. I mainly removed it because I wanted to apply it manually then update and upgrade my machine. I figured that, worst case, I could downgrade and pin the kernel if needed.

I then applied the community script’s fix manually in /etc/network/interfaces using a post-up command. However, I only disabled TSO and applied it only to the physical Intel NIC — not the vmbr interface.

Code:
iface enp0s25 inet manual
        post-up /sbin/ethtool -K $IFACE tso off
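To confirm the post-up hook actually took effect after an `ifreload -a` or reboot, you can query the offload state (interface name taken from the snippet above; adjust for your system):

```shell
# List offload settings for the physical NIC and filter for TSO;
# it should report "tcp-segmentation-offload: off"
ethtool -k enp0s25 | grep tcp-segmentation-offload
```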

I made this change while running kernel 6.8.12-11-pve. After that, I performed a normal update and upgrade through the Proxmox web interface, which upgraded the system to 6.8.12-19-pve.

So far, everything has been running fine. I’ll report back if I experience any network hangs.

Code:
admin@prox:~$ lspci | grep Eth
00:19.0 Ethernet controller: Intel Corporation Ethernet Connection I217-LM (rev 05)
 
I can confirm that turning TSO off on the physical NIC (via a post-up script) fixed this issue for me as well (I219-V). No hangs so far (two weeks), only slightly higher CPU utilization (1-2%) during NIC traffic.
 