Commenting for visibility and to track solutions. I have a NUC7i3BNH running Proxmox and receive the "Detected Hardware Unit Hang" error when uploading large files inside a KVM to the cloud. It completely hangs the entire Proxmox machine and needs to be hard-rebooted.
Here is my NIC:
I set the following in "/etc/network/interfaces".
Here is the syslog snippet of that post-up config running.
Hopefully this will fix my issue. Will keep an eye out for a more permanent solution.
Code:
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] e1000e 0000:00:1f.6 eno1: Detected Hardware Unit Hang:
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] TDH <cf>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] TDT <17>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] next_to_use <17>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] next_to_clean <cf>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] buffer_info[next_to_clean]:
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] time_stamp <121fbbae8>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] next_to_watch <d0>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] jiffies <121fbbd98>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] next_to_watch.status <0>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] MAC Status <40080083>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] PHY Status <796d>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] PHY 1000BASE-T Status <3c00>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] PHY Extended Status <3000>
Mar 8 23:01:23 proxmox01 kernel: [2280865.612614] PCI Status <10>
Here is my NIC:
Code:
root@proxmox01:~# lspci -nnk | grep -A2 Ethernet
00:1f.6 Ethernet controller [0200]: Intel Corporation Ethernet Connection (4) I219-V [8086:15d8] (rev 21)
Subsystem: Intel Corporation Ethernet Connection (4) I219-V [8086:2068]
Kernel driver in use: e1000e
Kernel modules: e1000e
I set the following in "/etc/network/interfaces".
Code:
iface eno1 inet manual
post-up /usr/bin/logger -p debug -t ifup "Disabling offload for eno1" && /sbin/ethtool -K $IFACE tso off gso off gro off && /usr/bin/logger -p debug -t ifup "Disabled offload for eno1"
Here is the syslog snippet of that post-up config running.
Code:
Mar 9 10:55:42 proxmox01 ifup: Disabling offload for eno1
Mar 9 10:55:42 proxmox01 ifup: Disabled offload for eno1
Hopefully this will fix my issue. Will keep an eye out for a more permanent solution.