Hi Guys,
I am having intermittent crashing issues on a new server I leased from Hetzner. It is stable for only 24-36 hours then the entire host goes offline requiring a power cycle.
I have searched the forums and have assembled the following information for you, to save time:
Possible solution I am testing, and will report back if it stabilizes my system
https://serverfault.com/questions/6...pter-unexpectedly-detected-hardware-unit-hang
and/or https://jhartman.pl/2018/08/06/proxmox-enp0s31f6-detected-hardware-unit-hang/
If that doesn’t work I’ll try this solution: Disabling Enhanced C1 (C1E) in the BIOS
I want to provide the logs of what I am seeing so you perhaps can update Proxmox to fix this issue with this model of NIC since I've seen other people are having similar issues with it with Proxmox. Please let me know if you know of any other solutions.
Thank you for your time.
root@prox01 ~ # lspci | egrep -i --color 'network|ethernet'
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection (2) I219-LM
I really need help because I just migrated to this node and it's not financially possible for me to migrate to another server.
Logs:
I put the long form logs here: https://pastebin.com/fR4tJ9Ji
short version:
root@prox01 ~ # qm config 100
agent: 1
bootdisk: scsi0
cores: 4
ide2: local:iso/CentOS-7-x86_64-Minimal-1810.iso,media=cdrom
memory: 16384
name: Proxmox-VM01
net0: virtio=FA:3E:C85:83:2E,bridge=vmbr0,firewall=1
numa: 0
onboot: 1
ostype: l26
scsi0: prox01vmstorage:vm-100-disk-0,size=480G
scsihw: virtio-scsi-pci
smbios1: uuid=bee0785b-9f65-49bf-a6fb-08187ccb33c8
sockets: 1
vmgenid: 1c4565b5-0961-486b-adfe-b3d769206d90
root@prox01 ~ # pveversion -v
proxmox-ve: 5.4-2 (running kernel: 4.15.18-20-pve)
pve-manager: 5.4-13 (running version: 5.4-13/aee6f0ec)
pve-kernel-4.15: 5.4-8
pve-kernel-4.15.18-20-pve: 4.15.18-46
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: not correctly installed
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-12
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-54
libpve-guest-common-perl: 2.0-20
libpve-http-server-perl: 2.0-14
libpve-storage-perl: 5.0-44
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.1.0-6
lxcfs: 3.0.3-pve1
novnc-pve: 1.0.0-3
proxmox-widget-toolkit: 1.0-28
pve-cluster: 5.0-38
pve-container: 2.0-40
pve-docs: 5.4-2
pve-edk2-firmware: 1.20190312-1
pve-firewall: 3.0-22
pve-firmware: 2.0-7
pve-ha-manager: 2.0-9
pve-i18n: 1.1-4
pve-libspice-server1: 0.14.1-2
pve-qemu-kvm: 3.0.1-4
pve-xtermjs: 3.12.0-1
qemu-server: 5.0-54
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
Aug 25 10:19:14 prox01 kernel: e1000e 0000:00:1f.6 eno1: Detected Hardware Unit Hang:
TDH <9e>
TDT <a9>
next_to_use <a9>
next_to_clean <9d>
buffer_info[next_to_clean]:
time_stamp <10001883b>
next_to_watch <9e>
jiffies <100018958>
next_to_watch.status <0>
MAC Status <40080083>
PHY Status <796d>
PHY 1000BASE-T Status <3800>
PHY Extended Status <3000>
PCI Status <10>
# dmidecode 3.0
Getting SMBIOS data from sysfs.
SMBIOS 2.8 present.
99 structures occupying 4763 bytes.
Table at 0x8AD2B000.
Handle 0x0000, DMI type 0, 26 bytes
BIOS Information
Vendor: American Megatrends Inc.
Version: 1.EC
Release Date: 05/21/2019 - Up to date
Address: 0xF0000
Runtime Size: 64 kB
ROM Size: 16384 kB
Characteristics:
PCI is supported
BIOS is upgradeable
BIOS shadowing is allowed
Boot from CD is supported
Selectable boot is supported
BIOS ROM is socketed
EDD is supported
5.25"/1.2 MB floppy services are supported (int 13h)
3.5"/720 kB floppy services are supported (int 13h)
3.5"/2.88 MB floppy services are supported (int 13h)
Print screen service is supported (int 5h)
8042 keyboard services are supported (int 9h)
Serial services are supported (int 14h)
Printer services are supported (int 17h)
ACPI is supported
USB legacy is supported
BIOS boot specification is supported
Targeted content distribution is supported
UEFI is supported
BIOS Revision: 5.12
I am having intermittent crashing issues on a new server I leased from Hetzner. It is stable for only 24-36 hours then the entire host goes offline requiring a power cycle.
I have searched the forums and have assembled the following information for you, to save time:
Possible solution I am testing, and will report back if it stabilizes my system
https://serverfault.com/questions/6...pter-unexpectedly-detected-hardware-unit-hang
and/or https://jhartman.pl/2018/08/06/proxmox-enp0s31f6-detected-hardware-unit-hang/
If that doesn’t work I’ll try this solution: Disabling Enhanced C1 (C1E) in the BIOS
I want to provide the logs of what I am seeing so you perhaps can update Proxmox to fix this issue with this model of NIC since I've seen other people are having similar issues with it with Proxmox. Please let me know if you know of any other solutions.
Thank you for your time.
root@prox01 ~ # lspci | egrep -i --color 'network|ethernet'
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection (2) I219-LM
I really need help because I just migrated to this node and it's not financially possible for me to migrate to another server.
Logs:
I put the long form logs here: https://pastebin.com/fR4tJ9Ji
short version:
root@prox01 ~ # qm config 100
agent: 1
bootdisk: scsi0
cores: 4
ide2: local:iso/CentOS-7-x86_64-Minimal-1810.iso,media=cdrom
memory: 16384
name: Proxmox-VM01
net0: virtio=FA:3E:C85:83:2E,bridge=vmbr0,firewall=1
numa: 0
onboot: 1
ostype: l26
scsi0: prox01vmstorage:vm-100-disk-0,size=480G
scsihw: virtio-scsi-pci
smbios1: uuid=bee0785b-9f65-49bf-a6fb-08187ccb33c8
sockets: 1
vmgenid: 1c4565b5-0961-486b-adfe-b3d769206d90
root@prox01 ~ # pveversion -v
proxmox-ve: 5.4-2 (running kernel: 4.15.18-20-pve)
pve-manager: 5.4-13 (running version: 5.4-13/aee6f0ec)
pve-kernel-4.15: 5.4-8
pve-kernel-4.15.18-20-pve: 4.15.18-46
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: not correctly installed
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-12
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-54
libpve-guest-common-perl: 2.0-20
libpve-http-server-perl: 2.0-14
libpve-storage-perl: 5.0-44
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.1.0-6
lxcfs: 3.0.3-pve1
novnc-pve: 1.0.0-3
proxmox-widget-toolkit: 1.0-28
pve-cluster: 5.0-38
pve-container: 2.0-40
pve-docs: 5.4-2
pve-edk2-firmware: 1.20190312-1
pve-firewall: 3.0-22
pve-firmware: 2.0-7
pve-ha-manager: 2.0-9
pve-i18n: 1.1-4
pve-libspice-server1: 0.14.1-2
pve-qemu-kvm: 3.0.1-4
pve-xtermjs: 3.12.0-1
qemu-server: 5.0-54
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
Aug 25 10:19:14 prox01 kernel: e1000e 0000:00:1f.6 eno1: Detected Hardware Unit Hang:
TDH <9e>
TDT <a9>
next_to_use <a9>
next_to_clean <9d>
buffer_info[next_to_clean]:
time_stamp <10001883b>
next_to_watch <9e>
jiffies <100018958>
next_to_watch.status <0>
MAC Status <40080083>
PHY Status <796d>
PHY 1000BASE-T Status <3800>
PHY Extended Status <3000>
PCI Status <10>
# dmidecode 3.0
Getting SMBIOS data from sysfs.
SMBIOS 2.8 present.
99 structures occupying 4763 bytes.
Table at 0x8AD2B000.
Handle 0x0000, DMI type 0, 26 bytes
BIOS Information
Vendor: American Megatrends Inc.
Version: 1.EC
Release Date: 05/21/2019 - Up to date
Address: 0xF0000
Runtime Size: 64 kB
ROM Size: 16384 kB
Characteristics:
PCI is supported
BIOS is upgradeable
BIOS shadowing is allowed
Boot from CD is supported
Selectable boot is supported
BIOS ROM is socketed
EDD is supported
5.25"/1.2 MB floppy services are supported (int 13h)
3.5"/720 kB floppy services are supported (int 13h)
3.5"/2.88 MB floppy services are supported (int 13h)
Print screen service is supported (int 5h)
8042 keyboard services are supported (int 9h)
Serial services are supported (int 14h)
Printer services are supported (int 17h)
ACPI is supported
USB legacy is supported
BIOS boot specification is supported
Targeted content distribution is supported
UEFI is supported
BIOS Revision: 5.12
Last edited: