Hello
My Proxmox server crashed with the following errors in the log. Please could someone explain how to interpret this? Thank you
My Proxmox server crashed with the following errors in the log. Please could someone explain how to interpret this? Thank you
Code:
Jun 30 14:02:24 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang:
TDH <fc>
TDT <1b>
next_to_use <1b>
next_to_clean <fb>
buffer_info[next_to_clean]:
time_stamp <10479cedf>
next_to_watch <fc>
jiffies <10479d300>
next_to_watch.status <0>
MAC Status <80083>
PHY Status <796d>
PHY 1000BASE-T Status <3800>
PHY Extended Status <3000>
PCI Status <10>
Jun 30 14:02:25 bagpuss corosync[1097]: [KNET ] link: host: 2 link: 0 is down
Jun 30 14:02:25 bagpuss corosync[1097]: [KNET ] host: host: 2 (passive) best link: 0 (pri: 1)
Jun 30 14:02:25 bagpuss corosync[1097]: [KNET ] host: host: 2 has no active links
Jun 30 14:02:25 bagpuss corosync[1097]: [TOTEM ] Token has not been received in 2250 ms
Jun 30 14:02:26 bagpuss corosync[1097]: [TOTEM ] A processor failed, forming new configuration: token timed out (3000ms), waiting 3600>
Jun 30 14:02:26 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang:
TDH <fc>
TDT <1b>
next_to_use <1b>
next_to_clean <fb>
buffer_info[next_to_clean]:
time_stamp <10479cedf>
next_to_watch <fc>
jiffies <10479dac0>
next_to_watch.status <0>
MAC Status <80083>
PHY Status <796d>
PHY 1000BASE-T Status <3800>
PHY Extended Status <3000>
PCI Status <10>
Jun 30 14:02:28 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang:
TDH <fc>
TDT <1b>
next_to_use <1b>
next_to_clean <fb>
buffer_info[next_to_clean]:
time_stamp <10479cedf>
next_to_watch <fc>
jiffies <10479e280>
next_to_watch.status <0>
MAC Status <80083>
PHY Status <796d>
PHY 1000BASE-T Status <3800>
PHY Extended Status <3000>
PCI Status <10>
Jun 30 14:02:30 bagpuss corosync[1097]: [QUORUM] Sync members[1]: 1
Jun 30 14:02:30 bagpuss corosync[1097]: [QUORUM] Sync left[1]: 2
Jun 30 14:02:30 bagpuss corosync[1097]: [TOTEM ] A new membership (1.78) was formed. Members left: 2
Jun 30 14:02:30 bagpuss corosync[1097]: [TOTEM ] Failed to receive the leave message. failed: 2
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] notice: members: 1/948
Jun 30 14:02:30 bagpuss pmxcfs[948]: [status] notice: members: 1/948
Jun 30 14:02:30 bagpuss corosync[1097]: [QUORUM] This node is within the non-primary component and will NOT provide any services.
Jun 30 14:02:30 bagpuss corosync[1097]: [QUORUM] Members[1]: 1
Jun 30 14:02:30 bagpuss corosync[1097]: [MAIN ] Completed service synchronization, ready to provide service.
Jun 30 14:02:30 bagpuss pmxcfs[948]: [status] notice: node lost quorum
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: received write while not quorate - trigger resync
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: leaving CPG group
Jun 30 14:02:30 bagpuss pve-ha-lrm[1186]: unable to write lrm status file - unable to open file '/etc/pve/nodes/bagpuss/lrm_status.tmp.1>
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] notice: start cluster connection
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: cpg_join failed: CS_ERR_EXIST
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: can't initialize service
Jun 30 14:02:30 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang: