Proxmox server crash

kermitxyz

New Member
Jun 26, 2025
Hello

My Proxmox server crashed with the following errors in the log. Could someone please explain how to interpret this? Thank you.

Code:
Jun 30 14:02:24 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang:
                                  TDH                  <fc>
                                  TDT                  <1b>
                                  next_to_use          <1b>
                                  next_to_clean        <fb>
                                buffer_info[next_to_clean]:
                                  time_stamp           <10479cedf>
                                  next_to_watch        <fc>
                                  jiffies              <10479d300>
                                  next_to_watch.status <0>
                                MAC Status             <80083>
                                PHY Status             <796d>
                                PHY 1000BASE-T Status  <3800>
                                PHY Extended Status    <3000>
                                PCI Status             <10>
Jun 30 14:02:25 bagpuss corosync[1097]:   [KNET  ] link: host: 2 link: 0 is down
Jun 30 14:02:25 bagpuss corosync[1097]:   [KNET  ] host: host: 2 (passive) best link: 0 (pri: 1)
Jun 30 14:02:25 bagpuss corosync[1097]:   [KNET  ] host: host: 2 has no active links
Jun 30 14:02:25 bagpuss corosync[1097]:   [TOTEM ] Token has not been received in 2250 ms
Jun 30 14:02:26 bagpuss corosync[1097]:   [TOTEM ] A processor failed, forming new configuration: token timed out (3000ms), waiting 3600>
Jun 30 14:02:26 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang:
                                  TDH                  <fc>
                                  TDT                  <1b>
                                  next_to_use          <1b>
                                  next_to_clean        <fb>
                                buffer_info[next_to_clean]:
                                  time_stamp           <10479cedf>
                                  next_to_watch        <fc>
                                  jiffies              <10479dac0>
                                  next_to_watch.status <0>
                                MAC Status             <80083>
                                PHY Status             <796d>
                                PHY 1000BASE-T Status  <3800>
                                PHY Extended Status    <3000>
                                PCI Status             <10>
Jun 30 14:02:28 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang:
                                  TDH                  <fc>
                                  TDT                  <1b>
                                  next_to_use          <1b>
                                  next_to_clean        <fb>
                                buffer_info[next_to_clean]:
                                  time_stamp           <10479cedf>
                                  next_to_watch        <fc>
                                  jiffies              <10479e280>
                                  next_to_watch.status <0>
                                MAC Status             <80083>
                                PHY Status             <796d>
                                PHY 1000BASE-T Status  <3800>
                                PHY Extended Status    <3000>
                                PCI Status             <10>
Jun 30 14:02:30 bagpuss corosync[1097]:   [QUORUM] Sync members[1]: 1
Jun 30 14:02:30 bagpuss corosync[1097]:   [QUORUM] Sync left[1]: 2
Jun 30 14:02:30 bagpuss corosync[1097]:   [TOTEM ] A new membership (1.78) was formed. Members left: 2
Jun 30 14:02:30 bagpuss corosync[1097]:   [TOTEM ] Failed to receive the leave message. failed: 2
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] notice: members: 1/948
Jun 30 14:02:30 bagpuss pmxcfs[948]: [status] notice: members: 1/948
Jun 30 14:02:30 bagpuss corosync[1097]:   [QUORUM] This node is within the non-primary component and will NOT provide any services.
Jun 30 14:02:30 bagpuss corosync[1097]:   [QUORUM] Members[1]: 1
Jun 30 14:02:30 bagpuss corosync[1097]:   [MAIN  ] Completed service synchronization, ready to provide service.
Jun 30 14:02:30 bagpuss pmxcfs[948]: [status] notice: node lost quorum
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: received write while not quorate - trigger resync
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: leaving CPG group
Jun 30 14:02:30 bagpuss pve-ha-lrm[1186]: unable to write lrm status file - unable to open file '/etc/pve/nodes/bagpuss/lrm_status.tmp.1>
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] notice: start cluster connection
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: cpg_join failed: CS_ERR_EXIST
Jun 30 14:02:30 bagpuss pmxcfs[948]: [dcdb] crit: can't initialize service
Jun 30 14:02:30 bagpuss kernel: e1000e 0000:00:1f.6 nic0: Detected Hardware Unit Hang:
 
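Your network card hung: the e1000e driver reports "Detected Hardware Unit Hang" on nic0, and right after that corosync loses its link to host 2 and this node loses quorum. Unfortunately, this is a common issue with e1000e.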
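
To help read the hang reports themselves, here is a rough sketch that just does the arithmetic on the hex values from the log. The variable names are mine, and two things are assumptions rather than facts from the log: that the kernel counts about 1000 jiffies per second (the roughly 2-second spacing of the reports suggests it), and that the transmit ring has 256 entries (check with `ethtool -g nic0`).

```python
# Hex values copied from the three "Detected Hardware Unit Hang" reports.
time_stamp = 0x10479cedf  # jiffies value stored with the stuck transmit buffer
jiffies_at_report = [0x10479d300, 0x10479dac0, 0x10479e280]

# How long the same buffer had been waiting at each report, in jiffies.
ages = [j - time_stamp for j in jiffies_at_report]

# Jiffies elapsed between consecutive reports.
steps = [b - a for a, b in zip(jiffies_at_report, jiffies_at_report[1:])]

# Ring pointers from the log. RING_SIZE is an ASSUMPTION, not in the log.
RING_SIZE = 256
tdh = 0xFC  # transmit descriptor head (hardware side)
tdt = 0x1B  # transmit descriptor tail (driver side)
queued = (tdt - tdh) % RING_SIZE  # descriptors handed over but not yet sent

print("buffer age at each report (jiffies):", ages)
print("jiffies between reports:", steps)
print("descriptors still queued (assuming 256-entry ring):", queued)
```

The points to take away: time_stamp, TDH and next_to_clean are identical in all three reports while jiffies keeps growing, so it is the same packet stuck the whole time and the card is not making progress. next_to_watch.status <0> fits that picture, since the descriptor was never marked as completed.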

See for example:
 