Node failure

pagy

New Member
Apr 17, 2025
2
1
3
Hi all,

I've had a node die on me and so I was going to remove it from the cluster by following: https://pve.proxmox.com/wiki/Cluster_Manager#_remove_a_cluster_node but when I do pvecm status the dead node (pve2) isn't listed:

root@pve1:~# pvecm nodes

Membership information
----------------------
Nodeid Votes Name
3 1 pve3
4 1 pve1 (local)
root@pve1:~#

But pvecm status shows 3 quorum votes:

root@pve1:~# pvecm status
Cluster information
-------------------
Name: pve-cluster-01
Config Version: 7
Transport: knet
Secure auth: on

Quorum information
------------------
Date: Thu Apr 17 10:58:19 2025
Quorum provider: corosync_votequorum
Nodes: 2
Node ID: 0x00000004
Ring ID: 3.184d
Quorate: Yes

Votequorum information
----------------------
Expected votes: 3
Highest expected: 3
Total votes: 2
Quorum: 2
Flags: Quorate

Membership information
----------------------
Nodeid Votes Name
0x00000003 1 192.168.0.201
0x00000004 1 192.168.0.202


pve2 isn't coming back so i just want it gone now. Do I just remove it from /etc/pve/nodes ?

I'm not worried about the VMs that were on it. They have been restored from backup to another node now.

Thanks
 
I sorted this. In case anyone else ever finds this in the future I found the answer on https://forum.proxmox.com/threads/not-able-to-remove-cluster-node.147628/
Basically removed the dead node from /etc/pve/corosync .conf
systemctl restart pve-cluster
systemctl restart corosync

That removed the node from the cluster and pvecm status then reports the correct number of votes (2)
Then to remove the dead node from the web gui remove from /etc/pve/nodes/nodename - just remove the whole directory
 
  • Like
Reactions: leesteken