I have a setup as follows:
2 physical servers with 2 nodes per server.
1 of the physical servers looks like its failed.
I've got the guests backup and running on the other physical server with 2 nodes and everything appears to be fine.
pvecm status
Cluster information
-------------------
Name: vdc-cluster
Config Version: 8
Transport: knet
Secure auth: on
Quorum information
------------------
Date: Mon Aug 8 14:26:52 2022
Quorum provider: corosync_votequorum
Nodes: 2
Node ID: 0x00000004
Ring ID: 4.894
Quorate: Yes
Vote quorum information
----------------------
Expected votes: 2
Highest expected: 2
Total votes: 2
Quorum: 2
Flags: Quorate
Membership information
----------------------
Nodeid Votes Name
0x00000004 1 172.16.1.5 (local)
0x00000005 1 172.16.1.6
My question is now, what are best practices to remove the 2 dead server from the cluster and what is the best method to bring them back online, provided I can get the existing server working or purchase another replacement server.
HA should now be dead with only a 2 server cluster for the time being, unless I bring up a virtual server on another system to act a dummy server to help manage that. but I'm not sure thats in my best interests either.
Thank you for your help.
2 physical servers with 2 nodes per server.
1 of the physical servers looks like its failed.
I've got the guests backup and running on the other physical server with 2 nodes and everything appears to be fine.
pvecm status
Cluster information
-------------------
Name: vdc-cluster
Config Version: 8
Transport: knet
Secure auth: on
Quorum information
------------------
Date: Mon Aug 8 14:26:52 2022
Quorum provider: corosync_votequorum
Nodes: 2
Node ID: 0x00000004
Ring ID: 4.894
Quorate: Yes
Vote quorum information
----------------------
Expected votes: 2
Highest expected: 2
Total votes: 2
Quorum: 2
Flags: Quorate
Membership information
----------------------
Nodeid Votes Name
0x00000004 1 172.16.1.5 (local)
0x00000005 1 172.16.1.6
My question is now, what are best practices to remove the 2 dead server from the cluster and what is the best method to bring them back online, provided I can get the existing server working or purchase another replacement server.
HA should now be dead with only a 2 server cluster for the time being, unless I bring up a virtual server on another system to act a dummy server to help manage that. but I'm not sure thats in my best interests either.
Thank you for your help.