All,
today I did some HA testing on a 4 node cluster, version 5.4.
I configured a HA group including all nodes and added to VMs tho the group.
The two VMS were running on node 1 and node 2.
I also set shutdown_policy=failover to initiate the failover by simply rebooting a node.
Shortly after rebooting node 1, I reveiced two E-mails with the following subjects:
FENCE: Try to fence node 'pve1'
SUCCEED: fencing: acknowledged - got agent lock for node 'pve1'
The VM running on node 1 failed over to node 0.
Everthing looked fine so far.
But once node 1 was back online, it didn't join the cluster anymore.
Do I have to manually remove the fence?
On the network I just see the following communication:
node3.5404 --> node0.5405
node0.5404 --> node2.5405
node1.5404 --> 239.192.204.105.5405
node2.5404 --> 239.192.91.165.5405
Any help for further troubleshooting is really aprechiated.
Cheers,
luphi
today I did some HA testing on a 4 node cluster, version 5.4.
I configured a HA group including all nodes and added to VMs tho the group.
The two VMS were running on node 1 and node 2.
I also set shutdown_policy=failover to initiate the failover by simply rebooting a node.
Shortly after rebooting node 1, I reveiced two E-mails with the following subjects:
FENCE: Try to fence node 'pve1'
SUCCEED: fencing: acknowledged - got agent lock for node 'pve1'
The VM running on node 1 failed over to node 0.
Everthing looked fine so far.
But once node 1 was back online, it didn't join the cluster anymore.
Do I have to manually remove the fence?
On the network I just see the following communication:
node3.5404 --> node0.5405
node0.5404 --> node2.5405
node1.5404 --> 239.192.204.105.5405
node2.5404 --> 239.192.91.165.5405
Any help for further troubleshooting is really aprechiated.
Cheers,
luphi