Add Node prevent reboot HA

sander93

Renowned Member
Sep 30, 2014
57
2
73
Hello,

Question, we want to replace some new in the cluster, so remove old node, add new node (and this 6 times).

Previous experience we have is that sometimes during the process off adding a new noding something goes wrong and all the nodes will reboot(fence) itself.

We wan't to prevent this if possible, i know you can disable HA temporary during maintenance in (for example) the network bij stopping LRM and HRM
systemctl stop pve-ha-lrm
systemctl stop pve-ha-hrm

But the new added node will probilly have this services running after joining the cluster i think..

Is there some way to temporary disable all HA functions during this kind of maintenance?

I read something about renaming/move the cluster resource file /etc/pve/ha/resources.cfg
Is it save to doe a: mv /etc/pve/ha/resources.cfg /etc/pve/ha/resources.bak

and after maintenance is done do a mv /etc/pve/ha/resources.bak /etc/pve/ha/resources.cfg

Or is there any better option to do it?
 
Hello

Have you seen the documentation on how to remove a node?

Is there anything not covered in there you need to know?
 
Hi,

Thank you for you response!
I have seen the documentation about it and used it in the past, but most of the times there is no problem but 1 of the 10 times it goes wrong and the whole cluster reboots.

If possible i wan't to prevent this from happen of course, if there is any way to be more save and prevent auto reboots by fencing it gives me more confidence.