It's nowhere in the official PVE docs, but corosync does support last_man_standing, and when used with HA it is suggested to also set wait_for_all. I found some previous threads, but not in relation to HA.

Now I understand the officially endorsed PVE way would be to just use a QDevice, but this does not cover particular situations, for instance maximizing off-grid time by shutting nodes down in a cascade until only 3 remain, running only a few essential HA services that won't overload them.
My question is: has anyone been running this in production, or at least for a reasonably long period on a reasonably large cluster (10+ nodes), and seen any anomalies when nodes go down and then start back up, particularly in how that affects the HA stack on PVE?
Note: As the rebalancing might end up with an even node count, I suppose I'd better set auto_tie_breaker as well, but that should have no influence on the two above.
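
For illustration, the three options would sit in the quorum section of corosync.conf roughly as follows. This is a minimal sketch based on my reading of votequorum(5), not a tested configuration: the window value is just the documented default, the tie-breaker setting is an assumption for this example, and none of it is something the PVE docs endorse.

```
quorum {
  provider: corosync_votequorum

  # Recalculate expected_votes (and therefore quorum) as nodes leave.
  last_man_standing: 1
  # Time in ms the remaining nodes must stay quorate before the
  # recalculation happens (10000 is the documented default).
  last_man_standing_window: 10000

  # After a full cluster start, become quorate only once all nodes
  # have been seen at least once.
  wait_for_all: 1

  # On an even split, the partition holding the tie-breaker node
  # keeps quorum. "lowest" (lowest node ID) is an assumption here.
  auto_tie_breaker: 1
  auto_tie_breaker_node: lowest
}
```

On PVE the file to edit would be /etc/pve/corosync.conf, and config_version in the totem section has to be incremented for the change to propagate. Note that a cascade shutdown only benefits from this if each step waits out the window before the next node goes down, since quorum is recalculated only after the remaining nodes have stayed quorate for that long.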