I'm asking because this old answer appears to be incorrect.
We have a cluster with nine nodes. HA is enabled, with three rings. When it's time to upgrade packages, we do the following:
So. What's the recommended procedure when one wants to reboot all nodes in sequence?
We have a cluster with nine nodes. HA is enabled, with three rings. When it's time to upgrade packages, we do the following:
- Remove a node from all HA groups
- Wait for it to be fully evacuated
- apt update && apt upgrade
- Reboot and wait for it to reappear in the Proxmox UI
- Add it to the HA groups again
- Rinse and repeat.
watchdog-mux[2159]: client watchdog expired - disable watchdog updates
, fence, and reboot. If we're really unlucky, this then triggers a reboot of all the nodes. So, obviously, there needs to be more to it than "systemd is responsible to shutdown services", because that's apparently not happening gracefully.So. What's the recommended procedure when one wants to reboot all nodes in sequence?