Question About Fencing

Feb 6, 2017
10
1
6
62
I'm putting together a cluster and confused about fencing. It looks like the recommended way is to use a watchdog timer to reboot a host that is hung. In my previous experience with clusters, the remaining hosts would "STONITH" the non-responsive host. Is there a reason that this method isn't recommended?

If the host is hung, the watchdog is fine but if there is a cluster network failure and VM are restarted on new hosts, you could have a split brain situation. Using IPMI to power cycle the disconnected server would be my choice.

Thanks
 
If the host is hung, the watchdog is fine but if there is a cluster network failure and VM are restarted on new hosts, you could have a split brain situation.

This is simply not true. The corosync cluster stack detect such situations and the watchdog triggers.
 
I'm putting together a cluster and confused about fencing. It looks like the recommended way is to use a watchdog timer to reboot a host that is hung. In my previous experience with clusters, the remaining hosts would "STONITH" the non-responsive host. Is there a reason that this method isn't recommended?

Fence devices are often brittle and extra components which can fail to. Watchdogs resets (not reboots) the host, they are very simple and do not need network communication to work.

Watchdogs are always loaded, external fence devices are the oppossite and thus may not be reachable in certain failure cases.
See our documentation: https://pve.proxmox.com/pve-docs/chapter-ha-manager.html#ha_manager_fencing

If the host is hung, the watchdog is fine but if there is a cluster network failure and VM are restarted on new hosts, you could have a split brain situation

No, as then you also do not have quorum, which is the underlying principle of all of our operations.
Further, we only start VMs/CTs after getting the failed nodes cluster locks, so no.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!