I am currently having a similar situation using Dell machines (offering IPMIv2) and older HP Machines with ILO2.
The question is what happens if one node fails and the network switch that the IPMI is connected to fails as well.
Further more I am missing some documentation how to activate the IPMI setting as the fencing device. I followed the docu and set WATCHDOG_MODULE=ipmi_watchdog in /etc/default/pve-ha-manager and set the GRUB option to nmi_watchdog=0.
What else has to be done? There must be some config to tell Proxmox about the credentials, right?
Should the be done according to the old documentation within /etc/pve/cluster.conf?
I found out, that I had a wrong understanding of how the ipmi watchdog is used. Basically it needs a driver to talk to a piece of hardware within the baseboard management controller (BMC). There is no communication via LAN, but direct access to the IPMI/BMC hardware, that gets polled and commands can be set to shutdown the machine.
There is no need to submit credentials and no problem if network fails. Nothing has to be done except making sure, that the ipmi driver is loaded, proxmox is told which watchdog to use and the GRUB setting is made. You can check your IPMI configuration with:
If you want to simulate the proper fencing, execute the following and the node should reset within a few seconds.
Code:
echo "A" | socat - UNIX-CONNECT:/var/run/watchdog-mux.sock
Nevertheless I experienced problems with HP ILO2. Those machines did not fence correctly in my test setting. So I used the soft watchdog for those machines.
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.