Hello,
I've configured the three nodes (Supermicro Server) of my Proxmox/Ceph Cluster (newest version) for using hardware watchdog
(https://pve.proxmox.com/wiki/High_Availability_Cluster_4.x#Hardware_Watchdogs):
1.) Enable watchdog in BIOS
2.) /etc/modprobe.d/ipmi_watchdog.conf:
options ipmi_watchdog action=power_cycle panic_wdt_timeout=10
3.) /etc/default/grub:
GRUB_CMDLINE_LINUX_DEFAULT="quiet nmi_watchdog=0"
4.) /etc/default/pve-ha-manager:
WATCHDOG_MODULE=ipmi_watchdog
After that the three servers were rebooting all the time, so I have to undo these configuration steps.
Any ideas ?
Thanks
Stefan
I've configured the three nodes (Supermicro Server) of my Proxmox/Ceph Cluster (newest version) for using hardware watchdog
(https://pve.proxmox.com/wiki/High_Availability_Cluster_4.x#Hardware_Watchdogs):
1.) Enable watchdog in BIOS
2.) /etc/modprobe.d/ipmi_watchdog.conf:
options ipmi_watchdog action=power_cycle panic_wdt_timeout=10
3.) /etc/default/grub:
GRUB_CMDLINE_LINUX_DEFAULT="quiet nmi_watchdog=0"
4.) /etc/default/pve-ha-manager:
WATCHDOG_MODULE=ipmi_watchdog
After that the three servers were rebooting all the time, so I have to undo these configuration steps.
Any ideas ?
Thanks
Stefan