ipmi_watchgod not work

david_tao

Active Member
Jul 28, 2022
119
30
33
Dear everyone:
I followed https://pve.proxmox.com/wiki/High_Availability_Cluster_4.x to configured ipmi_watchdog. After I enabled this option "watch Dog Function" in BIOS for Supermicro H12SSL-i motherboard, The is reboot repeatedly every 5 minutes! Does anyone known am I missed configure something for ipmi_watchdog?
Code:
root@pve:~# cat /etc/default/pve-ha-manager
# select watchdog module (default is softdog)
WATCHDOG_MODULE=ipmi_watchdog

root@pve:~# cat /etc/modprobe.d/ipmi_watchdog.conf
options ipmi_watchdog action=power_cycle panic_wdt_timeout=10

root@pve:~# cat /etc/default/grub
GRUB_CMDLINE_LINUX_DEFAULT="quiet nmi_watchdog=0"

root@pve:~# systemctl status watchdog-mux.service
● watchdog-mux.service - Proxmox VE watchdog multiplexer
     Loaded: loaded (/lib/systemd/system/watchdog-mux.service; static)
     Active: active (running) since Mon 2025-06-30 23:27:05 CST; 2min 41s ago
   Main PID: 2224 (watchdog-mux)
      Tasks: 1 (limit: 309032)
     Memory: 200.0K
        CPU: 17ms
     CGroup: /system.slice/watchdog-mux.service
             └─2224 /usr/sbin/watchdog-mux

Jun 30 23:27:05 pve67 systemd[1]: Started watchdog-mux.service - Proxmox VE watchdog multiplexer.
Jun 30 23:27:05 pve67 watchdog-mux[2224]: Loading watchdog module 'ipmi_watchdog'
Jun 30 23:27:05 pve67 watchdog-mux[2224]: Watchdog driver 'IPMI', version 1

root@pve:~# date && ipmitool mc watchdog get
Mon Jun 30 11:33:03 PM CST 2025
Watchdog Timer Use:     SMS/OS (0x44)
Watchdog Timer Is:      Started/Running
Watchdog Timer Logging: On
Watchdog Timer Action:  Power Cycle (0x03)
Pre-timeout interrupt:  None
Pre-timeout interval:   0 seconds
Timer Expiration Flags: (0x10)
                        * SMS/OS
Initial Countdown:      10.0 sec
Present Countdown:      10.0 sec

root@pve:~# date && ipmitool mc watchdog get
Mon Jun 30 11:33:04 PM CST 2025
Watchdog Timer Use:     SMS/OS (0x44)
Watchdog Timer Is:      Started/Running
Watchdog Timer Logging: On
Watchdog Timer Action:  Power Cycle (0x03)
Pre-timeout interrupt:  None
Pre-timeout interval:   0 seconds
Timer Expiration Flags: (0x10)
                        * SMS/OS
Initial Countdown:      10.0 sec
Present Countdown:      9.6 sec
With the output of above ipmitool mc watchdog get command, I'm assume the watchdog function is working properly, but the system is still restarted every 5 minutes!
1751690877290.png