Trying to test self fencing in PVE 4. I am using 3 Dell workstations as PoC, which have a HW(?) watchdog: iTCO_wdt.
If I load the module, a new watchdog device /dev/watchdog1 appears. echo 1 > /dev/watchdog1 reboots the node in a few seconds.
But there seems to be another watchdog device, /dev/watchdog0 which causes watchdog-mux to fail.
I have no idea what device /dev/watchdog0 is, tried to disable all possible watchdogs with kernel boot param: nmi_watchdog=0 soft_watchdog=0, blacklisting modules, poking around in /sys/devices/virtual/watchdog/watchdog0, but it remains, and does not seem to like:
Any idea how to work around the situation, remove this watchdog0 device, make watchdog-mux use a specific watchdog device rather than the /dev/watchdog?
PVE4 installed on top of jessie, or from ISO does not seem to matter,
If I load the module, a new watchdog device /dev/watchdog1 appears. echo 1 > /dev/watchdog1 reboots the node in a few seconds.
But there seems to be another watchdog device, /dev/watchdog0 which causes watchdog-mux to fail.
Code:
strace -f watchdog-mux
...
stat("/run/watchdog-mux.active", 0x7ffee01e0c90) = -1 ENOENT (No such file or directory)
stat("/dev/watchdog", {st_mode=S_IFCHR|0600, st_rdev=makedev(10, 130), ...}) = 0
open("/dev/watchdog", O_WRONLY) = 3
ioctl(3, WDIOC_SETTIMEOUT, 0x603134) = -1 EINVAL (Invalid argument)
dup(2) = 4
fcntl(4, F_GETFL) = 0x8002 (flags O_RDWR|O_LARGEFILE)
fstat(4, {st_mode=S_IFCHR|0600, st_rdev=makedev(136, 1), ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f5a8eb9a000
lseek(4, 0, SEEK_CUR) = -1 ESPIPE (Illegal seek)
write(4, "watchdog set timeout: Invalid ar"..., 39watchdog set timeout: Invalid argument
) = 39
close(4) = 0
munmap(0x7f5a8eb9a000, 4096) = 0
write(3, "V", 1) = 1
close(3) = 0
exit_group(1) = ?
+++ exited with 1 +++
I have no idea what device /dev/watchdog0 is, tried to disable all possible watchdogs with kernel boot param: nmi_watchdog=0 soft_watchdog=0, blacklisting modules, poking around in /sys/devices/virtual/watchdog/watchdog0, but it remains, and does not seem to like:
Code:
ioctl(3, WDIOC_SETTIMEOUT, 0x603134)
Any idea how to work around the situation, remove this watchdog0 device, make watchdog-mux use a specific watchdog device rather than the /dev/watchdog?
PVE4 installed on top of jessie, or from ISO does not seem to matter,
Code:
uname -a
Linux proxmox4 4.2.2-1-pve #1 SMP Mon Oct 5 18:23:31 CEST 2015 x86_64 GNU/Linux
Attachments
Last edited: