A single node in our HA enabled Proxmox cluster crashed yesterday. The crash produced a segfault in the HA resource manager (pve‑ha‑lrm) and subsequently stopped the watchdog, which forced a reboot of that node. Below are the exact syslog lines that show the Perl warning (the likely root cause) and the resulting failure cascade.
Proxmox version: 9.1.9
Kernel: Linux 6.17.13-7-pve
Linux distro: Debian Trixie
Are there any known bugs in the current Proxmox 9/HA stack that produce the exact “Attempt to free unreferenced scalar” warning in PVE-HA-LRM (line 871) and the subsequent segfault?
Proxmox version: 9.1.9
Kernel: Linux 6.17.13-7-pve
Linux distro: Debian Trixie
Code:
2026-05-31T15:39:57.155Z Attempt to free unreferenced scalar: SV 0x5d19f136f858, Perl interpreter: 0x5d19ea3c22a0 at /usr/share/perl5/PVE/HA/LRM.pm line 871.
2026-05-31T15:39:57.155Z pve-ha-lrm: segfault at 55 ip 00005d19b06f46a2 sp 00007ffc31574af0 error 4 in perl[946a2,5d19b06a4000+1ae000] likely on CPU 22 (core 21, socket 0)
2026-05-31T15:39:57.155Z Code: 00 48 89 df e8 df 88 0f 00 e9 3d ff ff ff 66 2e 0f 1f 84 00 00 00 00 00 41 54 55 48 89 fd 53 48 8b 7e 08 48 89 f3 4c 8b 66 10 <48> 63 47 04 83 f8 fe 74 3d f6 44 07 09 04 74 1e e8 c9 6c 14 00 48
2026-05-31T15:39:57.163Z client (PID 5875) did not stop watchdog - disable watchdog updates
2026-05-31T15:39:57.167Z pve-ha-lrm.service: Main process exited, code=killed, status=11/SEGV
2026-05-31T15:39:57.170Z pve-ha-lrm.service: Failed with result 'signal'.
2026-05-31T15:39:57.174Z pve-ha-lrm.service: Consumed 2d 20h 22min 5.771s CPU time, 226.7M memory peak.
2026-05-31T15:39:58.164Z exit watchdog-mux with active connections
2026-05-31T15:39:58.169Z watchdog: watchdog0: watchdog did not stop!
2026-05-31T15:39:58.172Z watchdog-mux.service: Deactivated successfully.
Are there any known bugs in the current Proxmox 9/HA stack that produce the exact “Attempt to free unreferenced scalar” warning in PVE-HA-LRM (line 871) and the subsequent segfault?