3 node cluster with 1 node red, pvestatd problem

dendi

Renowned Member
Nov 17, 2011
126
9
83
Hi,
I have a problem with a three node cluster, PVE 4.3 iso version
The second node is not accessible from the GUI but VMs are still running, except for planned backups...

grep pvestat /var/log/daemon.log
Code:
grep pvestat /var/log/daemon.log
Jul  3 08:24:04 pvestatd[1830]: restarting server after 90430 cycles to reduce memory usage (free 75649024 (5246976) bytes)
Jul  3 08:24:04 pvestatd[1830]: server shutdown (restart)
Jul  4 16:33:53 systemd[1]: pvestatd.service stopping timed out. Terminating.
Jul  4 16:35:24 systemd[1]: pvestatd.service stop-sigterm timed out. Killing.
Jul  4 16:36:54 systemd[1]: pvestatd.service still around after SIGKILL. Ignoring.
Jul  4 16:38:24 systemd[1]: pvestatd.service stop-final-sigterm timed out. Killing.
Jul  4 16:39:54 systemd[1]: pvestatd.service still around after final SIGKILL. Entering failed mode.
Jul  4 16:39:54 systemd[1]: Unit pvestatd.service entered failed state.

On Jul 4 16:33 I tryed
Code:
systemctl stop pvestatd.service

ps auxw | grep pvestat shows:
Code:
root      1830  0.7  0.0 232696 59120 ?        Ds   Jun12 249:49 /usr/bin/perl /usr/bin/pvestatd start
root     31732  0.0  0.0 232596 59212 ?        Ds   Jul04   0:00 /usr/bin/perl /usr/bin/pvestatd stop

I tryed to kill -9 the two start and stop processes but nothing happens

Linux 4.4.19-1-pve #1 SMP Wed Sep 14 14:33:50 CEST 2016 x86_64 GNU/Linux

Could you suggest me how to find a solution without rebooting the node?

Thank you very much