I also found that my pvestatd was not running this morning.
After investigating i saw this in dmesg:
Out of memory: Kill process 3889 (pvestatd) score 406 or sacrifice child
OOM killed process 3889 (pvestatd) vm:28887328kB, rss:28311020kB, swap:357556kB
I cannot tell if pvestatd used all the memory but in the daemon log file i saw this as the last messages from pvestatd in daemon.log.1 on 5 december (one week later until i found out):
ec 5 19:06:08 NODE01 pvestatd[3889]: WARNING: command 'df -P -B 1 /var/lib/vz/' failed: open3: fork failed: Cannot allocate memory at /usr/share/perl5/PVE/
Tools.pm line 280
Dec 5 19:06:08 NODE01 snmpd[1854]: Connection from UDP: [127.0.0.1]:34268->[127.0.0.1]
Dec 5 19:06:09 NODE01 pvestatd[3889]: WARNING: command '/sbin/vgs --separator : --noheadings --units b --unbuffered --nosuffix --options vg_name,vg_size,vg_
free' failed: open3: fork failed: Cannot allocate memory at /usr/share/perl5/PVE/Tools.pm line 280
Dec 5 19:06:09 NODE01 pvestatd[3889]: WARNING: command '/sbin/vgscan --ignorelockingfailure --mknodes' failed: open3: fork failed: Cannot allocate memory at
/usr/share/perl5/PVE/Tools.pm line 280
Dec 5 19:06:09 NODE01 pvestatd[3889]: WARNING: Use of uninitialized value $storeid in hash element at /usr/share/perl5/PVE/Storage.pm line 665.
Dec 5 19:06:09 NODE01 pvestatd[3889]: WARNING: Use of uninitialized value $storeid in hash element at /usr/share/perl5/PVE/Storage.pm line 666.
Dec 5 19:06:09 NODE01 pvestatd[3889]: status update time (5.206 seconds)
Dec 5 19:06:09 NODE01 snmpd[1854]: Connection from UDP: [127.0.0.1]:34268->[127.0.0.1]
Dec 5 19:06:10 NODE01 snmpd[1854]: Connection from UDP: [127.0.0.1]:34268->[127.0.0.1]
Dec 5 19:06:11 NODE01 snmpd[1854]: Connection from UDP: [127.0.0.1]:34268->[127.0.0.1]
Dec 5 19:06:17 NODE01 pvestatd[3889]: WARNING: Use of uninitialized value $vmid in concatenation (.) or string at /usr/share/perl5/PVE/QemuServer.pm line 13
87.
Dec 5 19:06:17 NODE01 pvestatd[3889]: qemu status update error: unknown file type 'nodes/NODE01/qemu-server/.conf'
Dec 5 19:06:17 NODE01 pvestatd[3889]: WARNING: command 'df -P -B 1 /var/lib/vz/' failed: open3: fork failed: Cannot allocate memory at /usr/share/perl5/PVE/
Tools.pm line 280
Dec 5 19:06:17 NODE01 pvestatd[3889]: WARNING: command '/sbin/vgs --separator : --noheadings --units b --unbuffered --nosuffix --options vg_name,vg_size,vg_
free' failed: open3: fork failed: Cannot allocate memory at /usr/share/perl5/PVE/Tools.pm line 280
Dec 5 19:06:18 NODE01 pvestatd[3889]: WARNING: command '/sbin/vgscan --ignorelockingfailure --mknodes' failed: open3: fork failed: Cannot allocate memory at
/usr/share/perl5/PVE/Tools.pm line 280
Dec 5 19:06:18 NODE01 pvestatd[3889]: WARNING: Use of uninitialized value $storeid in hash element at /usr/share/perl5/PVE/Storage.pm line 665.
Dec 5 19:06:18 NODE01 pvestatd[3889]: WARNING: Use of uninitialized value $storeid in hash element at /usr/share/perl5/PVE/Storage.pm line 666.
These were the last messages and the pvestatd was killed. In my monitoring i can see that it freed up 30GB of memory.
On another cluster node i can see that pvestatd has consumed half of my memory right now so probably that one will be killed somewhere the coming days.
Almost all nodes up times are 118 days
Does an upgrade to 2.2 fix this?
I do have the logs so if you are interested i can post more information?