pvestatd.service crash every 24 hours

Anime4000

New Member
Jul 3, 2021
5
0
1
32
Hi, this is my first time on post on this forum, however, I encounter repeating error and crash, at first, I just simply restart the service every 14 days, but now it's too frequent

Code:
● pvestatd.service - PVE Status Daemon
     Loaded: loaded (/lib/systemd/system/pvestatd.service; enabled; vendor preset: enabled)
     Active: failed (Result: signal) since Tue 2023-03-07 05:41:04 +08; 1 day 16h ago
    Process: 4067838 ExecStart=/usr/bin/pvestatd start (code=exited, status=0/SUCCESS)
   Main PID: 4067840 (code=killed, signal=SEGV)
        CPU: 6min 46.916s

Mar 06 22:23:34 hitoha systemd[1]: Starting PVE Status Daemon...
Mar 06 22:23:34 hitoha pvestatd[4067840]: starting server
Mar 06 22:23:34 hitoha systemd[1]: Started PVE Status Daemon.
Mar 07 05:41:04 hitoha systemd[1]: pvestatd.service: Main process exited, code=killed, status=11/SEGV
Mar 07 05:41:04 hitoha systemd[1]: pvestatd.service: Failed with result 'signal'.
Mar 07 05:41:04 hitoha systemd[1]: pvestatd.service: Consumed 6min 46.916s CPU time.

I have been looking around the net for solution, it appear restarting the service is not the solution,

I installed on NVMe disk, and run filesystem check and the NVMe disk is healthy and no bad sector

I wondering what cause pvestatd.service to crash at specific time?
 
Hello, could you please post the output of pveversion -v?
 
Hello, could you please post the output of pveversion -v?
Here

Code:
root@hitoha:~# pveversion -v
proxmox-ve: 7.3-1 (running kernel: 5.15.85-1-pve)
pve-manager: 7.3-6 (running version: 7.3-6/723bb6ec)
pve-kernel-helper: 7.3-5
pve-kernel-5.15: 7.3-2
pve-kernel-5.13: 7.1-9
pve-kernel-5.15.85-1-pve: 5.15.85-1
pve-kernel-5.15.83-1-pve: 5.15.83-1
pve-kernel-5.15.74-1-pve: 5.15.74-1
pve-kernel-5.15.35-1-pve: 5.15.35-3
pve-kernel-5.13.19-6-pve: 5.13.19-15
pve-kernel-5.13.19-2-pve: 5.13.19-4
ceph-fuse: 15.2.15-pve1
corosync: 3.1.7-pve1
criu: 3.15-1+pve-1
glusterfs-client: 9.2-1
ifupdown2: 3.1.0-1+pmx3
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-1
libknet1: 1.24-pve2
libproxmox-acme-perl: 1.4.4
libproxmox-backup-qemu0: 1.3.1-1
libpve-access-control: 7.3-1
libpve-apiclient-perl: 3.2-1
libpve-common-perl: 7.3-2
libpve-guest-common-perl: 4.2-3
libpve-http-server-perl: 4.1-5
libpve-storage-perl: 7.3-2
libspice-server1: 0.14.3-2.1
lvm2: 2.03.11-2.1
lxc-pve: 5.0.2-2
lxcfs: 5.0.3-pve1
novnc-pve: 1.4.0-1
proxmox-backup-client: 2.3.3-1
proxmox-backup-file-restore: 2.3.3-1
proxmox-mail-forward: 0.1.1-1
proxmox-mini-journalreader: 1.3-1
proxmox-offline-mirror-helper: 0.5.1-1
proxmox-widget-toolkit: 3.5.5
pve-cluster: 7.3-2
pve-container: 4.4-2
pve-docs: 7.3-1
pve-edk2-firmware: 3.20220526-1
pve-firewall: 4.2-7
pve-firmware: 3.6-3
pve-ha-manager: 3.5.1
pve-i18n: 2.8-3
pve-qemu-kvm: 7.2.0-5
pve-xtermjs: 4.16.0-1
qemu-server: 7.3-4
smartmontools: 7.2-pve3
spiceterm: 3.2-2
swtpm: 0.8.0~bpo11+3
vncterm: 1.7-1
zfsutils-linux: 2.1.9-pve1
root@hitoha:~# uptime
 03:00:26 up 13 days,  9:22,  2 users,  load average: 1.60, 1.10, 0.94
 
Today, the service is killed again, where I can troubleshoot or log? VM and LXC are running fine

scan disk:
Code:
root@hitoha:~# badblocks -sv /dev/nvme0n1
Checking blocks 0 to 488386583
Checking for bad blocks (read-only test): done                                                 
Pass completed, 0 bad blocks found. (0/0/0 errors)


Code:
● pvestatd.service - PVE Status Daemon
     Loaded: loaded (/lib/systemd/system/pvestatd.service; enabled; vendor preset: enabled)
     Active: failed (Result: signal) since Thu 2023-03-09 18:42:58 +08; 4h 48min ago
    Process: 354198 ExecStart=/usr/bin/pvestatd start (code=exited, status=0/SUCCESS)
   Main PID: 354199 (code=killed, signal=SEGV)
        CPU: 16min 51.246s

Mar 08 22:19:17 hitoha systemd[1]: Starting PVE Status Daemon...
Mar 08 22:19:17 hitoha pvestatd[354199]: starting server
Mar 08 22:19:17 hitoha systemd[1]: Started PVE Status Daemon.
Mar 09 18:42:58 hitoha systemd[1]: pvestatd.service: Main process exited, code=killed, status=11/SEGV
Mar 09 18:42:58 hitoha systemd[1]: pvestatd.service: Failed with result 'signal'.
Mar 09 18:42:58 hitoha systemd[1]: pvestatd.service: Consumed 16min 51.246s CPU time.

if I do cron job every 24-hours, it should fix it right? but I losing the stats log
 
nothing in #dmesg or /var/log/kern.log ?
only have this

Code:
root@hitoha:~# cat /var/log/kern.log | grep pvestatd
Mar  7 05:41:04 hitoha kernel: [993804.442368] pvestatd[4067840]: segfault at 2008 ip 0000564fc6e87f90 sp 00007ffebb019f30 error 4 in perl[564fc6dd0000+185000]
 
Memory is fine, I check on other PC,
Downgrading Firmware fix the problem.

Apparently Firmware that I update contain "Mitigate the AMD potential security vulnerabilities for AMD Athlon™ processors and Ryzen™ processors" cause instability.
 
Last edited:
Interesting, I also have this same issue. Seems almost daily I find the service in a "Failed" state. Unfortunately for me, My system is an Intel. I will double check my memory even though it seems that was not your issue.

The system works great otherwise.

Anyone else have any deeper idea's?
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!