Hello,
After an update to the latest version, one of our PVE server crash every night during the backup to PBS.
We have the latest version from the entreprise repo (8.1.3).
The problem occurs every time we start the backup job. The problem was not present before the update.
The server become unresponsive at 00:30, just when the backup start. VMs are offline, SSH is offline, everything is down, etc ...
5-10min after, it's online again but with the VM 421 STOP. (cf logs)
We have other nodes with exactly the same version, the same hardware, etc ... and no problems.
It's seems the server crash during the backup of this specific VM.
Ceph is installed on all our nodes but this VM does not use CEPH and have dedicated local nvme drives (2 Kioxia drives KCD6XLUL960G) with a ZFS mirror.
It's an Ubuntu with a Postgresql DB. The QEMU agent is installed on the VM.
Hardware of the host :
Supermicro AS -1114S-WTRT/H12SSW-NT
48 x AMD EPYC 7443P 24-Core Processor (1 Socket)
128Gb RAM
Linux 6.5.11-7-pve (2023-12-05T09:44Z)
In normal times, the host has a load average of 10% CPU, plenty of free RAM and plenty of free disk space.
Thanks for helping.
After an update to the latest version, one of our PVE server crash every night during the backup to PBS.
We have the latest version from the entreprise repo (8.1.3).
The problem occurs every time we start the backup job. The problem was not present before the update.
The server become unresponsive at 00:30, just when the backup start. VMs are offline, SSH is offline, everything is down, etc ...
5-10min after, it's online again but with the VM 421 STOP. (cf logs)
We have other nodes with exactly the same version, the same hardware, etc ... and no problems.
It's seems the server crash during the backup of this specific VM.
Ceph is installed on all our nodes but this VM does not use CEPH and have dedicated local nvme drives (2 Kioxia drives KCD6XLUL960G) with a ZFS mirror.
It's an Ubuntu with a Postgresql DB. The QEMU agent is installed on the VM.
Hardware of the host :
Supermicro AS -1114S-WTRT/H12SSW-NT
48 x AMD EPYC 7443P 24-Core Processor (1 Socket)
128Gb RAM
Linux 6.5.11-7-pve (2023-12-05T09:44Z)
In normal times, the host has a load average of 10% CPU, plenty of free RAM and plenty of free disk space.
Thanks for helping.
Last edited: