Setup:
- Proxmox VE (PVE) 8.4.1 (LVM-thin storage)
  – VM running Jenkins
- Proxmox Backup Server 3.4.0
  – Used as backup storage
  – Backup job configured on PVE in snapshot mode
During one of the backup runs, a network error occurred and the backup process did not complete properly:
Bash:
INFO: 95% (66.9 GiB of 70.3 GiB) in 24m 17s, read: 48.3 KiB/s, write: 48.3 KiB/s
ERROR: backup write data failed: command error: protocol canceled
INFO: aborting backup job
INFO: resuming VM again
ERROR: Backup of VM 100 failed - backup write data failed: command error: protocol canceled
INFO: Failed at 2025-09-03 22:54:22
INFO: Backup job finished with errors
INFO: notified via target `mail-to-root`
TASK ERROR: job errors
At the start of the snapshot backup, fsfreeze was called inside the VM:
Bash:
Sep 03 22:30:04 jenkins qemu-ga[629]: info: guest-fsfreeze called
According to the log, the VM was “resumed” after the backup failure:
Bash:
INFO: resuming VM again
But in reality, the filesystem inside the guest never got unfrozen. The VM stayed in a frozen state and eventually crashed.
It looks like freeze and unfreeze are tied to the same script, and if the process fails, unfreeze is skipped despite the log showing “resumed.”
Questions:
- Has anyone else encountered this mismatch (log says resumed, but guest is still frozen)?
- Is there a way to modify the script or add a safeguard to guarantee fsfreeze --unfreeze is always executed even if the backup fails?
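One safeguard I am considering is a vzdump hook script that thaws the guest whenever a backup aborts. Below is a minimal sketch, not a tested fix. It assumes that vzdump calls the hook with `backup-abort <mode> <vmid>` in this failure case, that the guest agent is still responsive at that point, and that `qm guest cmd <vmid> fsfreeze-thaw` is harmless when the guest is already thawed. The function names `thaw_command` and `handle` are my own.

```python
#!/usr/bin/env python3
"""Hypothetical vzdump hook script: thaw the guest if a backup aborts."""
import subprocess
import sys


def thaw_command(vmid):
    # `qm guest cmd <vmid> fsfreeze-thaw` asks the QEMU guest agent to thaw.
    return ["qm", "guest", "cmd", str(vmid), "fsfreeze-thaw"]


def handle(args, run=subprocess.run):
    """Run the thaw command when vzdump reports an aborted guest backup.

    Assumed call convention: <script> <phase> [<mode> <vmid>].
    Returns the command that was attempted, or None if nothing was done.
    """
    if len(args) < 3 or args[0] != "backup-abort":
        return None
    cmd = thaw_command(args[2])
    try:
        # Timeout so a stuck guest agent cannot block the backup job forever.
        run(cmd, check=True, timeout=60)
    except (subprocess.SubprocessError, OSError) as exc:
        # Only log the failure; the hook should not add errors of its own.
        print(f"fsfreeze-thaw failed for VM {args[2]}: {exc}", file=sys.stderr)
    return cmd


if __name__ == "__main__":
    handle(sys.argv[1:])
```

To try it, the script would need to be executable and registered with a `script:` line in `/etc/vzdump.conf` or with the `--script` option of vzdump. Until something like this is confirmed to work, a frozen guest can be checked and thawed by hand from the PVE host with `qm guest cmd 100 fsfreeze-status` and `qm guest cmd 100 fsfreeze-thaw`.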