Trouble with Backup Configuration

GoodEarthIT

New Member
Oct 23, 2023
1
0
1
Hello, I am very new to Proxmox in general and am working to understand an issue we have run into for my company.

I have a server we recently installed with Proxmox 8 (attempted a lower version, but hardware didn't support it). We set up a Backup configuration by attaching SMB/CIFS storage to a VM and setting up an automatic schedule. It ran successfully the first couple of times with the following logs:

Code:
INFO: starting new backup job: vzdump 101 --quiet 1 --node goodearthmarkets --mode snapshot --compress zstd --notes-template '{{vmid}}{{guestname}}' --prune-backups 'keep-last=6' --storage Backups --mailnotification always --mailto ##############
INFO: Starting Backup of VM 101 (qemu)
INFO: Backup started at 2023-09-10 01:00:02
INFO: status = running
INFO: VM Name: ########
INFO: include disk 'scsi0' 'local-zfs:vm-101-disk-0' 400G
INFO: include disk 'scsi1' 'ServerData:vm-101-disk-0' 800G
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: snapshots found (not included into backup)
INFO: creating vzdump archive '/mnt/pve/Backups/dump/vzdump-qemu-101-2023_09_10-01_00_02.vma.zst'
INFO: skipping guest-agent 'fs-freeze', agent configured but not running?
INFO: started backup task 'cefc159f-9c8a-42ef-8a88-c1caf468b325'
INFO: resuming VM again
INFO:   0% (512.4 MiB of 1.2 TiB) in 3s, read: 170.8 MiB/s, write: 160.0 MiB/s
INFO:   1% (12.0 GiB of 1.2 TiB) in 1m 33s, read: 131.3 MiB/s, write: 74.5 MiB/s
INFO:   2% (24.0 GiB of 1.2 TiB) in 3m 24s, read: 110.4 MiB/s, write: 58.9 MiB/s
INFO:   3% (36.1 GiB of 1.2 TiB) in 5m 3s, read: 125.1 MiB/s, write: 64.5 MiB/s
|...
INFO:  99% (1.2 TiB of 1.2 TiB) in 18m 30s, read: 6.6 GiB/s, write: 0 B/s
INFO: 100% (1.2 TiB of 1.2 TiB) in 18m 32s, read: 298.8 MiB/s, write: 213.8 MiB/s
INFO: backup is sparse: 1.10 TiB (93%) total zero data
INFO: transferred 1.17 TiB in 1112 seconds (1.1 GiB/s)
INFO: archive file size: 60.30GB
INFO: adding notes to backup
INFO: prune older backups with retention: keep-last=6
INFO: pruned 0 backup(s)
INFO: Finished Backup of VM 101 (00:18:41)
INFO: Backup finished at 2023-09-10 01:18:43
INFO: Backup job finished successfully
TASK OK

Then, seemingly out of the blue, the next time it runs it appears to stall when attempting to resume VM again:

Code:
INFO: starting new backup job: vzdump 101 --mailnotification always --mailto ############### --prune-backups 'keep-last=6' --storage Backups --notes-template '{{vmid}}{{guestname}}' --mode snapshot --quiet 1 --node goodearthmarkets --compress zstd
INFO: Starting Backup of VM 101 (qemu)
INFO: Backup started at 2023-09-17 01:00:02
INFO: status = running
INFO: VM Name: ########
INFO: include disk 'scsi0' 'local-zfs:vm-101-disk-0' 400G
INFO: include disk 'scsi1' 'ServerData:vm-101-disk-0' 800G
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: snapshots found (not included into backup)
INFO: creating vzdump archive '/mnt/pve/Backups/dump/vzdump-qemu-101-2023_09_17-01_00_02.vma.zst'
INFO: skipping guest-agent 'fs-freeze', agent configured but not running?
INFO: started backup task '51c93560-a72f-4a0f-87f2-d58d5d87ea39'
INFO: resuming VM again
INFO:   0% (635.6 MiB of 1.2 TiB) in 3s, read: 211.9 MiB/s, write: 201.1 MiB/s

During this time the VM becomes unavailable and the database I am running on it cannot be accessed as needed. I've tried to find others with a similar issue, but haven't found much related to it. I have unfortunately had to power cycle the system to get the VM up and running again, but after the second time I adjusted the backup automation to "--mode stop" which then just errors out completely (code below) and I no longer have to hard reboot.

Code:
INFO: Starting Backup of VM 101 (qemu)
INFO: Backup started at 2023-10-22 01:00:06
INFO: status = running
INFO: backup mode: stop
INFO: ionice priority: 7
INFO: VM Name: ########
INFO: include disk 'scsi0' 'local-zfs:vm-101-disk-0' 400G
INFO: include disk 'scsi1' 'ServerData:vm-101-disk-0' 800G
INFO: stopping virtual guest
INFO: VM quit/powerdown failed
ERROR: Backup of VM 101 failed - command 'qm shutdown 101 --skiplock --keepActive --timeout 600' failed: exit code 255
INFO: Failed at 2023-10-22 01:10:06
INFO: Backup job finished with errors
TASK ERROR: job errors

I appreciate any help that can be offered and am not sure where to start in troubleshooting and fixing this issue.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!