PVE Backup to CIFS

shoffman

Member
Apr 13, 2021
10
1
8
40
I have created a PVE backup schedule for our production VMs to a network Windows based CIFS target. The backup schedule is in Stop Mode with ZSTD compression. Retention on the CIFS storage is 3 copies. All of our VMs back up successfully except one, VM 106. If I delete all of the VM 106 backup files on the CIFS storage I can get a successful backup. However, if I let the job continue to run the next day(s) I receive the following errors in the log. I have one backup job that backs all of the VMs up on the same schedule after hours. I have successful 3 copy backups of each of them without issue, except VM 106.

INFO: Starting Backup of VM 106 (qemu) INFO: Backup started at 2021-04-13 01:12:34 INFO: status = running INFO: backup mode: stop INFO: ionice priority: 7 INFO: VM Name: TCC-FS INFO: include disk 'virtio0' 'VMOS:vm-106-disk-0' 100G INFO: include disk 'virtio1' 'VMOS:vm-106-disk-1' 250G INFO: stopping vm INFO: creating vzdump archive '/mnt/pve/CIFS/dump/vzdump-qemu-106-2021_04_13-01_12_34.vma.zst' INFO: starting kvm to execute backup task INFO: started backup task 'b85d97d4-9101-4ea8-8687-fbbcc437e5f4' INFO: resuming VM again after 20 seconds INFO: 0% (435.0 MiB of 350.0 GiB) in 3s, read: 145.0 MiB/s, write: 130.7 MiB/s INFO: 1% (3.6 GiB of 350.0 GiB) in 29s, read: 125.4 MiB/s, write: 120.5 MiB/s INFO: 2% (7.0 GiB of 350.0 GiB) in 57s, read: 125.7 MiB/s, write: 125.3 MiB/s INFO: 3% (10.5 GiB of 350.0 GiB) in 1m 26s, read: 122.5 MiB/s, write: 122.3 MiB/s INFO: 4% (14.1 GiB of 350.0 GiB) in 1m 55s, read: 126.0 MiB/s, write: 125.8 MiB/s INFO: 5% (17.6 GiB of 350.0 GiB) in 2m 23s, read: 128.7 MiB/s, write: 128.6 MiB/s INFO: 6% (21.0 GiB of 350.0 GiB) in 2m 51s, read: 126.0 MiB/s, write: 125.8 MiB/s INFO: 7% (24.6 GiB of 350.0 GiB) in 3m 22s, read: 117.4 MiB/s, write: 117.4 MiB/s INFO: 8% (28.0 GiB of 350.0 GiB) in 3m 52s, read: 116.2 MiB/s, write: 116.2 MiB/s INFO: 9% (31.6 GiB of 350.0 GiB) in 4m 24s, read: 114.4 MiB/s, write: 114.4 MiB/s INFO: 10% (35.0 GiB of 350.0 GiB) in 4m 56s, read: 110.9 MiB/s, write: 110.9 MiB/s ... INFO: 90% (316.1 GiB of 350.0 GiB) in 1h 20m 57s, read: 1.2 GiB/s, write: 0 B/s INFO: 91% (319.7 GiB of 350.0 GiB) in 1h 21m, read: 1.2 GiB/s, write: 0 B/s INFO: 92% (322.8 GiB of 350.0 GiB) in 1h 21m 3s, read: 1.1 GiB/s, write: 0 B/s INFO: 93% (326.0 GiB of 350.0 GiB) in 1h 21m 6s, read: 1.1 GiB/s, write: 0 B/s INFO: 94% (329.5 GiB of 350.0 GiB) in 1h 21m 9s, read: 1.2 GiB/s, write: 0 B/s INFO: 95% (332.9 GiB of 350.0 GiB) in 1h 21m 12s, read: 1.1 GiB/s, write: 0 B/s INFO: 96% (336.3 GiB of 350.0 GiB) in 1h 21m 15s, read: 1.1 GiB/s, write: 0 B/s INFO: 97% (340.0 GiB of 350.0 GiB) in 1h 21m 18s, read: 1.2 GiB/s, write: 0 B/s INFO: 98% (343.8 GiB of 350.0 GiB) in 1h 21m 22s, read: 982.0 MiB/s, write: 0 B/s INFO: 99% (346.7 GiB of 350.0 GiB) in 1h 21m 25s, read: 996.0 MiB/s, write: 0 B/s zstd: /*stdout*\: Input/output error INFO: 100% (350.0 GiB of 350.0 GiB) in 1h 21m 29s, read: 833.0 MiB/s, write: 1.0 KiB/s INFO: backup is sparse: 81.83 GiB (23%) total zero data INFO: transferred 350.00 GiB in 4889 seconds (73.3 MiB/s) ERROR: Backup of VM 106 failed - zstd --rsyncable --threads=1 failed - wrong exit status 1 INFO: Failed at 2021-04-13 02:34:37

I have Googled the errors without finding a solution. I was hoping that someone could point me in the right direction as to why this VM 106 is giving me trouble. I have ample space on the CIFS target and on the DAS ceph where the VM(s) live. The cluster is on version 6.3-6.

Thank you for any information you can offer.
 
Last edited:
Have you tried a different compression algorithm for that VM?
 
Have you tried a different compression algorithm for that VM?
I will remove VM 106 from the main backup job and create a new one for it using a different compression mode for tonight and then report back, Thank you.