Dear All,
I think I have discovered an issue with the VZdump backups which can cause a lot of headaches.
If the backups are KVM/QEMU with the backup mode set to SNAPSHOT and the BACKUP storage runs out of space during a backup.
The backup process continues and causes all of the KVM/QEMU VM’s in the backup schedule to be shutdown.
If any container backups fail because the backup storage is full, they continue to work.
Here is a sanitised snippet from the backup log.
INFO: gzip: stdout: No space left on device
ERROR: Backup of VM 100 failed - command '(cd /mnt/vzsnap0/private/100;find . '(' -regex '^\.$' ')' -o '(' -type 's' -prune ')' -o -print0|sed 's/\\/\\\\/g'|tar cpf - --totals --sparse --numeric-owner --no-recursion --one-file-system --null -T -|gzip) >/mnt/pve/Backup-Storage/dump/vzdump-openvz-100-2015_02_14-23_59_02.tar.dat' failed: exit code 1
cp: closing `/mnt/pve/Backup-Storage/dump/vzdump-openvz-100-2015_02_14-23_59_02.log': No space left on device
INFO: Starting Backup of VM 104 (openvz)
INFO: CTID 104 exist mounted running
INFO: status = running
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating lvm snapshot of /dev/mapper/pve-data ('/dev/pve/vzsnap-host1-0')
INFO: /dev/sdc1: read failed after 0 of 4096 at 103743488: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 103800832: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 0: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 4096: Input/output error
INFO: Logical volume "vzsnap-host1-0" created
INFO: creating archive '/mnt/pve/Backup-Storage/dump/vzdump-openvz-104-2015_02_14-23_59_39.tar.gz'
INFO: gzip: stdout: No space left on device
ERROR: Backup of VM 104 failed - command '(cd /mnt/vzsnap0/private/104;find . '(' -regex '^\.$' ')' -o '(' -type 's' -prune ')' -o -print0|sed 's/\\/\\\\/g'|tar cpf - --totals --sparse --numeric-owner --no-recursion --one-file-system --null -T -|gzip) >/mnt/pve/Backup-Storage/dump/vzdump-openvz-104-2015_02_14-23_59_39.tar.dat' failed: exit code 1
cp: closing `/mnt/pve/Backup-Storage/dump/vzdump-openvz-104-2015_02_14-23_59_39.log': No space left on device
INFO: Starting Backup of VM 111 (openvz)
INFO: CTID 111 exist mounted running
INFO: status = running
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating lvm snapshot of /dev/mapper/pve-data ('/dev/pve/vzsnap-host1-0')
INFO: /dev/sdc1: read failed after 0 of 4096 at 103743488: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 103800832: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 0: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 4096: Input/output error
INFO: Logical volume "vzsnap-host1-0" created
INFO: creating archive '/mnt/pve/Backup-Storage/dump/vzdump-openvz-111-2015_02_15-00_00_17.tar.gz'
INFO: gzip: stdout: No space left on device
ERROR: Backup of VM 111 failed - command '(cd /mnt/vzsnap0/private/111;find . '(' -regex '^\.$' ')' -o '(' -type 's' -prune ')' -o -print0|sed 's/\\/\\\\/g'|tar cpf - --totals --sparse --numeric-owner --no-recursion --one-file-system --null -T -|gzip) >/mnt/pve/Backup-Storage/dump/vzdump-openvz-111-2015_02_15-00_00_17.tar.dat' failed: exit code 1
cp: closing `/mnt/pve/Backup-Storage/dump/vzdump-openvz-111-2015_02_15-00_00_17.log': No space left on device
INFO: Starting Backup of VM 113 (qemu)
INFO: status = running
INFO: update VM 113: -lock backup
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating archive '/mnt/pve/Backup-Storage/dump/vzdump-qemu-113-2015_02_15-00_00_52.vma.gz'
ERROR: client closed connection
INFO: aborting backup job
ERROR: VM 113 not running
ERROR: Backup of VM 113 failed - client closed connection
I think I have discovered an issue with the VZdump backups which can cause a lot of headaches.
If the backups are KVM/QEMU with the backup mode set to SNAPSHOT and the BACKUP storage runs out of space during a backup.
The backup process continues and causes all of the KVM/QEMU VM’s in the backup schedule to be shutdown.
If any container backups fail because the backup storage is full, they continue to work.
Here is a sanitised snippet from the backup log.
INFO: gzip: stdout: No space left on device
ERROR: Backup of VM 100 failed - command '(cd /mnt/vzsnap0/private/100;find . '(' -regex '^\.$' ')' -o '(' -type 's' -prune ')' -o -print0|sed 's/\\/\\\\/g'|tar cpf - --totals --sparse --numeric-owner --no-recursion --one-file-system --null -T -|gzip) >/mnt/pve/Backup-Storage/dump/vzdump-openvz-100-2015_02_14-23_59_02.tar.dat' failed: exit code 1
cp: closing `/mnt/pve/Backup-Storage/dump/vzdump-openvz-100-2015_02_14-23_59_02.log': No space left on device
INFO: Starting Backup of VM 104 (openvz)
INFO: CTID 104 exist mounted running
INFO: status = running
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating lvm snapshot of /dev/mapper/pve-data ('/dev/pve/vzsnap-host1-0')
INFO: /dev/sdc1: read failed after 0 of 4096 at 103743488: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 103800832: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 0: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 4096: Input/output error
INFO: Logical volume "vzsnap-host1-0" created
INFO: creating archive '/mnt/pve/Backup-Storage/dump/vzdump-openvz-104-2015_02_14-23_59_39.tar.gz'
INFO: gzip: stdout: No space left on device
ERROR: Backup of VM 104 failed - command '(cd /mnt/vzsnap0/private/104;find . '(' -regex '^\.$' ')' -o '(' -type 's' -prune ')' -o -print0|sed 's/\\/\\\\/g'|tar cpf - --totals --sparse --numeric-owner --no-recursion --one-file-system --null -T -|gzip) >/mnt/pve/Backup-Storage/dump/vzdump-openvz-104-2015_02_14-23_59_39.tar.dat' failed: exit code 1
cp: closing `/mnt/pve/Backup-Storage/dump/vzdump-openvz-104-2015_02_14-23_59_39.log': No space left on device
INFO: Starting Backup of VM 111 (openvz)
INFO: CTID 111 exist mounted running
INFO: status = running
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating lvm snapshot of /dev/mapper/pve-data ('/dev/pve/vzsnap-host1-0')
INFO: /dev/sdc1: read failed after 0 of 4096 at 103743488: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 103800832: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 0: Input/output error
INFO: /dev/sdc1: read failed after 0 of 4096 at 4096: Input/output error
INFO: Logical volume "vzsnap-host1-0" created
INFO: creating archive '/mnt/pve/Backup-Storage/dump/vzdump-openvz-111-2015_02_15-00_00_17.tar.gz'
INFO: gzip: stdout: No space left on device
ERROR: Backup of VM 111 failed - command '(cd /mnt/vzsnap0/private/111;find . '(' -regex '^\.$' ')' -o '(' -type 's' -prune ')' -o -print0|sed 's/\\/\\\\/g'|tar cpf - --totals --sparse --numeric-owner --no-recursion --one-file-system --null -T -|gzip) >/mnt/pve/Backup-Storage/dump/vzdump-openvz-111-2015_02_15-00_00_17.tar.dat' failed: exit code 1
cp: closing `/mnt/pve/Backup-Storage/dump/vzdump-openvz-111-2015_02_15-00_00_17.log': No space left on device
INFO: Starting Backup of VM 113 (qemu)
INFO: status = running
INFO: update VM 113: -lock backup
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating archive '/mnt/pve/Backup-Storage/dump/vzdump-qemu-113-2015_02_15-00_00_52.vma.gz'
ERROR: client closed connection
INFO: aborting backup job
ERROR: VM 113 not running
ERROR: Backup of VM 113 failed - client closed connection