Hi,
I have a VM with two drives which is set to backup once a week by Proxmox internal backup tool. The backups are stored on an NFS share.
Last backup has failed due to space issues on the NFS server:
After that, the VM became unreachable. I tried to hard reboot it, but it stopped booting at mounting the 2nd virtual drive. Manual mounting the qemu image on the PM host thru guestmount didn't work out within a reasonable time as well.
Then I found the following lines in the host logs:
and was able to recover the virtual drive by cleaning the MMP bit within a single user boot mode:
Anyway, the failing backup should not affect the VM itself.
1. Is that a bug or I'm just missing something?
2. It would be great to add an option to check the storage space before starting the backup and alert by email if the space is not enough
I have a VM with two drives which is set to backup once a week by Proxmox internal backup tool. The backups are stored on an NFS share.
Last backup has failed due to space issues on the NFS server:
Code:
vzdump 122 127 --mailnotification failure --compress zstd --quiet 1 --storage trendy-images --mode snapshot --mailto root@domain
127: 2021-01-16 02:30:03 INFO: Starting Backup of VM 127 (qemu)
127: 2021-01-16 02:30:03 INFO: status = running
127: 2021-01-16 02:30:03 INFO: VM Name: vm-srv1
127: 2021-01-16 02:30:03 INFO: include disk 'scsi0' 'LVM-Storage:127/vm-127-disk-0.qcow2' 10G
127: 2021-01-16 02:30:03 INFO: include disk 'scsi1' 'LVM-Storage:127/vm-127-disk-2.qcow2' 178G
127: 2021-01-16 02:30:03 INFO: backup mode: snapshot
127: 2021-01-16 02:30:03 INFO: ionice priority: 7
127: 2021-01-16 02:30:03 INFO: creating vzdump archive '/mnt/pve/trendy-images/dump/vzdump-qemu-127-2021_01_16-02_30_03.vma.zst'
127: 2021-01-16 02:30:03 INFO: issuing guest-agent 'fs-freeze' command
127: 2021-01-16 02:30:04 INFO: issuing guest-agent 'fs-thaw' command
127: 2021-01-16 02:30:04 INFO: started backup task '3a81f20b-e387-40b9-8979-c85611a56dc7'
127: 2021-01-16 02:30:04 INFO: resuming VM again
127: 2021-01-16 02:30:07 INFO: 0% (224.2 MiB of 188.0 GiB) in 3s, read: 74.7 MiB/s, write: 62.7 MiB/s
...
127: 2021-01-16 04:45:32 ERROR: VM 127 qmp command 'query-backup' failed - got timeout
127: 2021-01-16 04:45:32 INFO: aborting backup job
127: 2021-01-16 04:55:32 ERROR: VM 127 qmp command 'backup-cancel' failed - unable to connect to VM 127 qmp socket - timeout after 5982 retries
127: 2021-01-16 04:57:42 ERROR: Backup of VM 127 failed - VM 127 qmp command 'query-backup' failed - got timeout
After that, the VM became unreachable. I tried to hard reboot it, but it stopped booting at mounting the 2nd virtual drive. Manual mounting the qemu image on the PM host thru guestmount didn't work out within a reasonable time as well.
Then I found the following lines in the host logs:
Code:
ext4_multi_mount_protect: MMP interval higher than expected
Code:
tune2fs -f -E clear_mmp /dev/sdb
Anyway, the failing backup should not affect the VM itself.
1. Is that a bug or I'm just missing something?
2. It would be great to add an option to check the storage space before starting the backup and alert by email if the space is not enough