We upgraded one of our nodes to the latest Proxmox version (pve-manager/6.2-4/9824574a (running kernel: 5.4.34-1-pve)) and now we encounter an issue during backup of one specific VM (Linux guest). The backup starts:
INFO: Starting Backup of VM 123 (qemu)
INFO: Backup started at 2020-05-19 03:01:41
INFO: status = running
INFO: VM Name: test-VM
INFO: include disk 'virtio0' 'local:123/vm-123-disk-1.raw' 220G
INFO: backup mode: snapshot
INFO: bandwidth limit: 250000 KB/s
INFO: ionice priority: 7
INFO: creating archive '/var/lib/vz/dump/vzdump-qemu-123-2020_05_19-03_01_41.vma.lzo'
INFO: started backup task 'bfa59d15-0af8-4726-b21d-6faa2e4dcb33'
INFO: resuming VM again
ERROR: VM 123 qmp command 'cont' failed - got timeout
INFO: aborting backup job
ERROR: Backup of VM 123 failed - VM 123 qmp command 'cont' failed - got timeout
And fails after a few seconds. The guest goes in read only mode and we have to reboot the vm and execute a filesystem check to fix it. This issue does not occur on any other vm's on this server. And there's enough space available at /var/lib/vz/dump.
INFO: Starting Backup of VM 123 (qemu)
INFO: Backup started at 2020-05-19 03:01:41
INFO: status = running
INFO: VM Name: test-VM
INFO: include disk 'virtio0' 'local:123/vm-123-disk-1.raw' 220G
INFO: backup mode: snapshot
INFO: bandwidth limit: 250000 KB/s
INFO: ionice priority: 7
INFO: creating archive '/var/lib/vz/dump/vzdump-qemu-123-2020_05_19-03_01_41.vma.lzo'
INFO: started backup task 'bfa59d15-0af8-4726-b21d-6faa2e4dcb33'
INFO: resuming VM again
ERROR: VM 123 qmp command 'cont' failed - got timeout
INFO: aborting backup job
ERROR: Backup of VM 123 failed - VM 123 qmp command 'cont' failed - got timeout
And fails after a few seconds. The guest goes in read only mode and we have to reboot the vm and execute a filesystem check to fix it. This issue does not occur on any other vm's on this server. And there's enough space available at /var/lib/vz/dump.