Unable to backup VM to Proxmox backup server.

tuxis

Famous Member
Jan 3, 2014
251
238
108
Ede, NL
www.tuxis.nl
When we try to backup a VM we get the following error:

364: 2023-08-31 01:34:08 INFO: Starting Backup of VM 364 (qemu)
364: 2023-08-31 01:34:08 INFO: status = running
364: 2023-08-31 01:34:08 INFO: VM Name: CORP-DC
364: 2023-08-31 01:34:08 INFO: include disk 'scsi0' 'Ceph:vm-364-disk-1' 800G
364: 2023-08-31 01:34:08 INFO: include disk 'scsi1' 'Ceph:vm-364-disk-3' 300G
364: 2023-08-31 01:34:08 INFO: include disk 'scsi2' 'Ceph:vm-364-disk-4' 200G
364: 2023-08-31 01:34:08 INFO: include disk 'scsi3' 'Ceph:vm-364-disk-5' 500G
364: 2023-08-31 01:34:08 INFO: include disk 'scsi4' 'Ceph:vm-364-disk-6' 300G
364: 2023-08-31 01:34:08 INFO: include disk 'scsi5' 'Ceph:vm-364-disk-7' 250G
364: 2023-08-31 01:34:08 INFO: include disk 'scsi6' 'Ceph:vm-364-disk-8' 350G
364: 2023-08-31 01:34:08 INFO: include disk 'efidisk0' 'Ceph:vm-364-disk-0' 1M
364: 2023-08-31 01:34:08 INFO: include disk 'tpmstate0' 'Ceph:vm-364-disk-2' 4M
364: 2023-08-31 01:34:08 INFO: backup mode: snapshot
364: 2023-08-31 01:34:08 INFO: ionice priority: 7
364: 2023-08-31 01:34:08 INFO: creating Proxmox Backup Server archive 'vm/364/2023-08-30T23:34:08Z'
364: 2023-08-31 01:34:08 INFO: attaching TPM drive to QEMU for backup
364: 2023-08-31 01:34:13 ERROR: VM 364 qmp command 'human-monitor-command' failed - got timeout
364: 2023-08-31 01:34:13 INFO: aborting backup job
364: 2023-08-31 01:44:14 ERROR: VM 364 qmp command 'backup-cancel' failed - got timeout
364: 2023-08-31 01:44:14 INFO: resuming VM again
364: 2023-08-31 01:45:04 ERROR: Backup of VM 364 failed - VM 364 qmp command 'cont' failed - unable to connect to VM 364 qmp socket - timeout after 450 retries

I noticed this: 364: 2023-08-31 01:34:13 ERROR: VM 364 qmp command 'human-monitor-command' failed - got timeout
I tried to build the exact same VPS on a different cluster. The backup works fine here.

Does anyone know what's causing this?
 
I think it could be useful to compare both hosts pveversion -v, list VM config qm config 364 on both and finally the command line used by qemu to start the VM. Maybe there's some differences on both hosts.

There are a few threads in the forum about "timeouts" at backup time, some were related to already solved bugs.