Hi,
We’ve got a cluster of 4 new servers, all running Ceph on NVMe and besides this issue seem to be running happily.
There is about 70 VMs in total, and we are utilising Proxmox Backup Server for backups primarily.
What we receive on one or two VMs during the backup run is;
INFO: creating Proxmox Backup Server archive 'vm/151/2024-11-20T13:21:24Z'
ERROR: QMP command query-proxmox-support failed - VM 151 qmp command 'query-proxmox-support' failed - unable to connect to VM 151 qmp socket - timeout after 51 retries
INFO: aborting backup job
ERROR: VM 151 qmp command 'backup-cancel' failed - unable to connect to VM 151 qmp socket - timeout after 5956 retries
We have, in the past, recreated the VM and reattached the disks manually and the problem has gone away for a small period of time, but then it may reoccur on the same VM or potentially another one.
With very little information to go on as to why its failing, its hard to troubleshoot but perhaps others may have encountered this same issue and resolved it or know how to get more information about what is going wrong under the hood so we can better target our troubleshooting.
It is worth noting that we also tried installing Veeam and using it to perform backups, and we’re also seeing a failure on the same VMs that PBS is having, which is why I thought it best to post in the Proxmox board than the PBS board, but happy to have it moved if deemed appropriate.
Thank you for any and all ideas.
We’ve got a cluster of 4 new servers, all running Ceph on NVMe and besides this issue seem to be running happily.
There is about 70 VMs in total, and we are utilising Proxmox Backup Server for backups primarily.
What we receive on one or two VMs during the backup run is;
INFO: creating Proxmox Backup Server archive 'vm/151/2024-11-20T13:21:24Z'
ERROR: QMP command query-proxmox-support failed - VM 151 qmp command 'query-proxmox-support' failed - unable to connect to VM 151 qmp socket - timeout after 51 retries
INFO: aborting backup job
ERROR: VM 151 qmp command 'backup-cancel' failed - unable to connect to VM 151 qmp socket - timeout after 5956 retries
We have, in the past, recreated the VM and reattached the disks manually and the problem has gone away for a small period of time, but then it may reoccur on the same VM or potentially another one.
With very little information to go on as to why its failing, its hard to troubleshoot but perhaps others may have encountered this same issue and resolved it or know how to get more information about what is going wrong under the hood so we can better target our troubleshooting.
It is worth noting that we also tried installing Veeam and using it to perform backups, and we’re also seeing a failure on the same VMs that PBS is having, which is why I thought it best to post in the Proxmox board than the PBS board, but happy to have it moved if deemed appropriate.
Thank you for any and all ideas.