Hello.
I run a cluster with 7 nodes, currently hosting 60 VMs and a few LXC containers. Since we are migrating from VMware, the number of VMs will grow. The goal for now is about 200-250 VMs, but it will keep growing beyond that. Six more hosts may join the cluster.
I occasionally get timeouts from the PBS; the affected VMs are random and differ from run to run:
INFO: Starting Backup of VM 159 (qemu)
INFO: Backup started at 2024-04-03 21:13:50
INFO: status = running
INFO: VM Name: blabla
INFO: include disk 'scsi0' 'storage:159/vm-159-disk-0.qcow2' 100G
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating Proxmox Backup Server archive 'vm/159/2024-04-03T19:13:50Z'
ERROR: VM 159 qmp command 'backup' failed - got timeout
INFO: aborting backup job
INFO: resuming VM again
ERROR: Backup of VM 159 failed - VM 159 qmp command 'backup' failed - got timeout
INFO: Failed at 2024-04-03 21:17:30
I wonder whether it's possible, and sensible, to split the backup jobs so that node 1 backs up at 21:00, node 2 at 22:00, and so on. Would that hurt dirty-bitmap tracking and deduplication when VMs are migrated from one host to another? Are there any other drawbacks?
Is it possible to exclude VMs if I go this route? VMs will not stay on the same host at all times, since live migrations will take place. For example, if I exclude VM 106 on node 1, I guess it won't be excluded once it is migrated to node 2...
Or should I just investigate why it's timing out...
The PBS has 24 x Intel(R) Xeon(R) CPU E5-2640 0 @ 2.50GHz (2 sockets) and 280GB RAM. The backup storage is a Synology NAS mounted over NFS.
PBS version: 3.1-2
Best regards
--
Markus