Hi everybody, I've got a small pve cluster in my home lab, and I started toying around with Proxmox Backup Server. I want to do a setup where after the end of a backup job the PBS machine is shutdown and powered off, more or less as described here:
https://forum.proxmox.com/threads/a...er-backup-finishes-hook-script-option.105493/
This works fine (as far as I can tell) if I do it on one node (e.g. backing up several virtual machines on the first node pve1).
But when I backup several pve nodes at once, the slowest backup is not able to finish, seemingly because another machine
in the cluster decided to power down the PBS, because its own backup job was done.
Can that interpretation be right, or should I look for a problem elsewhere?
If this interpretation of my problem is correct: Is there a way to check in the hook script if other nodes in the cluster are still running their parts of the backup jobs? How does this acutally work? Is the backup job executed on each node as a seperate process, each calling their own local copy of the hook script?
A part of the log of the failing job:
Any hints on how to best do this shutting down the pbs after the job has finished on all cluster nodes? Is the calling of the hook scripts logged somewhere in a more verbose way?
Thanks in advance,
Ralph
https://forum.proxmox.com/threads/a...er-backup-finishes-hook-script-option.105493/
This works fine (as far as I can tell) if I do it on one node (e.g. backing up several virtual machines on the first node pve1).
But when I backup several pve nodes at once, the slowest backup is not able to finish, seemingly because another machine
in the cluster decided to power down the PBS, because its own backup job was done.
Can that interpretation be right, or should I look for a problem elsewhere?
If this interpretation of my problem is correct: Is there a way to check in the hook script if other nodes in the cluster are still running their parts of the backup jobs? How does this acutally work? Is the backup job executed on each node as a seperate process, each calling their own local copy of the hook script?
A part of the log of the failing job:
Code:
INFO: 59% (18.9 GiB of 32.0 GiB) in 7m 30s, read: 76.0 MiB/s, write: 69.0 MiB/s
INFO: 59% (19.0 GiB of 32.0 GiB) in 7m 32s, read: 50.0 MiB/s, write: 36.0 MiB/s
ERROR: backup write data failed: command error: connection reset
INFO: aborting backup job
INFO: resuming VM again
ERROR: Backup of VM 107 failed - backup write data failed: command error: connection reset
INFO: Failed at 2024-01-30 14:37:07
Any hints on how to best do this shutting down the pbs after the job has finished on all cluster nodes? Is the calling of the hook scripts logged somewhere in a more verbose way?
Thanks in advance,
Ralph