Sync sometimes fails - where to start troubleshooting?

chriss

Member
Dec 5, 2020
10
0
21
43
My primary PVE host runs 24/7.
"PBS-local" is running on that host as an LXC.
Backups are run every 20 minutes for all containers (except the PBS container) and a single VM.
Hardware: Intel Core i3-N305 , 16GB RAM, storage is flash only

My secondary PVE host is only booted up once per day.
"PBS" is VM on that host, there are no other vms or containers.
Host boots up at 8 am.
PBS syncs, prunes, gcs and verify then.
Hardware: HPE ProLiant MicroServer Gen10 Plus, Intel Xeon E-2224 , 16GB, storage is on large spinning disks.

All on a single local network, 1Gbit.

Sometimes (sometimes multiple days in a row) it cannot sync backups of a single container: 103 which is PostgreSQL.
Error msg:
sync group ct/103 failed - error trying to connect: error connecting to https://pbs-local.domain.tld:8007/ - tcp connect error: Connection refused (os error 111): error connecting to https://pbs-local.domain.tld:8007/ - tcp connect error: Connection refused (os error 111)


For sure I would be glad if someone could tell me the root cause, however I do not expect that.
I don't know where to start. Can someone point me in the right direction?
 
Last edited:
Is the sync job a push or pull sync job? If it is a push from the source to the target, it might be that the target server is not fully booted up just yet. Might be better to setup a pull in that case, so the proxmox backup services are guaranteed to be running.
 
  • Like
Reactions: chriss
Is the sync job a push or pull sync job? If it is a push from the source to the target, it might be that the target server is not fully booted up just yet. Might be better to setup a pull in that case, so the proxmox backup services are guaranteed to be running.
It's pull sync.
"PBS" on the secondary host pulls from "PBS-local".
 
Is the PBS host reachable during the whole time, e.g. via ping from source to target during sync. Do you maybe have duplicate IPs, the connection being therefore established to the wrong host?