Hi everyone,
I’m facing a critical issue with my Proxmox Backup Server (PBS) setup, and I haven’t found a clear solution so far.
Setup Details:
PBS Garbage Collection (GC) is aggressively deleting chunks that are still referenced by existing backups. I usually discover this only when trying to restore a backup, and it fails due to missing chunks. This has made 100% of backups unreliable – I cannot trust any of them for restores.
Additional issues:
I’m facing a critical issue with my Proxmox Backup Server (PBS) setup, and I haven’t found a clear solution so far.
Setup Details:
- PBS version: 3.4.1
- Storage: Single ZFS pool with 35 HDDs (104TB usable space)
- Backups: 500+ backups from over 500 VMs across a 70+ node PVE cluster
- Backup Frequency: Daily (mandatory)
- Pruning: Keep last 1, weekly 1, and monthly 1 (total 3 per VM)
PBS Garbage Collection (GC) is aggressively deleting chunks that are still referenced by existing backups. I usually discover this only when trying to restore a backup, and it fails due to missing chunks. This has made 100% of backups unreliable – I cannot trust any of them for restores.
Additional issues:
- GC takes 5+ days to complete.
- Verification jobs take 2+ weeks, so they’re practically useless in a daily backup setup.
- Disk health and ZFS pool status are healthy.
- I can’t turn off GC because the server runs out of space within a month.
- Is this a known limitation of PBS with large-scale environments?
- How can I ensure that GC does not delete valid chunks?
- Is there a reliable strategy to make backups restorable in this kind of high-load environment?
Last edited: