Does my concept make sense?

Hunduster

New Member
May 13, 2023
24
5
3
Düsseldorf, Germany
Hello everyone,
I've recently switched to PVE and PBS and have been thinking about a backup concept that I'm not sure will work the way I think it will :D

I have a NAS with spinners and SSDs. I have integrated two datastores on the PBS:
  • Datastore 1 on SSD
  • Datastore 2 on HDD

I have a backup job in PVE that performs a backup to Datastore 1 every hour. The data sets are verified directly after creation on Datastore 1.
Under Datastore 2 there is a sync job that syncs all backups from Datastore 1 to Datastore 2 at 00:00 at night.
At 01:00 the Prune and GC jobs then run on both datastores.

The Prune jobs are set as follows:

Datastore 1

1730625798651.png

Datastore 2

1730625815815.png

At 02:00 an offsite PBS then goes and pulls the data from Datastore 2.

My thought or goal behind the concept is that the hourly backups run on the more power-efficient SSDs so that the spinners in the NAS can go into sleep mode.
Once a day, the daily, weekly and monthly backups should then be copied to the spinners for “long-term archiving”.

I am now wondering whether this makes sense and whether PBS understands the connection between the two datastores and whether the backup chains are consistent or whether I am fooling myself here, especially with regard to consistency and deduplication.

I currently had 3 warnings last night under the GC job of Datastore 1: WARN: warning: unable to access non-existent chunk.
This makes me worry that my concept is not as well thought out as I initially thought.
 
  • Like
Reactions: Johannes S
For me this setup looks ok. But I would expect bad Performance for the remote sync of the offsite pbs, verify and Garbage collection on the hdd datastore. Did you run some benchmarks?
What size do you have on your ssds and hdds? It might be worth to add a mirrored ssd Partition as a special device for Metadata of the hdd data.
I would also setup the remote pbs to sync from the ssds for better performance
 
Last edited:
For me this setup looks ok. But I would expect bad Performance for the remote sync of the offsite pbs, verify and Garbage collection on the hdd datastore. Did you run some benchmarks?
What size do you have on your ssds and hdds? It might be worth to add a mirrored ssd Partition as a special device for Metadata of the hdd data.
I would also setup the remote pbs to sync from the ssds for better performance
I only have 50 Mbit/s upload at my home, the remote sync to Offiste is not faster.
The SSDs run in RAID 5 with 1.5 TB. My HDDs in RAID 6 with ~20 TB
 
  • Like
Reactions: Johannes S
I only have 50 Mbit/s upload at my home, the remote sync to Offiste is not faster.
The SSDs run in RAID 5 with 1.5 TB. My HDDs in RAID 6 with ~20 TB

Ok then you are propably right that there is not much (if at all) to gain from a sync from the HDDs. I'm still curious whether garbage collection and verify jobs on the HDD datastore have sufficient performance.

Concerning the warning about the missing chunks: As far I know missing chunks should be resynced with the next backup or sync job. Did you encounter this warning again?
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!