S3 Storage via CEPH RGW & cache datastore?

Funar

Member
Oct 8, 2021
I'm curious about the cache disk for S3 storage. We're experimenting with S3 storage in PBS using CEPH's RGW, with CEPH running local to the PBS server.

Is there a way to calculate how much storage the datastore used for cache needs to be?

We aren't concerned about API calls to the CEPH server or its storage requirements, since it's on our own network. However, the cache datastore is growing alongside the S3 bucket, so we're trying to plan ahead as this feature matures.
 
Is there a way to calculate how much storage the datastore used for cache needs to be?
No, the cache is greedy and will use as much space as it is given. At a minimum it needs to fit the various metadata files PBS requires to operate normally, so you might get away with only a few GiB of storage.
 
So, as a real-world example, I have 233 snapshots across 14 VMs, which comes to 2.1 TB (34% dedup) in the S3 bucket that PBS uses. My cache disk is currently using 680 GB of 2 TB. This is only a small fraction of what we currently use PBS for. My goal is to come up with a forecast for full production use. Having S3 capability is huge for us.
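
For what it's worth, here is a rough sketch of how I'd turn those numbers into a forecast. It assumes the cache keeps growing roughly in proportion to the bucket, which is only what I've observed so far and not documented PBS behavior (per the answer above, the cache is greedy and will take whatever space it gets). The function names, the 20% headroom, and the 10 TB projection are my own placeholders.

```python
# Naive linear forecast of cache disk usage from one observed data point.
# ASSUMPTION: cache usage scales proportionally with bucket size. This is
# not documented PBS behavior; the cache is greedy and may behave differently.

def cache_ratio(cache_used_gb: float, bucket_gb: float) -> float:
    """Observed ratio of cache usage to S3 bucket size."""
    if bucket_gb <= 0:
        raise ValueError("bucket size must be positive")
    return cache_used_gb / bucket_gb


def forecast_cache_gb(projected_bucket_gb: float, ratio: float,
                      headroom: float = 0.2) -> float:
    """Projected cache size in GB, padded by a hypothetical safety margin."""
    return projected_bucket_gb * ratio * (1 + headroom)


# Observed values from this setup: 680 GB of cache for a 2.1 TB bucket.
observed_ratio = cache_ratio(680, 2100)

# Hypothetical production bucket of 10 TB (placeholder figure).
projected = forecast_cache_gb(10_000, observed_ratio)

print(f"observed cache/bucket ratio: {observed_ratio:.2f}")
print(f"projected cache for 10 TB bucket: {projected:.0f} GB")
```

With more data points over time (bucket size vs. cache usage at each prune/GC cycle), the single ratio could be replaced with a proper fit, but this is enough to sanity-check whether a given cache disk is in the right ballpark.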