Dear all,
I am about to increase the # of Placement groups on our 6 node cluster (soon to be expanded by a seventh node). Since I know the rebalancing is very demanding on the overall Cluster performance I would like to ask about your experiences regarding how long this procedure "usually" takes. Our setup is as follows:
Disclaimer:
So, tl;dr:
Thank you very much in advance and best regards,
Alex
I am about to increase the # of Placement groups on our 6 node cluster (soon to be expanded by a seventh node). Since I know the rebalancing is very demanding on the overall Cluster performance I would like to ask about your experiences regarding how long this procedure "usually" takes. Our setup is as follows:
- 6 nodes with 7 OSDs each (totallng 42 OSDs, a mix of Micron 7400 and Micron 7450 NVMEs)
- all NVMEs are of the same size (3,2 TB)
- Ceph is setup with a single pool for VM disks, Size/min is 3/2
- Usage of the pool is at 10% (12.7 TiB of 122 TiB)
- Currently 5 monitors and 6 managers
- All nodes are redundantly connected to a 25 GBit/s network / VLAN (dedicated network for Ceph only)
- no bottlenecks so far, iperf3 is sitting happily at ~ 22 GBit/s
- Hardware of the servers (ASUS) is 112 x AMD EPYC 7453 28-Core Processor (2 sockets) and 512 GB of ECC RAM
- Ceph is reef (18.4.2), PVE is 8.4.1, all packages coming from the enterprise repos.
Disclaimer:
- I guess since I forgot to specify a target ratio when I installed the cluster the autoscaler wouldn't do anything. We currently have 128 PGs (I know, not optimal), giving somewhere between 8 and 12 or so PGs per OSD (same, not optimal). The autoscaler is recommending 256 PGs as optimal (for now). I read that for around ~50 OSDs you should plan for 2048 PGs
- We don't expect "radical" storage growth in the future, i.e. this is really just VM storage (for operating system disks, no huge file storage etc.)
So, tl;dr:
- Increasing the PG count from 128 -> 256 and thinking to do this on a friday evening or some upcoming bank holiday: What is the estimated time for rebalancing? More like in the "a few hours" or in the "up to 2 days" spectrum?
- Is it recommended to do the increments in smaller steps (reaching a "power of 2" number of PGs) or is it better to do the leap at once, so let's say:
- 128 -> 256 -> 512 -> 1024 -> 2048 or
- 128 -> 2048
Thank you very much in advance and best regards,
Alex
Last edited: