Hi,
On my 3-node cluster I set up ceph using a custom device class (sas900 to identify my sas 900GB devices and put them all in one single pool), waiting for new pools to be created when new devices with different classes will be added to the nodes. I created a custom crush rule (replicated_sas900), associated the pool to the rule and renamed the pool. Everything went smoothly.
I will create new dedicated crush rules as new device type/dimension will arrive to the nodes.
The device_health_metrics pool is still on the default replicated_rule crush rule.
Now, I am trying to figure out some behiviours:
1) if I set the sas900 pool with autoscaler "on", it seems to never finish to recalculate the pgs, creating a high load on the storage
2) I cannot remove the default replicated_rule crush rule, being used by the device_health_metrics pool. I'd like to only have dedicated crush rules.
So: is it normal for the autoscaler continuing to work without end? Will it find a stable pg number? And, can I (should I) change the crush rule for the device_health_metrics pool? To wich one of the three or four dedicated crush rules?
On my 3-node cluster I set up ceph using a custom device class (sas900 to identify my sas 900GB devices and put them all in one single pool), waiting for new pools to be created when new devices with different classes will be added to the nodes. I created a custom crush rule (replicated_sas900), associated the pool to the rule and renamed the pool. Everything went smoothly.
I will create new dedicated crush rules as new device type/dimension will arrive to the nodes.
The device_health_metrics pool is still on the default replicated_rule crush rule.
Now, I am trying to figure out some behiviours:
1) if I set the sas900 pool with autoscaler "on", it seems to never finish to recalculate the pgs, creating a high load on the storage
2) I cannot remove the default replicated_rule crush rule, being used by the device_health_metrics pool. I'd like to only have dedicated crush rules.
So: is it normal for the autoscaler continuing to work without end? Will it find a stable pg number? And, can I (should I) change the crush rule for the device_health_metrics pool? To wich one of the three or four dedicated crush rules?