Were you talking about image #1 or image #2 when you said 'need to make sure used space can be no more than......'?
Thank you. I was referring to Image #2, which show available space. The size is showing total size of the entire pool. In your image example, the replicas are divided by 3, it should show 17.53TB as the available space (52.59 / 3)?
This is my opinion based on experience; I think using the following formula to decide on Replica number is good idea:
# of Node - 1 = Replica
So a 3 node CEPH cluster will have replica of 2. A 5 Node CEPH cluster will have replica of 4 and so on. Having 3 replicas 3 nodes will allow 2 simultaneous nodes failure and still keep functioning fine. Whereas with 2 replicas, 2 node failures may cause huge issue. What are the chances that 2 machines will fail at the same time. Dont be confuse when say with 3 replicas everything will work just fine even with 2 node failure, cluster will still face rebalancing thus making everything slower while cluster tries to recover. But i dont believe you will face any data loss due to stuck, unclean or stale PGs. Hope this makes sense.
Thank you much but I am confused and it doesn't make sense.
So let say I have 3 nodes of 10TB each. Total 30TB. 2 replicas will give me 15TB storage.
Let's add 4th node of of another 10TB, now the total is 40TB, 3 replicates will give me 13.33TB storage.
Let's add 5th node of of another 10TB, now the total is 50TB, 4 replicates will give me 12.5TB storage.
If I follow the rule of # of Node -1 = Replica, I don't gain any storage space and actually lose more storage space.
Another related question. Is there an advantage of having more than 3 replicas? Isn't 3 already a solid of no single point of failure?
Thank you very much for your help. I really appreciate it.
Last edited: