Hello,
I have a 5 node PVE cluster with Ceph (2 OSD every node on Intel DC SSD on dedicated 10G network).
I am actually using it for LXC containers and my idea is to always keep one node empty to move containers in case of node failure.
I will need to scale up in the near future probably to a total of 21 servers.
What are the best practices and bottlenecks to consider while scaling?
Will my 10GB network become the bottleneck adding nodes ? ( I have dual 10GB adapter so I could use bonding to get 20GB)
I could create separate smaller clusters (3 cluster of 7 nodes) but I will need more hot standby servers.
Any suggestion is appreciated.
Thank you.
P.
I have a 5 node PVE cluster with Ceph (2 OSD every node on Intel DC SSD on dedicated 10G network).
I am actually using it for LXC containers and my idea is to always keep one node empty to move containers in case of node failure.
I will need to scale up in the near future probably to a total of 21 servers.
What are the best practices and bottlenecks to consider while scaling?
Will my 10GB network become the bottleneck adding nodes ? ( I have dual 10GB adapter so I could use bonding to get 20GB)
I could create separate smaller clusters (3 cluster of 7 nodes) but I will need more hot standby servers.
Any suggestion is appreciated.
Thank you.
P.