Hello,
I have several Proxmox clusters of 3 dedicated servers each. Previously, I used GlusterFS to replicate the VM disks between the three servers in the cluster so that if one server went down, the VMs would restart on the two remaining servers without data loss. This method worked very well, but GlusterFS is no longer maintained, so I need to switch to Ceph to achieve equivalent functionality.
I have therefore installed a Proxmox cluster of 3 dedicated servers with a pair of disks in RAID 1 for the system and two additional 1TB disks for Ceph, for a total of 2TB per server and 6TB in total across the cluster. The cluster is working correctly. I therefore have 6TB in total for Ceph, theoretically 2TB of which is usable.
If one of the hypervisors encounters a problem, resulting in two OSDs being unavailable, Ceph will rebalancing the PGs present on those missing OSDs to the remaining OSDs. This will generate a lot of I/O and consume a lot of disk space unnecessarily. Therefore, I should theoretically use a maximum of 1.3 TB for Ceph to have enough space to rebalancing the missing PGs (i.e., a "Safe near-full ratio" of 0.67).
Given that:
- These three servers are strictly identical in terms of hardware.
- The number of servers on the cluster will always remain the same (3). No servers will be added or removed during the cluster's lifetime.
- The number of OSDs on the cluster will always remain the same (6). No OSDs will be added or removed during the cluster's lifetime.
- If one of the three servers fails, it will generally be repaired within an hour, or at worst, within 3 days.
- If the server is truly beyond repair, a new cluster will be created with three new servers, and the VMs will be migrated to it.
How can I disable the function that rebalances PGs to the remaining OSDs if one server fails, resulting in the loss of two OSDs?
I would prefer not to completely disable the "balancer" module, as it's important for rebalancing PGs when the cluster is functioning normally, if I understand correctly. I've seen that it's possible to set a specific date and time for the balancer to run, defining a maximum rebalancing percentage per execution. If the rebalancing amount is very limited, this could give me time to either repair the down server or create a new cluster.
Thank you for your help.
I have several Proxmox clusters of 3 dedicated servers each. Previously, I used GlusterFS to replicate the VM disks between the three servers in the cluster so that if one server went down, the VMs would restart on the two remaining servers without data loss. This method worked very well, but GlusterFS is no longer maintained, so I need to switch to Ceph to achieve equivalent functionality.
I have therefore installed a Proxmox cluster of 3 dedicated servers with a pair of disks in RAID 1 for the system and two additional 1TB disks for Ceph, for a total of 2TB per server and 6TB in total across the cluster. The cluster is working correctly. I therefore have 6TB in total for Ceph, theoretically 2TB of which is usable.
If one of the hypervisors encounters a problem, resulting in two OSDs being unavailable, Ceph will rebalancing the PGs present on those missing OSDs to the remaining OSDs. This will generate a lot of I/O and consume a lot of disk space unnecessarily. Therefore, I should theoretically use a maximum of 1.3 TB for Ceph to have enough space to rebalancing the missing PGs (i.e., a "Safe near-full ratio" of 0.67).
Given that:
- These three servers are strictly identical in terms of hardware.
- The number of servers on the cluster will always remain the same (3). No servers will be added or removed during the cluster's lifetime.
- The number of OSDs on the cluster will always remain the same (6). No OSDs will be added or removed during the cluster's lifetime.
- If one of the three servers fails, it will generally be repaired within an hour, or at worst, within 3 days.
- If the server is truly beyond repair, a new cluster will be created with three new servers, and the VMs will be migrated to it.
How can I disable the function that rebalances PGs to the remaining OSDs if one server fails, resulting in the loss of two OSDs?
I would prefer not to completely disable the "balancer" module, as it's important for rebalancing PGs when the cluster is functioning normally, if I understand correctly. I've seen that it's possible to set a specific date and time for the balancer to run, defining a maximum rebalancing percentage per execution. If the rebalancing amount is very limited, this could give me time to either repair the down server or create a new cluster.
Thank you for your help.