Hello Proxmox community,
I would like to ask for a technical clarification regarding the use of Ceph in a Proxmox VE architecture with only two storage nodes.
We are currently evaluating a virtualization infrastructure where the target architecture consists of:
- 2 Proxmox VE nodes used for virtualization and storage
- 1 additional witness/quorum node without storage OSDs
The requirements for this architecture are:
- Synchronous replication between the two storage nodes
- High availability for virtual machines
- Split-brain protection
- Continuity of service in case one storage node fails
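To make the split-brain requirement concrete, here is a minimal sketch of the majority rule as I understand it. The function name `has_quorum` is my own illustration, not a Proxmox or Ceph API, and it assumes every node carries exactly one vote.

```python
def has_quorum(total_votes: int, votes_alive: int) -> bool:
    """Majority rule: strictly more than half of all votes must be reachable."""
    return votes_alive > total_votes // 2


# Two storage nodes only: losing one leaves 1 of 2 votes, which is not a majority.
print(has_quorum(2, 1))  # False

# Two storage nodes plus one witness: losing one node leaves 2 of 3 votes.
print(has_quorum(3, 2))  # True
```

This only models cluster membership. It says nothing about how many copies of the data survive, which is the part I am unsure about.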
My understanding is that Ceph generally requires at least three storage nodes with OSDs to provide proper high availability and data redundancy. With only two storage nodes and a pool size of 2, the failure of one node leaves a single replica. If min_size is set to 2, Ceph blocks I/O on the affected placement groups until the second replica is available again. Reducing min_size to 1 allows degraded operation, but data then exists with only one valid copy until the failed node recovers, which increases the risk of data loss.
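The trade-off as I currently understand it can be sketched as follows. This is a simplified model under my own assumptions (one replica per storage node, and a hypothetical helper `pool_state` that is not part of any Ceph tooling), so please correct me if the logic is wrong.

```python
def pool_state(size: int, min_size: int, replicas_up: int) -> str:
    """Simplified model of a replicated pool.

    size        -- number of replicas the pool wants to keep
    min_size    -- minimum number of replicas required to accept I/O
    replicas_up -- replicas currently reachable
    """
    if replicas_up < min_size:
        return "blocked"    # I/O stops until enough replicas return
    if replicas_up < size:
        return "degraded"   # I/O continues with reduced redundancy
    return "healthy"


# Two storage nodes, one has failed:
print(pool_state(size=2, min_size=2, replicas_up=1))  # blocked
print(pool_state(size=2, min_size=1, replicas_up=1))  # degraded

# Three storage nodes, one has failed:
print(pool_state(size=3, min_size=2, replicas_up=2))  # degraded
```

In the two-node case with min_size=1, "degraded" means writes land on a single copy. In the three-node case, "degraded" still leaves two copies, which is the difference I am trying to confirm.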
Therefore, I would appreciate clarification on the following points:
- Is Ceph recommended for production with only two storage nodes and one witness/quorum node without OSDs?
- Can a witness/quorum node without storage OSDs replace a third Ceph storage node from a data availability point of view?
- In case one of the two storage nodes fails, can Ceph continue to provide safe write operations while still maintaining proper data redundancy?
- Is the recommended Proxmox/Ceph architecture for production high availability based on at least three full storage nodes with OSDs?
- Would you consider a 2-node Ceph storage architecture suitable for a production virtualization platform requiring HA, or should another SDS/vSAN solution designed for 2-node deployments be considered?
Thank you in advance for your clarification and recommendations.
Best regards,