Ceph unavailable from single node

chell

New Member
Dec 6, 2023
Hi All

I have a 3-node cluster with Ceph storage. I want to configure the Ceph cluster to keep working with only one node (in the event of a double failure).
Currently Ceph continues working correctly after the failure of one node, but as soon as two nodes are down, Ceph becomes unavailable and you can't even open the Ceph configuration pages.
All of the pools are configured for 3x replication with a minimum of 1 and a host failure domain.
Thanks
 
By default, a Ceph cluster requires a majority of its monitors (MONs) to be up to maintain quorum!


All of the pools are configured for 3x replication with a minimum of 1 and a host failure domain.
Note, with min_size=1, there is a risk of data loss if only one replica is available during a write operation.
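
As a rough illustration of what min_size controls, here is a minimal Python sketch. It is a deliberate simplification: the function name `pg_accepts_io` is made up for this example, and it ignores peering, MON quorum and everything else Ceph checks. It only shows the rule that a placement group serves I/O while the number of available replicas is at least min_size.

```python
def pg_accepts_io(replicas_up: int, min_size: int) -> bool:
    """Hypothetical helper: a PG serves I/O only while at least
    min_size replicas are available (simplified model)."""
    return replicas_up >= min_size


# Pool with size=3: compare min_size=1 against min_size=2
for min_size in (1, 2):
    for replicas_up in (3, 2, 1):
        ok = pg_accepts_io(replicas_up, min_size)
        print(f"min_size={min_size}, replicas up={replicas_up}: I/O allowed={ok}")
```

With min_size=1 the pool keeps accepting writes on a single remaining replica, which is exactly the situation where losing that one copy loses the newest data.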
 
By default, a Ceph cluster requires a majority of its monitors (MONs) to be up to maintain quorum!



Note, with min_size=1, there is a risk of data loss if only one replica is available during a write operation.

The normal operation will be three nodes. Only one node will be taken offline at a time for maintenance. Running on a single node will only occur if a fault occurs when another node is already down for another fault or maintenance.
This site is rural. Some repairs can take weeks, so a second failure while waiting for the first failure to be fixed is a real possibility.
 
Ceph needs a MON majority for quorum; losing it halts operations. One option is to add a lightweight `tiebreaker` node to the cluster, which doesn't need to store data but can help maintain quorum. Also consider setting the `noout` flag during maintenance to avoid triggering recovery processes.
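
To make the majority rule concrete, here is a small Python sketch of the arithmetic only. The function names and the three layouts are assumptions for illustration, not anything Ceph exposes. It assumes a strict majority of all configured MONs must be up, and that each of the three nodes runs one MON.

```python
def quorum_size(total_mons: int) -> int:
    """Smallest number of monitors that forms a strict majority."""
    return total_mons // 2 + 1


def has_quorum(total_mons: int, mons_up: int) -> bool:
    """True if the surviving monitors can still form a majority."""
    return mons_up >= quorum_size(total_mons)


# Hypothetical layouts: (description, total MONs, MONs left after two nodes fail)
scenarios = [
    ("3 MONs, one per node", 3, 1),
    ("3 node MONs + 1 tiebreaker MON", 4, 2),
    ("3 node MONs + 2 extra MONs", 5, 3),
]

for label, total, up in scenarios:
    print(f"{label}: {up}/{total} up, need {quorum_size(total)}, "
          f"quorum={has_quorum(total, up)}")
```

This only covers Ceph MON counting. It says nothing about whether the remaining OSDs can serve the data, or about the quorum of the Proxmox cluster itself.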
 
Ceph needs a MON majority for quorum; losing it halts operations. One option is to add a lightweight `tiebreaker` node to the cluster, which doesn't need to store data but can help maintain quorum. Also consider setting the `noout` flag during maintenance to avoid triggering recovery processes.
Is this the reason why Ceph becomes unavailable to configure when there is only one node?
Adding another server is not possible; however, I could create a small VM on a couple of the workstations just for Ceph MONs.
 
Is this the reason why Ceph becomes unavailable to configure when there is only one node?
Yes.

Adding another server is not possible; however, I could create a small VM on a couple of the workstations just for Ceph MONs.
This won't help if the cluster itself loses quorum.
 
