Hello all.
I deployed a 3-node cluster w/ ceph, one node per server room, to be able to keep working when (not if) one of the rooms goes down.
Too bad it seems that Proxmox stops IO even when ceph cluster would still be working: the result is that a downed room blocks all the VMs in the cluster as soon as they try to write.
In an old thread @leesteken says "If you want Ceph with redundancy, you need more than three nodes." but that's never said in the docs. On the contrary the docs says that the minimum is 3 nodes. A difference with that thread is that my config uses the default of:
With a downed room, the other nodes still have quorum, so why do they stop writes? And how should the config be changed to avoid this issue?
Tks
I deployed a 3-node cluster w/ ceph, one node per server room, to be able to keep working when (not if) one of the rooms goes down.
Too bad it seems that Proxmox stops IO even when ceph cluster would still be working: the result is that a downed room blocks all the VMs in the cluster as soon as they try to write.
In an old thread @leesteken says "If you want Ceph with redundancy, you need more than three nodes." but that's never said in the docs. On the contrary the docs says that the minimum is 3 nodes. A difference with that thread is that my config uses the default of:
osd_pool_default_min_size = 2
osd_pool_default_size = 3
With a downed room, the other nodes still have quorum, so why do they stop writes? And how should the config be changed to avoid this issue?
Tks