I have a 12 node ceph cluster running on Proxmox 6 where each node is having 4 X 1.92TB SSD configured in the OSD Pool. The nodes are deployed in Blade chassis with each chassis having 4 Node each. Thus there are 3 chassis each having 4 Nodes
I have the following observation
whenever all nodes are up by doing rados bench, I am getting around 1000MB/s performance and no write issues
Now I have given a power off to one of the chassis so now I am having 8 nodes running, in this scenario
rados bench initially showed 1000MB/s and then subsequently it is reduced to 0 MB/s and no write is happening
This behaviour is erratic as all the VM running now are not able to write and goes to hang state
What could be the reason
I have the following observation
whenever all nodes are up by doing rados bench, I am getting around 1000MB/s performance and no write issues
Now I have given a power off to one of the chassis so now I am having 8 nodes running, in this scenario
rados bench initially showed 1000MB/s and then subsequently it is reduced to 0 MB/s and no write is happening
This behaviour is erratic as all the VM running now are not able to write and goes to hang state
What could be the reason