So i dont know what exactly happened but today i saw that one of the monitors on one of my notes was stopped.
I tryed to start it but nothing.
I rebooted the node, and it still did not start.
The problem was that the ceph-mon@xxx.service was running, just that the cluster was saying it was down.
So pveceph destroy did not work (it sayd it did not find the id).
So i ended up doing this:
- disable the ceph-mon service
- manually remove the ip of the mon host in /etc/pve/ceph.conf along with [mon.xxxx] entry (it then synced to the other nodes)
- run: rm -rf /var/lib/ceph/mon/ceph-xxxxx
- rebooted the node
Then if i try to create the mon again i can see that the service is created, the ceph.conf is changed, and the ceph-mon@xxx.service is running, but it does not show up in ceph-s
Did anyone had this kind of problems ?
What can i do\check ?
I tryed to start it but nothing.
I rebooted the node, and it still did not start.
The problem was that the ceph-mon@xxx.service was running, just that the cluster was saying it was down.
So pveceph destroy did not work (it sayd it did not find the id).
So i ended up doing this:
- disable the ceph-mon service
- manually remove the ip of the mon host in /etc/pve/ceph.conf along with [mon.xxxx] entry (it then synced to the other nodes)
- run: rm -rf /var/lib/ceph/mon/ceph-xxxxx
- rebooted the node
Then if i try to create the mon again i can see that the service is created, the ceph.conf is changed, and the ceph-mon@xxx.service is running, but it does not show up in ceph-s
Did anyone had this kind of problems ?
What can i do\check ?