Excuse the bad language - but the situation calls for it.
So , We have a 3 Node Cluster with CEPH ( well had ). It was decided to implement a full-mesh setup for Ceph, seperating the storage traffic from the rest...All good no worries...until we had catastrophic host hdd failure , forcing us to re-install PVE and re-join cluster , also not much of an issue.
UNTIL.. Ceph installed on the new node...and froze up completely...forcing a reboot... which somehow annihilated the whole cluster...monitors dead, mgr dead, mds...yes also dead.
I have attempted purge and install with no luck.
ceph -s times out
create mon times out
PVE cluster service and corosync all fine
I have the OSD data , I do have a copy of ONE monitor store and keyrings.
Running contingency - essential service VM's restored from backup to "static" un-cephed drives - very much not ideal.
Please someone send up a flare in the darkness and help a noob-ish guy out to sort this nightmare out.
So , We have a 3 Node Cluster with CEPH ( well had ). It was decided to implement a full-mesh setup for Ceph, seperating the storage traffic from the rest...All good no worries...until we had catastrophic host hdd failure , forcing us to re-install PVE and re-join cluster , also not much of an issue.
UNTIL.. Ceph installed on the new node...and froze up completely...forcing a reboot... which somehow annihilated the whole cluster...monitors dead, mgr dead, mds...yes also dead.
I have attempted purge and install with no luck.
ceph -s times out
create mon times out
PVE cluster service and corosync all fine
I have the OSD data , I do have a copy of ONE monitor store and keyrings.
Running contingency - essential service VM's restored from backup to "static" un-cephed drives - very much not ideal.
Please someone send up a flare in the darkness and help a noob-ish guy out to sort this nightmare out.