[SOLVED] Removing a busted node with ceph from cluster

King Tiger

New Member
Dec 27, 2022
6
0
1
Hi all,

I am running a 3 node HA ceph cluster.
PVE: 6.4-1
Ceph: 15.2.17
Each node has 4 OSD's
Monitor installed on all nodes
Manager installed on node 1 and 2.

Proxmox and (not suppose to Ceph 16.2.9) upgrade on node 3 sadly went bad. Fastest way to resolve is, i think to completely reinstall node 3?

Need some advice on how to do that safely without messing with the other 2 nodes/data?

1. Destroy OSD's
2. Destroy monitor
3. Remove it from the Proxmox cluster by using the pvecm delnode command
4. Cleanup ceph.conf on node 1 and 2
5. Removed entry from /etc/pve/nodes on node 1 and 2.
6. Reinstall Proxmox and Ceph
7. Rejoin Cluster
8. Blank the disks, and re-add them as OSDs to the Ceph cluster.

Everything is running on node 1 and 2 now.

VM & container backups are done daily. Same goes for complete \etc folder.

Thanks in advance!