[SOLVED] Removing a busted node with ceph from cluster

King Tiger

New Member
Dec 27, 2022
5
0
1
Hi all,

I am running a 3 node HA ceph cluster.
PVE: 6.4-1
Ceph: 15.2.17
Each node has 4 OSD's
Monitor installed on all nodes
Manager installed on node 1 and 2.

Proxmox and (not suppose to Ceph 16.2.9) upgrade on node 3 sadly went bad. Fastest way to resolve is, i think to completely reinstall node 3?

Need some advice on how to do that safely without messing with the other 2 nodes/data?

1. Destroy OSD's
2. Destroy monitor
3. Remove it from the Proxmox cluster by using the pvecm delnode command
4. Cleanup ceph.conf on node 1 and 2
5. Removed entry from /etc/pve/nodes on node 1 and 2.
6. Reinstall Proxmox and Ceph
7. Rejoin Cluster
8. Blank the disks, and re-add them as OSDs to the Ceph cluster.

Everything is running on node 1 and 2 now.

VM & container backups are done daily. Same goes for complete \etc folder.

Thanks in advance!
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!