Ceph cluster completely lost

mgiammarco

Feb 18, 2010
Hello,
I have an 8-host Proxmox + Ceph cluster.
Yesterday one HDD failed. While the cluster was rebuilding, Proxmox HA decided to hard-reboot some servers.

Now Ceph has some incomplete PGs and refuses I/O on all pools.
So I cannot start or recover any VM.

What can I do?

I am in contact with the Ceph people, but they told me to ask here as well.

Thanks,
Mario
 
Without very detailed information it is hard to help.

Please post more details about your problem.
 
Hi,
As Tom wrote, it is not easy to say without more information.

Check which OSDs are down with
Code:
ceph osd tree
and try to bring as many of them as possible back up (depending on your replica count and CRUSH rules, the cluster may tolerate the failure of one host).

You can get further information with
Code:
ceph health detail
ceph -s
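
To narrow down which PGs are affected, the output of ceph health detail can be filtered. The following is only a rough sketch: the helper name incomplete_pgs is made up for illustration, and it assumes that each affected PG shows up on a line beginning with "pg <pgid>" that contains the word "incomplete". The exact wording of that output can differ between Ceph releases, so check it against what your cluster prints.

```shell
#!/bin/sh
# Hypothetical helper (not part of Ceph): print the IDs of PGs that are
# reported as incomplete. Reads `ceph health detail`-style text on stdin.
# Assumption: affected PGs appear on lines whose first word is "pg",
# whose second word is the PG ID, and which mention "incomplete".
incomplete_pgs() {
    # sort -u drops duplicates when the same PG is listed more than once
    awk '$1 == "pg" && /incomplete/ { print $2 }' | sort -u
}

# Usage on a live cluster (needs the ceph CLI and an admin keyring):
#   ceph health detail | incomplete_pgs
#   ceph pg <pgid> query    # detailed state of one PG, including peering info
```

The output of ceph pg <pgid> query for one or two of the listed PGs is usually the kind of detail worth posting along with ceph osd tree and ceph -s.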
Udo
 
