OSD Full and CEPH crashed completely - please help

berkaybulut

New Member
Feb 8, 2023
I recently received several warnings that Ceph's OSD disks were full. When I started adding a new OSD, I began getting a "got timeout (500)" error. The monitor logs show the errors below. Ceph is now completely down and I cannot bring it back up.

Code:
root@*:~# journalctl -b -u "ceph-mon@*.service"
Feb 02 02:31:46 cmt6770 systemd[1]: Started ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:31:46 cmt6770 systemd[1]: Started ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:31:46 cmt6770 systemd[1]: ceph-mon@*.service: Main process exited, code=exited, status=28/n/a
Feb 02 02:31:46 cmt6770 systemd[1]: ceph-mon@*.service: Failed with result 'exit-code'.
Feb 02 02:31:46 cmt6770 systemd[1]: ceph-mon@*.service: Main process exited, code=exited, status=1/FAILURE
Feb 02 02:31:46 cmt6770 systemd[1]: ceph-mon@*.service: Failed with result 'exit-code'.
Feb 02 02:31:56 cmt6770 systemd[1]: ceph-mon@*.service: Scheduled restart job, restart counter is at 1.
Feb 02 02:31:56 cmt6770 systemd[1]: ceph-mon@*.service: Scheduled restart job, restart counter is at 1.
Feb 02 02:31:56 cmt6770 systemd[1]: Stopped ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:31:56 cmt6770 systemd[1]: Started ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:31:56 cmt6770 systemd[1]: Stopped ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:31:56 cmt6770 systemd[1]: Started ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:31:56 cmt6770 systemd[1]: ceph-mon@*.service: Main process exited, code=exited, status=28/n/a
Feb 02 02:31:56 cmt6770 systemd[1]: ceph-mon@*.service: Failed with result 'exit-code'.
Feb 02 02:31:56 cmt6770 systemd[1]: ceph-mon@*.service: Main process exited, code=exited, status=1/FAILURE
Feb 02 02:31:56 cmt6770 systemd[1]: ceph-mon@*.service: Failed with result 'exit-code'.
Feb 02 02:32:06 cmt6770 systemd[1]: ceph-mon@*.service: Scheduled restart job, restart counter is at 2.
Feb 02 02:32:06 cmt6770 systemd[1]: ceph-mon@*.service: Scheduled restart job, restart counter is at 2.
Feb 02 02:32:06 cmt6770 systemd[1]: Stopped ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:32:06 cmt6770 systemd[1]: Started ceph-mon@*.service - Ceph cluster monitor daemon.
Feb 02 02:32:06 cmt6770 systemd[1]: Stopped ceph-mon@*.service - Ceph cluster monitor daemon.

Code:
root@*:~# ceph -s
got timeout
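A possible way to narrow this down, as a rough sketch rather than a confirmed diagnosis: the log above shows two different exit codes, `status=28/n/a` and `status=1/FAILURE`. On Linux, errno 28 is ENOSPC ("No space left on device"), so the 28 may mean the filesystem holding the monitor's data is full, separately from the OSDs. The helper names `tally_exit_statuses` and `mon_free_space` are hypothetical, and `/var/lib/ceph/mon` is assumed to be the monitor data location (the usual default); adjust it for the actual node.

```shell
#!/bin/sh
# Tally the "status=N" values that systemd reports for a failing unit.
# Reads journalctl-style lines on stdin and prints "count status=N",
# most frequent first.
tally_exit_statuses() {
    grep -o 'status=[0-9][0-9]*' | sort | uniq -c | sort -rn | awk '{print $1, $2}'
}

# Show free space for a directory. The default path is an assumption
# (the usual Ceph monitor data location), not taken from the logs above.
mon_free_space() {
    df -h "${1:-/var/lib/ceph/mon}"
}

# Example usage (run manually on the affected node):
#   journalctl -b -u "ceph-mon@*.service" | tally_exit_statuses
#   mon_free_space
```

If `mon_free_space` reports a nearly full filesystem, that would be consistent with the ENOSPC reading of status 28, but it does not prove it; the monitor's own log file should state the actual reason for the exit.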
 
