Unable to start Ceph

maomaocake

Member
Feb 13, 2022
45
3
13
21
Hi, I just did a hard disk swap and all the osds on a node is not able to start with the service

`systemctl start ceph-osd@0`
the output of systemctl status ceph-osd@0 is

Code:
ceph-osd@0.service - Ceph object storage daemon osd.0
     Loaded: loaded (/lib/systemd/system/ceph-osd@.service; enabled-runtime; preset: enabled)
    Drop-In: /usr/lib/systemd/system/ceph-osd@.service.d
             └─ceph-after-pve-cluster.conf
     Active: failed (Result: exit-code) since Tue 2023-10-03 18:36:51 +07; 16min ago
   Duration: 1.136s
    Process: 8295 ExecStartPre=/usr/libexec/ceph/ceph-osd-prestart.sh --cluster ${CLUSTER} --id 0 (code=exited, status=0/SUCCESS)
    Process: 8331 ExecStart=/usr/bin/ceph-osd -f --cluster ${CLUSTER} --id 0 --setuser ceph --setgroup ceph (code=exited, status=1/FAILURE)
   Main PID: 8331 (code=exited, status=1/FAILURE)
        CPU: 87ms

Oct 03 18:41:40 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:46:45 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:46:45 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:46:45 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:49:22 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:49:22 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:49:22 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:49:31 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:49:31 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:49:31 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.


However the command to start the OSD:
/usr/bin/ceph-osd -f --cluster ceph --id 0 --setuser ceph --setgroup ceph &
works fine

what am I missing
 
Hello,

You have too many restart attempts for the systemd service associated to such OSD, please run

Code:
systemctl reset-failed

If the OSD does not start working after that, you could try to restart the OSD service again. Should that fail, restart all Ceph services with

Code:
systemctl restart ceph.target
 
  • Like
Reactions: maomaocake

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!