Unable to start Ceph

maomaocake

Member
Feb 13, 2022
47
5
13
22
Hi, I just did a hard disk swap and all the osds on a node is not able to start with the service

`systemctl start ceph-osd@0`
the output of systemctl status ceph-osd@0 is

Code:
ceph-osd@0.service - Ceph object storage daemon osd.0
     Loaded: loaded (/lib/systemd/system/ceph-osd@.service; enabled-runtime; preset: enabled)
    Drop-In: /usr/lib/systemd/system/ceph-osd@.service.d
             └─ceph-after-pve-cluster.conf
     Active: failed (Result: exit-code) since Tue 2023-10-03 18:36:51 +07; 16min ago
   Duration: 1.136s
    Process: 8295 ExecStartPre=/usr/libexec/ceph/ceph-osd-prestart.sh --cluster ${CLUSTER} --id 0 (code=exited, status=0/SUCCESS)
    Process: 8331 ExecStart=/usr/bin/ceph-osd -f --cluster ${CLUSTER} --id 0 --setuser ceph --setgroup ceph (code=exited, status=1/FAILURE)
   Main PID: 8331 (code=exited, status=1/FAILURE)
        CPU: 87ms

Oct 03 18:41:40 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:46:45 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:46:45 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:46:45 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:49:22 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:49:22 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:49:22 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:49:31 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:49:31 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:49:31 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.


However the command to start the OSD:
/usr/bin/ceph-osd -f --cluster ceph --id 0 --setuser ceph --setgroup ceph &
works fine

what am I missing
 
Hello,

You have too many restart attempts for the systemd service associated to such OSD, please run

Code:
systemctl reset-failed

If the OSD does not start working after that, you could try to restart the OSD service again. Should that fail, restart all Ceph services with

Code:
systemctl restart ceph.target
 
  • Like
Reactions: maomaocake