Unable to start Ceph

maomaocake

Member
Feb 13, 2022
47
5
13
22
Hi, I just did a hard disk swap and all the osds on a node is not able to start with the service

`systemctl start ceph-osd@0`
the output of systemctl status ceph-osd@0 is

Code:
ceph-osd@0.service - Ceph object storage daemon osd.0
     Loaded: loaded (/lib/systemd/system/ceph-osd@.service; enabled-runtime; preset: enabled)
    Drop-In: /usr/lib/systemd/system/ceph-osd@.service.d
             └─ceph-after-pve-cluster.conf
     Active: failed (Result: exit-code) since Tue 2023-10-03 18:36:51 +07; 16min ago
   Duration: 1.136s
    Process: 8295 ExecStartPre=/usr/libexec/ceph/ceph-osd-prestart.sh --cluster ${CLUSTER} --id 0 (code=exited, status=0/SUCCESS)
    Process: 8331 ExecStart=/usr/bin/ceph-osd -f --cluster ${CLUSTER} --id 0 --setuser ceph --setgroup ceph (code=exited, status=1/FAILURE)
   Main PID: 8331 (code=exited, status=1/FAILURE)
        CPU: 87ms

Oct 03 18:41:40 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:46:45 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:46:45 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:46:45 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:49:22 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:49:22 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:49:22 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.
Oct 03 18:49:31 pve1 systemd[1]: ceph-osd@0.service: Start request repeated too quickly.
Oct 03 18:49:31 pve1 systemd[1]: ceph-osd@0.service: Failed with result 'exit-code'.
Oct 03 18:49:31 pve1 systemd[1]: Failed to start ceph-osd@0.service - Ceph object storage daemon osd.0.


However the command to start the OSD:
/usr/bin/ceph-osd -f --cluster ceph --id 0 --setuser ceph --setgroup ceph &
works fine

what am I missing
 
Hello,

You have too many restart attempts for the systemd service associated to such OSD, please run

Code:
systemctl reset-failed

If the OSD does not start working after that, you could try to restart the OSD service again. Should that fail, restart all Ceph services one by one by, e.g.

Code:
systemctl restart ceph-osd@0.service

for restarting OSD 0 or

Code:
systemctl restart ceph-mon@$(hostname).service

to restart the MON service. Please check Ceph's health before the next command via `ceph -s`. You can get a full list of Ceph services running on the current host via:

Code:
systemctl list-units --type=service |grep ceph
 
Last edited:
  • Like
Reactions: maomaocake