[SOLVED] ceph upgraded then: Failed to restart ceph.service: Unit ceph.service not found.

RobFantini

I just did this
Code:
# apt dist-upgrade
Reading package lists... Done
Building dependency tree     
Reading state information... Done
Calculating upgrade... Done
The following packages will be upgraded:
  ceph ceph-base ceph-common ceph-fuse ceph-mds ceph-mgr ceph-mon ceph-osd libcephfs2 librados2 libradosstriper1 librbd1 librgw2 python-ceph-argparse
  python-cephfs python-rados python-rbd python-rgw
18 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
Need to get 0 B/54.1 MB of archives.
After this operation, 2,172 kB disk space will be freed.
Do you want to continue? [Y/n]

that seemed to work


But then, not a minute later, all 3 OSDs on that node were down. I set noout.
Code:
# ceph -s
  cluster:
    id:     220b9a53-4556-48e3-a73c-28deff665e45
    health: HEALTH_WARN
            noout flag(s) set

  services:
    mon: 3 daemons, quorum pve3,pve10,pve15 (age 44h)
    mgr: pve15(active, since 12d), standbys: pve3, pve10
    mds: cephfs:1 {0=pve2=up:active} 2 up:standby
    osd: 21 osds: 18 up (since 7d), 17 in (since 7d)
         flags noout

  data:
    pools:   3 pools, 288 pgs
    objects: 1.71M objects, 6.3 TiB
    usage:   18 TiB used, 47 TiB / 65 TiB avail
    pgs:     288 active+clean

  io:
    client:   14 KiB/s rd, 7.9 MiB/s wr, 3 op/s rd, 367 op/s wr

Code:
# systemctl restart ceph.service
Failed to restart ceph.service: Unit ceph.service not found.

Any advice?
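For what it's worth, on Nautilus there is no monolithic ceph.service; each daemon gets its own templated systemd unit, grouped under targets. A minimal sketch (the OSD id 3 and host pve3 below are just illustrative examples, not taken from this cluster):

```shell
# There is no single ceph.service on Nautilus; daemons are managed through
# systemd targets and templated per-daemon units, e.g. (not run here):
#   systemctl restart ceph.target          # all Ceph daemons on this node
#   systemctl restart ceph-osd.target      # all OSDs on this node
#   systemctl restart ceph-osd@3.service   # one OSD

# Helper showing how the per-daemon unit names are built (illustrative):
unit_for() {
  local daemon="$1" id="$2"
  echo "ceph-${daemon}@${id}.service"
}

unit_for osd 3    # prints: ceph-osd@3.service
unit_for mon pve3 # prints: ceph-mon@pve3.service
```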
 
Code:
# ceph start ceph.target
no valid command found; 10 closest matches:
osd crush weight-set reweight-compat <item> <float[0.0-]> [<float[0.0-]>...]
osd crush weight-set reweight <poolname> <item> <float[0.0-]> [<float[0.0-]>...]
osd crush weight-set rm-compat
osd crush weight-set rm <poolname>
osd crush weight-set create <poolname> flat|positional
osd crush weight-set create-compat
osd crush weight-set dump
osd crush weight-set ls
osd crush get-device-class <ids> [<ids>...]
osd crush class ls-osd <class>
Error EINVAL: invalid command
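The EINVAL here is because start is not a ceph CLI subcommand: ceph.target is a systemd unit, so it has to be driven through systemctl. A sketch of the translation (not run against a live cluster; the helper is purely illustrative):

```shell
# ceph.target is a systemd unit, not a ceph subcommand; the working form is:
#   systemctl start ceph.target
fix_cmd() {
  # map the mistaken "ceph start <target>" onto its systemd equivalent
  echo "systemctl start $1"
}

fix_cmd ceph.target   # prints: systemctl start ceph.target
```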
 
Rebooting the node seemed to fix it.

Question: why does a normal upgrade cause Ceph to break?

I have 6 more nodes to upgrade, and do not mind trying attempted fixes.
 

What version were you going from -> to?

From your Ceph output, it doesn't look like there were any issues; Ceph OSDs don't actually update until you restart the OSD process.

In some recent versions the OSDs didn't reconnect to the new MONs until they were running the new version.
 

We upgraded from 14.2.4.1 to 14.2.5.

I just noticed that after the reboot the OSDs are down on the upgraded node, so we may have run into that issue of the OSDs needing upgraded MONs.


What should I do next?
 
But the restart did not seem to work, see above.

I can do another node soon, can you tell me which command to use?

You replied just saying the OSDs are fine and you just needed to refresh the screen.

If it is still an issue, what do

ceph -s
&
ceph health detail


show?
 
Also, after each upgrade there is a lot of backfilling. I do not recall that on some other upgrades; this upgrade may be doing something different?

So be sure to check ceph -s and wait until that settles before upgrading the next node.

Only one node had major backfills.

I think it is best to start with a node that runs a mon; then the OSDs will have a mon of the same version.
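A sketch of how one might script that "wait until it settles" step between nodes; the helper name and the polling interval are illustrative, and the live commands assume the ceph CLI is available on the node:

```shell
# Decide, from the "pgs:" summary in `ceph -s`, whether every PG is
# active+clean (pure string check, so it can run anywhere):
all_clean() {
  local total="$1" summary="$2"
  [ "$summary" = "${total} active+clean" ]
}

# Against the status earlier in this thread: "pgs: 288 active+clean"
all_clean 288 "288 active+clean" && echo "safe to upgrade the next node"

# On a live node one would poll, roughly:
#   until ceph -s | grep -q 'HEALTH_OK'; do sleep 30; done
```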
 

You should ideally always upgrade all the MONs first, as seen here: https://docs.ceph.com/docs/master/install/upgrading-ceph/
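To make the order concrete, a sketch of the per-daemon sequence from that doc (mons first, then mgrs, OSDs, and MDSs); the loop below only prints what a script would restart, it does not touch any services:

```shell
# Restart order per the Ceph upgrade docs: monitors first, then managers,
# OSDs, and metadata servers.
upgrade_order() {
  echo "mon mgr osd mds"
}

for d in $(upgrade_order); do
  # on a real node this would be: systemctl restart ceph-${d}.target
  echo "would restart ceph-${d}.target"
done
```

Between each step, confirm the daemons rejoined (e.g. ceph mon stat after the mons, ceph -s after the OSDs) before moving on.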
 
