[SOLVED] Ceph Pacific upgrade to Quincy osd boot problem

czechsys

Renowned Member
Nov 18, 2015
454
50
93
Hi,
i followed pve documentation on upgrading ceph pacific to quincy, so, full upgrade to last pve 7, reboot, change to quincy, upgrade, restart mons/mgrs/osds, all went ok. But after this i rebooted one upgraded host and osds doesn't come up:

Code:
Jul 11 02:31:03 proxmox-03 systemd[1]: Starting Ceph object storage daemon osd.4...
Jul 11 02:31:03 proxmox-03 systemd[1]: Started Ceph object storage daemon osd.4.
Jul 11 02:31:03 proxmox-03 ceph-osd[9053]: 2023-07-11T02:31:03.229+0200 7faa969183c0 -1 auth: unable to find a keyring on /var/lib/ceph/osd/ceph-4/keyring: (2) No such file or directory
Jul 11 02:31:03 proxmox-03 ceph-osd[9053]: 2023-07-11T02:31:03.229+0200 7faa969183c0 -1 AuthRegistry(0x55e57e5f8140) no keyring found at /var/lib/ceph/osd/ceph-4/keyring, disabling cephx
Jul 11 02:31:03 proxmox-03 ceph-osd[9053]: 2023-07-11T02:31:03.233+0200 7faa969183c0 -1 auth: unable to find a keyring on /var/lib/ceph/osd/ceph-4/keyring: (2) No such file or directory
Jul 11 02:31:03 proxmox-03 ceph-osd[9053]: 2023-07-11T02:31:03.233+0200 7faa969183c0 -1 AuthRegistry(0x7ffd31921e90) no keyring found at /var/lib/ceph/osd/ceph-4/keyring, disabling cephx
Jul 11 02:31:03 proxmox-03 ceph-osd[9053]: failed to fetch mon config (--no-mon-config to skip)
Jul 11 02:31:03 proxmox-03 systemd[1]: ceph-osd@4.service: Main process exited, code=exited, status=1/FAILURE
Jul 11 02:31:03 proxmox-03 systemd[1]: ceph-osd@4.service: Failed with result 'exit-code'.
Jul 11 02:31:13 proxmox-03 systemd[1]: ceph-osd@4.service: Scheduled restart job, restart counter is at 3.
Jul 11 02:31:13 proxmox-03 systemd[1]: Stopped Ceph object storage daemon osd.4.
Jul 11 02:31:13 proxmox-03 systemd[1]: ceph-osd@4.service: Start request repeated too quickly.
Jul 11 02:31:13 proxmox-03 systemd[1]: ceph-osd@4.service: Failed with result 'exit-code'.
Jul 11 02:31:13 proxmox-03 systemd[1]: Failed to start Ceph object storage daemon osd.4.

Code:
root@proxmox-03:/var/log# lvs
File descriptor 9 (pipe:[101154]) leaked on lvs invocation. Parent PID 15233: bash
File descriptor 11 (pipe:[101155]) leaked on lvs invocation. Parent PID 15233: bash
  LV                                             VG                                        Attr       LSize   Pool Origin Data%  Meta%  Move Log Cpy%Sync Convert
  osd-block-a72ae29a-e413-4848-bc70-c2c76563a567 ceph-220d4477-940f-4970-b3a7-1836b88b11fc -wi-a-----  <1.75t
  osd-block-2ff9ed5f-075a-4e42-bc22-da303006b98c ceph-4962b11b-3aaf-4a65-9c1f-c66cb9a4dbd8 -wi-a-----  <1.75t

Code:
root@proxmox-03:/var/log# ceph -s
  cluster:
    id:     99c06983-733f-482b-b59e-ad21f55c20e3
    health: HEALTH_WARN
            Degraded data redundancy: 200020/600060 objects degraded (33.333%), 129 pgs degraded, 129 pgs undersized

  services:
    mon: 3 daemons, quorum proxmox-01,proxmox-02,proxmox-03 (age 42m)
    mgr: proxmox-02(active, since 54m), standbys: proxmox-01, proxmox-03
    osd: 6 osds: 4 up (since 55m), 4 in (since 21m)

  data:
    pools:   2 pools, 129 pgs
    objects: 200.02k objects, 743 GiB
    usage:   1.2 TiB used, 5.8 TiB / 7.0 TiB avail
    pgs:     200020/600060 objects degraded (33.333%)
             129 active+undersized+degraded

How to get osds up? The /var/lib/ceph/osd/ceph-X directory is empty.
 
Code:
proxmox-ve: 7.4-1 (running kernel: 5.15.108-1-pve)
pve-manager: 7.4-15 (running version: 7.4-15/a5d2a31e)
pve-kernel-5.15: 7.4-4
pve-kernel-5.13: 7.1-9
pve-kernel-5.15.108-1-pve: 5.15.108-1
pve-kernel-5.15.83-1-pve: 5.15.83-1
pve-kernel-5.13.19-6-pve: 5.13.19-15
ceph: 17.2.6-pve1
ceph-fuse: 17.2.6-pve1
corosync: 3.1.7-pve1
criu: 3.15-1+pve-1
glusterfs-client: 9.2-1
ifupdown: 0.8.36+pve2
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-1
libknet1: 1.24-pve2
libproxmox-acme-perl: 1.4.4
libproxmox-backup-qemu0: 1.3.1-1
libproxmox-rs-perl: 0.2.1
libpve-access-control: 7.4.1
libpve-apiclient-perl: 3.2-1
libpve-common-perl: 7.4-2
libpve-guest-common-perl: 4.2-4
libpve-http-server-perl: 4.2-3
libpve-rs-perl: 0.7.7
libpve-storage-perl: 7.4-3
libspice-server1: 0.14.3-2.1
lvm2: 2.03.11-2.1
lxc-pve: 5.0.2-2
lxcfs: 5.0.3-pve1
novnc-pve: 1.4.0-1
openvswitch-switch: 2.15.0+ds1-2+deb11u4
proxmox-backup-client: 2.4.2-1
proxmox-backup-file-restore: 2.4.2-1
proxmox-kernel-helper: 7.4-1
proxmox-mail-forward: 0.1.1-1
proxmox-mini-journalreader: 1.3-1
proxmox-widget-toolkit: 3.7.3
pve-cluster: 7.3-3
pve-container: 4.4-6
pve-docs: 7.4-2
pve-edk2-firmware: 3.20230228-4~bpo11+1
pve-firewall: 4.3-4
pve-firmware: 3.6-5
pve-ha-manager: 3.6.1
pve-i18n: 2.12-1
pve-qemu-kvm: 7.2.0-8
pve-xtermjs: 4.16.0-2
qemu-server: 7.4-4
smartmontools: 7.2-pve3
spiceterm: 3.2-2
swtpm: 0.8.0~bpo11+3
vncterm: 1.7-1
zfsutils-linux: 2.1.11-pve1