Just ran into this in the lab, haven't gone digging in prod yet.
Code:
pve-manager/8.1.3/b46aac3b42da5d15 (running kernel: 6.2.16-20-pve)
Cluster is alive and working with zero issues, and everything in the GUI is happy. However, the "ceph device" table appears not to have updated itself for a *very* long time, probably 2+ years. Not only are 30+ devices missing from the list, it also still shows devices that are no longer present.
Code:
root@pmx1:~# ceph device ls
<snip>
INTEL_SSDSC2BA800G3_BTTV504000EU800JGN  pmx2:sdd  osd.19  0%
INTEL_SSDSC2BA800G3_BTTV5040012Z800JGN  pmx5:sdb  osd.39  1%
INTEL_SSDSC2BA800G3_BTTV5040019C800JGN                    1%
INTEL_SSDSC2BA800G3_BTTV504001ML800JGN                    1%
INTEL_SSDSC2BA800G3_BTTV504002QC800JGN                    0%
INTEL_SSDSC2BA800G3_BTTV504002UM800JGN                    1%
INTEL_SSDSC2BA800G4_BTHV505202SE800OGN  pmx2:sda  osd.3   0%
INTEL_SSDSC2BA800G4_BTHV513605MW800OGN  pmx1:sda  osd.4   0%
INTEL_SSDSC2BA800G4_BTHV535200VM800OGN  pmx5:sdj  osd.0   1%
</snip>
root@pmx1:~# ceph device ls-by-host pmx1
DEVICE                                  DEV  DAEMONS  EXPECTED FAILURE
INTEL_SSDSC2BA800G3_BTTV451404HT800JGN  sdh  osd.25
INTEL_SSDSC2BA800G4_BTHV513605MW800OGN  sda  osd.4
"ceph osd df" for that host looks *much* different (and is correct)
Code:
ID   CLASS  WEIGHT   REWEIGHT  SIZE     RAW USE  DATA     OMAP     META     AVAIL    %USE   VAR   PGS  STATUS
101  hdd    7.27739  1.00000   7.3 TiB  4.0 TiB  4.0 TiB  31 KiB   11 GiB   3.2 TiB  55.43  1.06  79   up
102  hdd    7.27699  1.00000   7.3 TiB  4.0 TiB  4.0 TiB  32 KiB   10 GiB   3.3 TiB  54.69  1.05  79   up
103  hdd    7.27739  1.00000   7.3 TiB  4.0 TiB  4.0 TiB  34 KiB   9.4 GiB  3.3 TiB  55.24  1.06  82   up
104  hdd    7.27699  1.00000   7.3 TiB  4.7 TiB  4.7 TiB  29 KiB   12 GiB   2.6 TiB  64.29  1.23  89   up
105  hdd    7.27739  1.00000   7.3 TiB  4.1 TiB  4.1 TiB  28 KiB   10 GiB   3.1 TiB  56.93  1.09  83   up
4    ssd    0.72769  1.00000   745 GiB  388 GiB  383 GiB  1.1 GiB  4.0 GiB  357 GiB  52.13  1.00  116  up
25   ssd    0.72800  1.00000   745 GiB  462 GiB  457 GiB  1.4 GiB  3.3 GiB  283 GiB  62.00  1.19  145  up
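To put a number on the mismatch rather than eyeballing two tables, something like the rough sketch below might help. It only diffs saved command output and does not touch the cluster or fix anything. The function name `missing_device_entries` and the temp file paths are made up for illustration, and it assumes the OSD id list comes from `ceph osd ls-tree <host>` (one id per line).

```shell
#!/bin/sh
# Hypothetical helper: print OSD ids that exist on a host but have no
# row in the "ceph device ls-by-host" table.
#   $1: file holding the output of `ceph device ls-by-host <host>`
#   $2: file holding one OSD id per line (assumed: `ceph osd ls-tree <host>`)
missing_device_entries() {
    devs=$(mktemp)
    ids=$(mktemp)
    # Pull every "osd.N" token out of the device table and keep only N.
    grep -o 'osd\.[0-9][0-9]*' "$1" | cut -d. -f2 | sort -u > "$devs"
    # Sort the id list the same way so comm can compare the two files.
    sort -u "$2" > "$ids"
    # comm -13 prints lines that appear only in the second file,
    # i.e. OSDs with no device entry.
    comm -13 "$devs" "$ids"
    rm -f "$devs" "$ids"
}

# Example use on a cluster node (paths are arbitrary):
#   ceph device ls-by-host pmx1 > /tmp/dev.txt
#   ceph osd ls-tree pmx1       > /tmp/osd.txt
#   missing_device_entries /tmp/dev.txt /tmp/osd.txt
```

Going by the two outputs above for pmx1, this should list 101 through 105, since only osd.4 and osd.25 have device rows. The ids come out in sort order, not numeric order.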
No idea how to get Ceph to rebuild the device table. After an hour on Google I still can't find any answers. How do you make it match?
OFF TOPIC - found a wonderful cheatsheet worth sharing: https://github.com/TheJJ/ceph-cheatsheet