In a 5 node cluster, I had to replace some failed SSD's and now the CEPH cluster is stuck with "Reduced data availability: 40 pgs inactive, 42 pgs incomplete"
Code:
Reduced data availability: 40 pgs inactive, 42 pgs incomplete
pg 2.57 is incomplete, acting [1,35,14] (reducing pool CephFS_data min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 15.2a is incomplete, acting [41,27,0] (reducing pool SSD_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 15.2c is incomplete, acting [48,26,40] (reducing pool SSD_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 15.64 is incomplete, acting [24,27,46] (reducing pool SSD_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.0 is incomplete, acting [7,23,29] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.2 is stuck inactive for 6h, current state undersized+degraded+remapped+backfilling+peered, last acting [4]
pg 17.3 is incomplete, acting [15,2,26] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.6 is stuck inactive for 2h, current state undersized+degraded+remapped+backfilling+peered, last acting [37]
pg 17.7 is stuck inactive for 2h, current state undersized+degraded+remapped+backfilling+peered, last acting [21]
pg 17.8 is incomplete, acting [37,28,20] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.b is incomplete, acting [18,32,10] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.c is stuck inactive for 3h, current state undersized+degraded+remapped+backfilling+peered, last acting [36]
pg 17.d is incomplete, acting [70,45,38] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.10 is incomplete, acting [51,46,11] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.11 is stuck inactive for 4h, current state undersized+degraded+remapped+backfilling+peered, last acting [51]
pg 17.14 is stuck inactive for 6h, current state undersized+degraded+remapped+backfilling+peered, last acting [3]
pg 17.17 is incomplete, acting [48,70,14] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.1d is incomplete, acting [12,35,14] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.1e is incomplete, acting [24,3,48] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.20 is stuck inactive for 3h, current state undersized+degraded+remapped+backfilling+peered, last acting [3]
pg 17.22 is incomplete, acting [7,18,20] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.26 is incomplete, acting [30,48,20] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.27 is incomplete, acting [31,24,58] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.29 is stuck inactive for 6h, current state undersized+degraded+remapped+backfilling+peered, last acting [68]
pg 17.2c is incomplete, acting [1,7,30] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.2e is incomplete, acting [15,5,44] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.30 is stuck inactive for 111m, current state undersized+degraded+remapped+backfilling+peered, last acting [30]
pg 17.32 is incomplete, acting [48,31,3] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.3a is incomplete, acting [16,25,46] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.3b is incomplete, acting [2,7,1] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.3d is incomplete, acting [48,70,33] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.40 is incomplete, acting [11,45,43] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.44 is incomplete, acting [43,18,6] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.4a is incomplete, acting [14,22,35] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.4d is stuck inactive for 6h, current state undersized+degraded+remapped+backfilling+peered, last acting [50]
pg 17.4f is incomplete, acting [0,44,15] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.53 is incomplete, acting [33,27,26] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.54 is incomplete, acting [31,41,14] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.55 is incomplete, acting [27,44,14] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.56 is incomplete, acting [41,55,27] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.57 is stuck inactive for 6h, current state undersized+degraded+remapped+backfilling+peered, last acting [14]
pg 17.58 is incomplete, acting [35,9,50] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.59 is incomplete, acting [58,25,31] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.6c is incomplete, acting [27,20,25] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.70 is stuck inactive for 11h, current state undersized+degraded+remapped+backfilling+peered, last acting [36]
pg 17.71 is incomplete, acting [0,1,35] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.74 is stuck inactive for 5h, current state undersized+degraded+remapped+backfilling+peered, last acting [23]
pg 17.76 is incomplete, acting [40,27,70] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.77 is incomplete, acting [6,9,20] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.79 is incomplete, acting [35,27,25] (reducing pool Ceph_Storage min_size from 2 may help; search ceph.com/docs for 'incomplete')
pg 17.7b is stuck inactive for 6h, current state undersized+degraded+remapped+backfilling+peered, last acting [3]
Code:
Degraded data redundancy: 284751/3351945 objects degraded (8.495%), 38 pgs degraded, 38 pgs undersized
pg 15.1b is stuck undersized for 10m, current state active+undersized+degraded+remapped+backfilling, last acting [1,37]
pg 15.77 is stuck undersized for 10m, current state active+undersized+degraded+remapped+backfilling, last acting [50,31]
pg 17.2 is stuck undersized for 106m, current state undersized+degraded+remapped+backfilling+peered, last acting [4]
pg 17.6 is stuck undersized for 88m, current state undersized+degraded+remapped+backfilling+peered, last acting [37]
pg 17.7 is stuck undersized for 23m, current state undersized+degraded+remapped+backfilling+peered, last acting [21]
pg 17.c is stuck undersized for 106m, current state undersized+degraded+remapped+backfilling+peered, last acting [36]
pg 17.e is stuck undersized for 10m, current state active+undersized+degraded+remapped+backfilling, last acting [68,52]
pg 17.11 is stuck undersized for 10m, current state undersized+degraded+remapped+backfilling+peered, last acting [51]
pg 17.14 is stuck undersized for 10m, current state undersized+degraded+remapped+backfilling+peered, last acting [3]
pg 17.19 is stuck undersized for 3h, current state active+undersized+degraded+remapped+backfilling, last acting [36,8]
pg 17.1a is stuck undersized for 90m, current state active+undersized+degraded+remapped+backfilling, last acting [35,3]
pg 17.1f is stuck undersized for 10m, current state active+undersized+degraded+remapped+backfilling, last acting [50,46]
pg 17.20 is stuck undersized for 2h, current state undersized+degraded+remapped+backfilling+peered, last acting [3]
pg 17.24 is stuck undersized for 3h, current state active+undersized+degraded+remapped+backfilling, last acting [3,68]
pg 17.29 is stuck undersized for 11m, current state undersized+degraded+remapped+backfilling+peered, last acting [68]
pg 17.2a is stuck undersized for 2h, current state active+undersized+degraded+remapped+backfilling, last acting [14,37]
pg 17.2d is stuck undersized for 2h, current state active+undersized+degraded+remapped+backfilling, last acting [36,7]
pg 17.30 is stuck undersized for 89m, current state undersized+degraded+remapped+backfilling+peered, last acting [30]
pg 17.31 is stuck undersized for 11m, current state active+undersized+degraded+remapped+backfilling, last acting [21,46]
pg 17.35 is stuck undersized for 2h, current state active+undersized+degraded+remapped+backfilling, last acting [68,5]
pg 17.3f is stuck undersized for 101m, current state active+undersized+degraded+remapped+backfilling, last acting [3,26]
pg 17.43 is stuck undersized for 2h, current state active+undersized+degraded+remapped+backfilling, last acting [29,51]
pg 17.48 is stuck undersized for 10m, current state active+undersized+degraded+remapped+backfilling, last acting [43,37]
pg 17.49 is stuck undersized for 3h, current state active+undersized+degraded+remapped+backfilling, last acting [8,30]
pg 17.4d is stuck undersized for 88m, current state undersized+degraded+remapped+backfilling+peered, last acting [50]
pg 17.57 is stuck undersized for 89m, current state undersized+degraded+remapped+backfilling+peered, last acting [14]
pg 17.61 is stuck undersized for 2h, current state undersized+degraded+remapped+backfilling+peered, last acting [8]
pg 17.62 is stuck undersized for 2h, current state active+undersized+degraded+remapped+backfilling, last acting [37,21]
pg 17.65 is stuck undersized for 11m, current state undersized+degraded+remapped+backfilling+peered, last acting [21]
pg 17.66 is stuck undersized for 10m, current state undersized+degraded+remapped+backfilling+peered, last acting [33]
pg 17.69 is stuck undersized for 10m, current state active+undersized+degraded+remapped+backfilling, last acting [40,46]
pg 17.6b is stuck undersized for 10m, current state undersized+degraded+remapped+backfilling+peered, last acting [42]
pg 17.6f is stuck undersized for 59m, current state active+undersized+degraded+remapped+backfilling, last acting [11,58]
pg 17.70 is stuck undersized for 11m, current state undersized+degraded+remapped+backfilling+peered, last acting [36]
pg 17.74 is stuck undersized for 101m, current state undersized+degraded+remapped+backfilling+peered, last acting [23]
pg 17.7a is stuck undersized for 10m, current state active+undersized+degraded+remapped+backfilling, last acting [7,42]
pg 17.7b is stuck undersized for 91m, current state undersized+degraded+remapped+backfilling+peered, last acting [3]
pg 17.7c is stuck undersized for 3h, current state active+undersized+degraded+remapped+backfilling, last acting [3,46]
Code:
root@PVE1:~# ceph osd tree
ID CLASS WEIGHT TYPE NAME STATUS REWEIGHT PRI-AFF
-1 27.96579 root default
-3 7.02707 host PVE1
0 ssd 0.43939 osd.0 up 1.00000 1.00000
2 ssd 0.43619 osd.2 up 1.00000 1.00000
3 ssd 0.90919 osd.3 up 1.00000 1.00000
4 ssd 0.43939 osd.4 up 1.00000 1.00000
5 ssd 0.43939 osd.5 up 1.00000 1.00000
9 ssd 0.43939 osd.9 up 1.00000 1.00000
11 ssd 0.43619 osd.11 up 1.00000 1.00000
14 ssd 0.43619 osd.14 up 1.00000 1.00000
18 ssd 0.43639 osd.18 up 1.00000 1.00000
21 ssd 0.43939 osd.21 up 1.00000 1.00000
25 ssd 0.87279 osd.25 up 1.00000 1.00000
28 ssd 0.43619 osd.28 up 1.00000 1.00000
29 ssd 0.42760 osd.29 up 1.00000 1.00000
30 ssd 0.43939 osd.30 up 1.00000 1.00000
-5 5.75449 host PVE2
6 ssd 0.45459 osd.6 up 1.00000 1.00000
7 ssd 0.87279 osd.7 up 1.00000 1.00000
10 ssd 0.43619 osd.10 up 1.00000 1.00000
12 ssd 0.43619 osd.12 up 1.00000 1.00000
15 ssd 0.45459 osd.15 up 1.00000 1.00000
22 ssd 0.43939 osd.22 up 1.00000 1.00000
27 ssd 0.87279 osd.27 up 1.00000 1.00000
45 ssd 0.45459 osd.45 up 1.00000 1.00000
48 ssd 0.45459 osd.48 up 1.00000 1.00000
57 ssd 0.43939 osd.57 down 0 1.00000
58 ssd 0.43939 osd.58 up 1.00000 1.00000
-7 8.16016 host PVE3
1 ssd 0.43649 osd.1 up 1.00000 1.00000
8 ssd 0.43939 osd.8 up 1.00000 1.00000
16 ssd 0.45459 osd.16 up 1.00000 1.00000
17 ssd 0.45430 osd.17 up 1.00000 1.00000
20 ssd 0.43649 osd.20 up 1.00000 1.00000
24 ssd 0.43619 osd.24 up 1.00000 1.00000
32 ssd 0.45459 osd.32 up 1.00000 1.00000
33 ssd 0.43649 osd.33 up 1.00000 1.00000
34 ssd 0.43639 osd.34 up 1.00000 1.00000
38 ssd 0.43649 osd.38 up 1.00000 1.00000
40 ssd 0.43649 osd.40 up 1.00000 1.00000
41 ssd 0.43649 osd.41 up 1.00000 1.00000
42 ssd 0.43649 osd.42 up 1.00000 1.00000
43 ssd 0.43649 osd.43 up 1.00000 1.00000
44 ssd 0.43639 osd.44 up 1.00000 1.00000
47 ssd 0.25000 osd.47 up 0.34996 1.00000
50 ssd 0.43939 osd.50 up 1.00000 1.00000
51 ssd 0.43939 osd.51 up 1.00000 1.00000
52 ssd 0.42760 osd.52 up 1.00000 1.00000
-13 7.02408 host PVE4
13 ssd 0.43939 osd.13 up 1.00000 1.00000
19 ssd 0.43939 osd.19 up 1.00000 1.00000
23 ssd 0.43619 osd.23 up 1.00000 1.00000
26 ssd 0.43939 osd.26 up 0.85004 1.00000
31 ssd 0.90919 osd.31 up 1.00000 1.00000
35 ssd 0.87279 osd.35 up 1.00000 1.00000
36 ssd 0.43619 osd.36 up 1.00000 1.00000
37 ssd 0.43619 osd.37 up 0.85004 1.00000
39 ssd 0.43619 osd.39 up 1.00000 1.00000
46 ssd 0.87279 osd.46 up 1.00000 1.00000
55 ssd 0.42760 osd.55 up 1.00000 1.00000
68 ssd 0.43939 osd.68 up 1.00000 1.00000
70 ssd 0.43939 osd.70 up 1.00000 1.00000
Last edited: