Hey,
We are seeing slow recovery after adding another node and 10 OSDs and increasing our PG count from 512 to 1024. Is there any way to speed the process up?
This is our environment:
node-a-01
2x 1.92TB SSD
1x 12TB HDD
node-a-02
2x 1.92TB SSD
1x 12TB HDD
node-a-03
2x 1.92TB SSD
1x 12TB HDD
node-a-04
2x 1.92TB SSD
1x 12TB HDD
node-a-05
2x 1.92TB SSD
1x 12TB HDD
node-a-06 < Newly added.
2x 1.92TB SSD < Newly added.
1x 12TB HDD < Newly added.
node-b-01
4x 1.92TB SSD < Newly added.
node-b-02
4x 1.92TB SSD < Newly added.
node-c-01
2x 3.8TB SSD
2x 1.92TB SSD < Newly added.
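As a sanity check on the inventory above, here is a minimal sketch that tallies the drives. It assumes one OSD per drive and that the listed sizes are decimal terabytes; neither is stated explicitly, and the variable names are only illustrative. Under those assumptions the count comes to 30, which matches the "30 osds" in the status output, and the raw capacity converted to binary units comes to roughly 110.8, in line with the reported 110 TB (assuming the status output uses binary units).

```python
# Drive inventory transcribed from the list above: (node, drive count, size in TB).
# Assumption: every drive backs exactly one OSD.
inventory = []
for i in range(1, 7):  # node-a-01 .. node-a-06
    node = f"node-a-{i:02d}"
    inventory.append((node, 2, 1.92))  # 2x 1.92TB SSD
    inventory.append((node, 1, 12.0))  # 1x 12TB HDD
inventory.append(("node-b-01", 4, 1.92))
inventory.append(("node-b-02", 4, 1.92))
inventory.append(("node-c-01", 2, 3.8))
inventory.append(("node-c-01", 2, 1.92))

osd_count = sum(count for _, count, _ in inventory)
raw_tb = sum(count * size for _, count, size in inventory)

# Convert decimal TB to binary TiB for comparison with the status output
# (assumption: the status output reports binary units).
raw_tib = raw_tb * 1e12 / 2**40

print(f"OSDs: {osd_count}")
print(f"Raw capacity: {raw_tb:.2f} TB (about {raw_tib:.1f} TiB)")
```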
Code:
  cluster:
    id:     b2f1455f-5ba3-403c-a82a-659aad72638f
    health: HEALTH_ERR
            599520/11645616 objects misplaced (5.148%)
            Reduced data availability: 54 pgs inactive
            2678 slow requests are blocked > 32 sec
            89270 stuck requests are blocked > 4096 sec

  services:
    mon: 17 daemons, quorum *omitted*
    mgr: 4c-03-ceph(active), standbys: *omitted*
    osd: 30 osds: 30 up, 30 in; 76 remapped pgs

  data:
    pools:   1 pools, 1024 pgs
    objects: 3790k objects, 15065 GB
    usage:   45679 GB used, 67874 GB / 110 TB avail
    pgs:     5.273% pgs not active
             599520/11645616 objects misplaced (5.148%)
             948 active+clean
             54 activating+remapped
             16 active+remapped+backfilling
             6 active+remapped+backfill_wait

  io:
    client:   6756 B/s wr, 0 op/s rd, 1 op/s wr
    recovery: 33740 kB/s, 8 objects/s
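To put the numbers above in perspective, here is a rough back-of-the-envelope sketch. It reproduces the two percentages from the status output and estimates how long the misplaced objects would take to move at the current rate. The estimate assumes the 8 objects/s figure stays constant, which it almost certainly will not, so treat it as an order-of-magnitude figure only.

```python
# Values copied from the status output above.
misplaced_objects = 599520
total_object_copies = 11645616
inactive_pgs = 54
total_pgs = 1024
recovery_objects_per_s = 8  # assumption: held constant for the estimate

misplaced_pct = 100 * misplaced_objects / total_object_copies
inactive_pct = 100 * inactive_pgs / total_pgs

# Naive ETA: remaining misplaced objects divided by the current object rate.
eta_seconds = misplaced_objects / recovery_objects_per_s
eta_hours = eta_seconds / 3600

print(f"Misplaced:   {misplaced_pct:.3f}%")
print(f"Inactive PGs: {inactive_pct:.3f}%")
print(f"Estimated time at current rate: {eta_hours:.1f} hours")
```

At the current rate this works out to a bit under 21 hours for the misplaced objects alone, not counting the 54 PGs that are still activating.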