Hello!
Quick recap: I have 5 hosts, each with 5 x 4TB enterprise SSDs (Micron Pro), and Ceph runs on its own 10 Gbps network. Everything has been running great for about 6 months.
Last night one of the hosts had a kernel panic, so I rebooted it. Everything came up just fine, the cluster returned to HEALTH_OK within minutes, and the rebalancing began. I think it had actually started to rebalance before I got to the host, but once the existing PGs came alive on the rebooted host, the rebalance appeared to go pretty quickly.
Almost 12 hours later, it's still rebalancing. However, I started with 256 PGs, and now it's up to 344.

Watching the logs, I don't see anything about hardware failures or stuck PGs. Would the status here show me a stuck PG? Is it just doing its thing, and will it keep adding PGs until it is happy? Why was 256 OK for the longest time, only to start growing last night? Did it just not want to or need to before?
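
If the growth is the PG autoscaler stepping pg_num toward a higher target (an assumption on my part; nothing in the logs confirms it), a minimal sketch like this could show whether a pool is still mid-resize. The field names pool_name, pg_num, and pg_num_target are what I'd expect in the JSON from `ceph osd pool ls detail --format json`, and the pool names and the target of 512 in the sample are made up for illustration.

```python
import json


def pools_still_resizing(pools):
    """Return (name, pg_num, pg_num_target) for pools whose PG count differs from its target."""
    resizing = []
    for pool in pools:
        current = pool["pg_num"]
        # If no target is reported, treat the pool as already at its target.
        target = pool.get("pg_num_target", current)
        if current != target:
            resizing.append((pool["pool_name"], current, target))
    return resizing


# Hypothetical sample, shaped like the JSON pool listing (values are illustrative only).
sample = json.loads("""[
  {"pool_name": "vm-pool", "pg_num": 344, "pg_num_target": 512},
  {"pool_name": ".mgr", "pg_num": 1, "pg_num_target": 1}
]""")

for name, current, target in pools_still_resizing(sample):
    print(f"{name}: pg_num {current} is still moving toward {target}")
```

On a real cluster the list would come from parsing the command's output instead of the sample. A pool that shows up here is still being resized, not stuck.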

One thing to note: the misplaced object count drops to about 427k, then something happens and it's back up to 490k and works its way down again.
Your input is much appreciated!