Some strange stuff has been happening since i upgraded to v8 this week.
I have a six node cluster with ceph. The actual upgrade process was fine. Basically I did one a day over the course of the week and everything seemed fine.
I then had an issue with two nodes which have started to drop their OSD. Between the two nodes I am down 7 OSD. When it was one drive I thought OK may be hardware but not this amount over a 24-36 window. '573 daemons have recently crashed' was not happening before the upgrade and these seem to point to the 7 OSD,
Before I start doing something radical and probably stupid is there any recommendations as to approach this?
I have a six node cluster with ceph. The actual upgrade process was fine. Basically I did one a day over the course of the week and everything seemed fine.
I then had an issue with two nodes which have started to drop their OSD. Between the two nodes I am down 7 OSD. When it was one drive I thought OK may be hardware but not this amount over a 24-36 window. '573 daemons have recently crashed' was not happening before the upgrade and these seem to point to the 7 OSD,
Before I start doing something radical and probably stupid is there any recommendations as to approach this?
Last edited: