Hi everyone, I'm a homelabber. I have a 3-node cluster running Ceph, and one OSD consistently causes me problems. It is the only one that ever crashes: either the monitor or the OSD on this node keeps going down. This causes PGs to show up as inconsistent. I can manually repair them to bring the cluster back to a healthy state, but it happens again. I think it typically happens when there are network issues. I should put the Ceph traffic on a separate network, but I haven't gotten to that point yet. Any advice based on the attached log?
The only thing that I can do is delete and recreate the OSD. It's strange to me that it's the same node all the time.