C cola16 Member Feb 2, 2024 47 2 8 Mar 31, 2025 #1 The disk that is causing this problem is fine when SMART scanned. When I do dmesg | grep ‘ sd ’, I get nothing but a message that the device is connected. What should I check? Attachments ceph-error.txt ceph-error.txt 34.3 KB · Views: 2
The disk that is causing this problem is fine when SMART scanned. When I do dmesg | grep ‘ sd ’, I get nothing but a message that the device is connected. What should I check?
gurubert Distinguished Member Mar 12, 2015 1,164 324 153 Berlin, Germany www.heinlein-consulting.de Mar 31, 2025 #2 Remove this OSD and redeploy it. There may be just a bit error on the disk.
C cola16 Member Feb 2, 2024 47 2 8 Mar 31, 2025 #3 gurubert said: There may be just a bit error on the disk. Click to expand... I redeployed last night. This morning I found similar errors. When I ran journalctl -eu ceph-osd@14, last night I could see errors related to IO errors. Today, I cannot find the errors from the attached files. I have osd crash twice. I couldn't see any hardware errors in s.m.a.r.t and dmesg, so I asked to get if there was anything else I could check.
gurubert said: There may be just a bit error on the disk. Click to expand... I redeployed last night. This morning I found similar errors. When I ran journalctl -eu ceph-osd@14, last night I could see errors related to IO errors. Today, I cannot find the errors from the attached files. I have osd crash twice. I couldn't see any hardware errors in s.m.a.r.t and dmesg, so I asked to get if there was anything else I could check.
gurubert Distinguished Member Mar 12, 2015 1,164 324 153 Berlin, Germany www.heinlein-consulting.de Apr 1, 2025 #4 I would replace the disk now.