Hi ,
I got 2 OSD's (osd.13 and 17) failed, there are a lot of error message is dmesg. df is just hang.
[53623085.515011] XFS (sdc1): xfs_log_force: error -5 returned.
[53623088.039118] XFS (sdd1): xfs_log_force: error -5 returned.
[53623115.516356] XFS (sdc1): xfs_log_force: error -5 returned.
[53623118.040473] XFS (sdd1): xfs_log_force: error -5 returned.
sdc 8:32 0 1.8T 0 disk
└─sdc1 8:33 0 1.8T 0 part /var/lib/ceph/osd/ceph-13
sdd 8:48 0 1.8T 0 disk
└─sdd1 8:49 0 1.8T 0 part /var/lib/ceph/osd/ceph-17
13 1.81850 osd.13 down 0 1.00000
17 1.81850 osd.17 down 0 1.00000
# uname -a
Linux ceph-ric-sata-03.mgt.tpgtelecom.com.au 4.4.1-1.el7.elrepo.x86_64 #1 SMP Sun Jan 31 16:49:23 EST 2016 x86_64 x86_64 x86_64 GNU/Linux
Smartctl comes good, should no issue on physical drive. Not sure if xfs corrupted bring osd down? Any advise how to fix this problem? Please comments.
Thanks so much.
I got 2 OSD's (osd.13 and 17) failed, there are a lot of error message is dmesg. df is just hang.
[53623085.515011] XFS (sdc1): xfs_log_force: error -5 returned.
[53623088.039118] XFS (sdd1): xfs_log_force: error -5 returned.
[53623115.516356] XFS (sdc1): xfs_log_force: error -5 returned.
[53623118.040473] XFS (sdd1): xfs_log_force: error -5 returned.
sdc 8:32 0 1.8T 0 disk
└─sdc1 8:33 0 1.8T 0 part /var/lib/ceph/osd/ceph-13
sdd 8:48 0 1.8T 0 disk
└─sdd1 8:49 0 1.8T 0 part /var/lib/ceph/osd/ceph-17
13 1.81850 osd.13 down 0 1.00000
17 1.81850 osd.17 down 0 1.00000
# uname -a
Linux ceph-ric-sata-03.mgt.tpgtelecom.com.au 4.4.1-1.el7.elrepo.x86_64 #1 SMP Sun Jan 31 16:49:23 EST 2016 x86_64 x86_64 x86_64 GNU/Linux
Smartctl comes good, should no issue on physical drive. Not sure if xfs corrupted bring osd down? Any advise how to fix this problem? Please comments.
Thanks so much.