Strange Ceph OSD issue

KeyzerSuze

Member
Aug 16, 2024
Hi

Bit of a strange one. I will try to explain it as best I can.


The setup is a server with local drives,

plus a USB-attached enclosure.

I have 4 SATA drives in the enclosure.

When the server boots up, for some reason drive 2 always turns off some time after it has started to boot into Linux.

What I have to do, while it is in its boot-up phase, is pop the drive out and push it back in so that it powers up again and works normally. On a cold boot all of the drives are okay; it only happens once Proxmox (8.4) starts to boot.


If I don't get to do this on reboot, the OSD is not found and the drive is not seen by Proxmox.

When I pop the drive and reinsert it once Proxmox has fully loaded, it has the side effect of turning off drive 1 as well, so slots 1 and 2 seem to go through a reboot / power cycle. The USB connection is fine, and the drives in slots 3 and 4 work fine and stay connected.

I'm using a TerraMaster D8 Hybrid.

Let's say it's OSD.12 in slot 1, on sdn.

Then I pull slot 2 and slot 1 power cycles as well.

In Proxmox OSD.12 dies, but the LV is still there and it looks like it's still mounted.

Both slot 1 and slot 2 come back:

slot 1 comes back as sdo (the next available name) and slot 2 comes back as sdp.
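
Since the sdX letters move around after each power cycle, a small Python sketch like the one below could help keep track of which physical disk ended up under which name. It only reads the symlinks under /dev/disk/by-id, which stay tied to the disk model and serial; the function name `map_stable_ids` is made up for illustration, and it assumes the USB enclosure actually exposes distinct IDs per drive, which I have not confirmed for this unit.

```python
import os


def map_stable_ids(by_id_dir="/dev/disk/by-id"):
    """Return {stable_id: kernel_name} for every symlink in by_id_dir.

    Returns an empty dict if the directory does not exist.
    """
    mapping = {}
    if not os.path.isdir(by_id_dir):
        return mapping
    for name in sorted(os.listdir(by_id_dir)):
        path = os.path.join(by_id_dir, name)
        # Only symlinks are of interest; each one points at a node such as /dev/sdo.
        if os.path.islink(path):
            mapping[name] = os.path.basename(os.path.realpath(path))
    return mapping
```

Calling `map_stable_ids()` before and after pulling the drive and comparing the two results should show which ID moved from sdn to sdo or sdp.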


I can't get OSD.12 to restart with sdp, and I'm not sure what I should do. I can't restart the service, and the LV is still there and still mounted. I figure I should be able to do this remotely. Last time I just destroyed the OSD and created a new one, but that meant rebuilding and rebalancing.

Any thoughts on how I can fix this when it happens?
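
To make the question more concrete, here is a rough, untested Python sketch of the sequence I assume would be needed to bring the OSD back without destroying it. It only builds the command list as a dry run and does not execute anything. The helper name `osd_recovery_plan` and the VG/LV name in the usage line are placeholders, not values from my system, and the ordering (stop the OSD, unmount, deactivate and reactivate the stale LV, then let ceph-volume reactivate) is my guess rather than a confirmed procedure. In particular, `lvchange -an` may refuse if something still holds the old device open.

```python
import shlex


def osd_recovery_plan(osd_id, vg_lv):
    """Build (but do not run) a guessed command sequence for reviving one OSD.

    osd_id: numeric OSD id, e.g. 12
    vg_lv:  LVM "vg/lv" name backing the OSD (placeholder, look it up with
            `ceph-volume lvm list` on the real host)
    """
    unit = f"ceph-osd@{osd_id}.service"
    mount = f"/var/lib/ceph/osd/ceph-{osd_id}"
    return [
        ["ceph", "osd", "set", "noout"],          # avoid rebalancing while the OSD is down
        ["systemctl", "stop", unit],              # make sure the dead OSD process is gone
        ["umount", mount],                        # drop the stale OSD mount
        ["lvchange", "-an", vg_lv],               # deactivate the LV tied to the old device
        ["pvscan", "--cache"],                    # let LVM see the PV under its new name
        ["lvchange", "-ay", vg_lv],               # reactivate the LV on the new device
        ["ceph-volume", "lvm", "activate", "--all"],  # remount and start OSDs found on LVM
        ["ceph", "osd", "unset", "noout"],        # back to normal once the OSD is up
    ]


def format_plan(plan):
    """Render the plan as shell-quoted lines for copy and paste review."""
    return [shlex.join(cmd) for cmd in plan]
```

For example, `print("\n".join(format_plan(osd_recovery_plan(12, "my-vg/my-osd-lv"))))` prints the commands one per line so they can be checked before anything is run by hand.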