[SOLVED] Bricked SSD after crash

anima

New Member
Feb 20, 2023
After installing Proxmox I had to shut down the server. When I rebooted, I was dropped to an initramfs prompt, so I tried repairing the drive with fsck.

1676930098226.png

I then tried to retrieve the superblock.

1676930299696.png
Output of fdisk -l
1676930418527.png
pvs
1676930471189.png
pvscan
1676930508499.png
vgcfgrestore
1676930607328.png

I also attempted to wipe the data with dd, without success: the drive remains read-only. I ran gparted afterward and it gave me this:

1676930802439.png

I then tried to reinstall on a second drive I had lying around, but upon reboot I had the same issue.

I don't care about the data I just want my drives back ...
 
Hi,

can you run smartctl -x /dev/sdf and see what it reports? This kind of sounds like a failing drive unfortunately.
Also, what version of PVE are you running (pveversion -v)?
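
For reference, a minimal sketch of collecting both reports into text files so they can be attached to a reply. It assumes smartmontools is installed, the script is run as root, and the disk really is /dev/sdf. The function name, the output file names, and the `SMARTCTL`/`PVEVERSION` override variables are made up for illustration.

```shell
#!/bin/sh
# Sketch only: gather the SMART report and the PVE version info.
# SMARTCTL and PVEVERSION are hypothetical override hooks (e.g. for a dry run).
SMARTCTL="${SMARTCTL:-smartctl}"
PVEVERSION="${PVEVERSION:-pveversion}"

collect_reports() {
    dev="${1:-/dev/sdf}"   # device to inspect (assumption: /dev/sdf)
    outdir="${2:-.}"       # where to write the reports

    # -x prints all SMART and non-SMART information about the device.
    # smartctl can exit non-zero on an unhealthy disk, so the two
    # commands are deliberately not chained with &&.
    $SMARTCTL -x "$dev" > "$outdir/smart-report.txt"

    # -v lists the versions of the PVE-related packages.
    $PVEVERSION -v > "$outdir/pveversion.txt"
}
```

Calling `collect_reports /dev/sdf /tmp` would then leave `smart-report.txt` and `pveversion.txt` in /tmp.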
 
It was right after a clean install of PVE 7.3 (I can't run smartctl from the initramfs, or I don't know how; sorry, that's all I can provide for the version info). It's not a recent drive, but I didn't see any errors in the smartctl output. Also, it happened on two different drives of the same brand and model.

1676972328392.png
1676973149553.png
1676973174636.png
 
Found out that I have a failing DIMM. Could this be the cause?
I don't know how that could be related - but good find, that at least prevents some future problems.

As for the drive:
It does show one "Failure Prediction Threshold Exceeded", meaning it might fail soon (or already has, as it seems).
Additionally, the current drive temperature is 52 °C, which is pretty high for a drive. Sustained usage at such temperatures might indeed cause drives to fail sooner.
And with roughly 223 TBW (terabytes written), it is already well into its life.
What you can do is run a SMART self-test on the drive and see what that reports; here's a nice guide for that.
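
As a rough sketch of what such a self-test could look like with smartctl, assuming a SATA drive at /dev/sdf and root privileges (the function names and the `SMARTCTL` override variable are made up for illustration):

```shell
#!/bin/sh
# Sketch only: start a short SMART self-test, then read back the results.
# SMARTCTL is a hypothetical override hook (e.g. SMARTCTL=echo to preview).
SMARTCTL="${SMARTCTL:-smartctl}"

start_short_test() {
    # -t short starts the short self-test; it runs on the drive itself
    # and smartctl prints an estimate of when it will complete.
    $SMARTCTL -t short "${1:-/dev/sdf}"
}

show_test_results() {
    dev="${1:-/dev/sdf}"
    # -l selftest prints the drive's self-test log.
    $SMARTCTL -l selftest "$dev"
    # -H prints the overall health self-assessment.
    $SMARTCTL -H "$dev"
}
```

Run `start_short_test /dev/sdf`, wait for the estimated completion time that smartctl prints, then run `show_test_results /dev/sdf`. Replacing `short` with `long` requests the extended test, which takes considerably longer.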

My suggestion based on all of this would be to simply replace this failing drive with a new one.
 