smartd false positive SSD CurrentPendingSector?

Discussion in 'Proxmox VE: Installation and configuration' started by jermudgeon, Jun 15, 2018.

  1. jermudgeon

    jermudgeon New Member
    Proxmox VE Subscriber

    Joined:
    Apr 7, 2016
    Messages:
    8
    Likes Received:
    0
    Since April 30, I've gotten 7 warning emails from one host, 5 from another, and 2 from another;

    each email claims there is 1 CurrentPendingSector failed; that is, currently unreadable (pending).

    On each host, it's the same type of drive, a CT1000MX500SSD1 (Crucial 1TB)

    Running smartctl manually shows *no* sectors failed.

    systemctl status shows info like this:
    Jun 13 21:29:19 pm2 smartd[1196]: Device: /dev/sda [SAT], 1 Currently unreadable (pending) sectors
    Jun 14 00:29:18 pm2 smartd[1196]: Device: /dev/sda [SAT], No more Currently unreadable (pending) sectors, warning condition reset after 1 email

    Does this imply that the drives are actually kicking up errors but fixing them? (I thought SSDs did that silently, until wearout, with a different SMART attribute indicating percentage.)

    It doesn't appear to be service impacting, but I'm not finding good info via Google.

    Has anybody else seen this, or is this an obvious question for Crucial?
     
  2. Andrew Hart

    Andrew Hart Member

    Joined:
    Dec 1, 2017
    Messages:
    41
    Likes Received:
    3
    At least with the newest INTEL ssd drives Pending Sector no longer means the same thing. It is used to indicate that a block will be re-mapped soon, (as far as I can tell.)
    On hdd it always meant that a sector could not be read and the drive is hoping that you write to it so that it can be re-mapped. (Also, as far as I know.)
     
  3. jermudgeon

    jermudgeon New Member
    Proxmox VE Subscriber

    Joined:
    Apr 7, 2016
    Messages:
    8
    Likes Received:
    0
    Thanks, Andrew. That makes sense.
     
  4. 123paul

    123paul New Member

    Joined:
    Aug 31, 2018
    Messages:
    4
    Likes Received:
    0
    Did you manage to find a solution for this? As I started to get these messages on my homelab since today.

    I did find this specification from Micron for all there SMART variables btw, might be useful for someone who is having the same problems and start worrying about wearout on their disks.
     
  5. jermudgeon

    jermudgeon New Member
    Proxmox VE Subscriber

    Joined:
    Apr 7, 2016
    Messages:
    8
    Likes Received:
    0
    No change, I still get these occasionally, and then the count resets to zero.
     
  6. Andrew Hart

    Andrew Hart Member

    Joined:
    Dec 1, 2017
    Messages:
    41
    Likes Received:
    3
    If it is the same problem, you'll find the pending sectors will increase maybe up to 17 and then reset to 0 and remapped will increase by just 1.

    17 is the highest I've seen I think. So keep an eye on it and check that your ssd has new firmware.

    If you think that there are pending sectors you can read the disk "dd if=/dev/sda of=/dev/null bs=1M". It will crash your system if there are real pending sectors probably.
     
  7. stormtronix

    stormtronix New Member

    Joined:
    Jul 23, 2014
    Messages:
    3
    Likes Received:
    0
    We have the same Issue with the same SSDs CT1000MX500SSD1 (Crucial 1TB).
    I also noticed that smart does not know many attributes from the ssds:

    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x002f 100 100 000 Pre-fail Always - 0
    5 Reallocated_Sector_Ct 0x0032 100 100 010 Old_age Always - 0
    9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 561
    12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 7
    171 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 0
    172 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 0
    173 Unknown_Attribute 0x0032 098 098 000 Old_age Always - 34
    174 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 6
    180 Unused_Rsvd_Blk_Cnt_Tot 0x0033 000 000 000 Pre-fail Always - 42
    183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
    184 End-to-End_Error 0x0032 100 100 000 Old_age Always - 0
    187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
    194 Temperature_Celsius 0x0022 074 055 000 Old_age Always - 26 (Min/Max 0/45)
    196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
    197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x0032 100 100 000 Old_age Always - 0
    202 Unknown_SSD_Attribute 0x0030 098 098 001 Old_age Offline - 2
    206 Unknown_SSD_Attribute 0x000e 100 100 000 Old_age Always - 0
    210 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 0
    246 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 4555280040
    247 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 74855687
    248 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 991046634

    I did not find anything about what these unknown attributes could be - any idea?
     
  8. 123paul

    123paul New Member

    Joined:
    Aug 31, 2018
    Messages:
    4
    Likes Received:
    0
    I see I forgot to link the document I found in my previous reply. Seems I can't paste external links as my account is too new.

    Just Google this "tnfd22_client_ssd_smart_attributes.pdf"

    I'm still having this issue, seems to be a Crucial specific issue.
     
  9. 123paul

    123paul New Member

    Joined:
    Aug 31, 2018
    Messages:
    4
    Likes Received:
    0
    Found out there has been released a firmware (M3CR022) update in june to fix this issue. I will try this and report back in a couple of days to inform if this fixed it.
     
  10. stormtronix

    stormtronix New Member

    Joined:
    Jul 23, 2014
    Messages:
    3
    Likes Received:
    0
    Hi Paul,
    any success with the new firmware?
     
  11. 123paul

    123paul New Member

    Joined:
    Aug 31, 2018
    Messages:
    4
    Likes Received:
    0
    I wasn't able to update the firmware yet as I didn't manage to boot from a USB drive with the firmware release from crucial. I would need to attach it to a windows machine to try the firmware update there. But only have macs around for now.

    So if someone else manages to test it sooner I would be curious to know the outcome.
     
  1. This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Dismiss Notice