[SOLVED] disk wearout question

RobFantini

Is there a way to use smartctl or another CLI tool to verify the 'wearout' value shown at pve > system > disks?

We have 30 Intel S3610 drives, and all of them show 0% wearout.
 
That's strange; the S3610 is supported by smartmontools (which Proxmox uses).

Do you have a passthrough controller or a RAID controller?

You can try updating the drive model database with:

/usr/sbin/update-smart-drivedb
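
To see whether the update made a difference, one option is to look at the "Device is:" line that `smartctl -i` prints. A minimal sketch, assuming that line keeps the "Not in smartctl database" / "In smartctl database" wording; `drivedb_status` is a hypothetical helper name, not part of smartmontools:

```shell
# Hypothetical helper: reads `smartctl -i` output on stdin and reports
# whether the drive was matched against smartctl's drive database.
# Assumes the "Device is:" line wording used by smartctl 6.x.
drivedb_status() {
    awk -F': *' '/^Device is:/ {
            if ($2 ~ /^Not in smartctl database/) print "not in database"
            else print "in database"
            found = 1
        }
        END { if (!found) print "unknown" }'
}

# Usage (as root; replace /dev/sdX with the real device):
#   /usr/sbin/update-smart-drivedb
#   smartctl -i /dev/sdX | drivedb_status
```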
 
Hello, thank you for the reply.

We have LSI cards in IT mode.

I'll work on updating the database.

Here is more info. I assume attribute 233 is the wearout; its raw value seems very high:
Code:
# smartctl -a /dev/sdn
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-4-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     INTEL SSDSC2BX400G4R
Serial Number:    BTHC521505KZ400VGN
LU WWN Device Id: 5 5cd2e4 04c037f45
Add. Product Id:  DELL(tm)
Firmware Version: G201DL22
User Capacity:    400,088,457,216 bytes [400 GB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Thu Sep 13 01:39:13 2018 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x02) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (    2) seconds.
Offline data collection
capabilities:                    (0x79) SMART execute Offline immediate.
                                        No Auto Offline data collection support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  60) minutes.
Conveyance self-test routine
recommended polling time:        (  60) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000e   130   130   039    Old_age   Always       -       387002749
  5 Reallocated_Sector_Ct   0x0033   100   100   001    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       19186
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       28
 13 Read_Soft_Error_Rate    0x001e   130   100   000    Old_age   Always       -       4681970045
179 Used_Rsvd_Blk_Cnt_Tot   0x0033   100   100   010    Pre-fail  Always       -       0
180 Unused_Rsvd_Blk_Cnt_Tot 0x0032   100   100   000    Old_age   Always       -       10920
181 Program_Fail_Cnt_Total  0x003a   100   100   000    Old_age   Always       -       0
182 Erase_Fail_Count_Total  0x003a   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       32
195 Hardware_ECC_Recovered  0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   100   100   000    Old_age   Always       -       0
201 Unknown_SSD_Attribute   0x0033   100   100   010    Pre-fail  Always       -       373932431126
202 Unknown_SSD_Attribute   0x0027   100   100   000    Pre-fail  Always       -       0
226 Unknown_SSD_Attribute   0x0032   100   100   000    Old_age   Always       -       1126
227 Unknown_SSD_Attribute   0x0032   100   100   000    Old_age   Always       -       17
228 Power-off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       1150957
233 Media_Wearout_Indicator 0x0032   100   100   000    Old_age   Always       -       2184793
245 Unknown_Attribute       0x0032   099   099   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     18621         -
# 2  Short offline       Completed without error       00%     18621         -
# 3  Short offline       Completed without error       00%         3         -
# 4  Extended offline    Completed without error       00%         2         -
# 5  Short offline       Completed without error       00%         2         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 
What matters here is not the 'raw' value but the normalized one, and that is still at 100.
From [1]:
Reports the number of cycles the NAND media has undergone. The normalized value declines linearly from 100 to 1 as the average erase cycle count increases from 0 to the maximum rated cycles. Once the normalized value reaches 1, the number will not decrease, although it is likely that significant additional wear can be put on the device.

1: https://www.intel.com/content/dam/s...te-drives/Intel_SSD_Smart_Attrib_for_SATA.PDF
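
For a quick check from the CLI, the normalized number is the VALUE column (fourth field) of the attribute table that `smartctl -A` prints. A minimal sketch, assuming the GUI's wearout percentage corresponds to 100 minus that normalized value (an assumption, not confirmed in this thread); `wearout_233` is a hypothetical helper name:

```shell
# Hypothetical helper: reads `smartctl -A` output on stdin and prints the
# normalized VALUE of attribute 233 (Media_Wearout_Indicator).
# The "wearout" figure is derived as 100 - VALUE, which is an assumption
# about how the GUI number is computed.
wearout_233() {
    awk '$1 == 233 {
            v = $4 + 0
            printf "normalized=%d wearout=%d%%\n", v, 100 - v
            found = 1
        }
        END { if (!found) exit 1 }'
}

# Usage (as root; /dev/sdn is the device from the output above):
#   smartctl -A /dev/sdn | wearout_233
```

The function exits non-zero when no attribute 233 row is present, so it can be used in a loop over several disks to flag drives that report wear under a different attribute ID.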
 
Thanks for the reply.

I had read that 'normalized' is the number to look for; however, the word 'normalized' does not appear in pve > disks or in the smartctl -a output, so there is a little guesswork involved.

The good news is that all 30 used disks [bought at less than $100 each] are as good as new. With our write rate they will last 50 years.
 
The GUI has a column named 'normalized'.