[SOLVED] Wrong wearout indicator/parsing on SMART values

Kadrim

Hi there,

I am currently on Proxmox 6.1-8 and have a Samsung 970 EVO Plus NVMe SSD.

The SMART values differ quite a lot from those of other SSDs I have had:

Bash:
root@pve:~# smartctl -a /dev/nvme0
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.3.18-3-pve] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       Samsung SSD 970 EVO Plus 250GB
Serial Number:                      S4EUNF0M707690P
Firmware Version:                   2B2QEXM7
PCI Vendor/Subsystem ID:            0x144d
IEEE OUI Identifier:                0x002538
Total NVM Capacity:                 250,059,350,016 [250 GB]
Unallocated NVM Capacity:           0
Controller ID:                      4
Number of Namespaces:               1
Namespace 1 Size/Capacity:          250,059,350,016 [250 GB]
Namespace 1 Utilization:            250,058,153,984 [250 GB]
Namespace 1 Formatted LBA Size:     512
Namespace 1 IEEE EUI-64:            002538 5791b07e96
Local Time is:                      Mon Apr 13 09:25:31 2020 CEST
Firmware Updates (0x16):            3 Slots, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL Self_Test
Optional NVM Commands (0x005f):     Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp
Maximum Data Transfer Size:         512 Pages
Warning  Comp. Temp. Threshold:     85 Celsius
Critical Comp. Temp. Threshold:     85 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     7.80W       -        -    0  0  0  0        0       0
 1 +     6.00W       -        -    1  1  1  1        0       0
 2 +     3.40W       -        -    2  2  2  2        0       0
 3 -   0.0700W       -        -    3  3  3  3      210    1200
 4 -   0.0100W       -        -    4  4  4  4     2000    8000

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 +     512       0         0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        48 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    12%
Data Units Read:                    9,650,756 [4.94 TB]
Data Units Written:                 31,335,549 [16.0 TB]
Host Read Commands:                 281,661,987
Host Write Commands:                294,804,240
Controller Busy Time:               4,297
Power Cycles:                       82
Power On Hours:                     1,368
Unsafe Shutdowns:                   51
Media and Data Integrity Errors:    0
Error Information Log Entries:      21
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 1:               48 Celsius
Temperature Sensor 2:               51 Celsius

Error Information (NVMe Log 0x01, max 64 entries)
No Errors Logged

But the Proxmox Web-UI reports only 2%:

proxmox-wearout.png

I think the numbers are being parsed incorrectly. I did not notice this at first, because the wearout indicator was correct for values below 10%:

Real value (smartctl)    Value reported by Proxmox UI
0%                       0%
1%                       1%
2%                       2%
3%                       3%
4%                       4%
5%                       5%
6%                       6%
7%                       7%
8%                       8%
9%                       9%
10%                      cannot remember anymore ...
11%                      1%
12%                      2%

So it seems the correct field ("Percentage Used") is being read, but the parsing is broken: for two-digit values, only the last digit appears to end up in the UI.
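To illustrate how this kind of symptom could arise, here is a minimal sketch in Python. It is only a guess at the mechanism and is not taken from the Proxmox source: the function names and both regular expressions are hypothetical. It shows that a greedy `.*` in front of the digit group swallows every digit except the last one, which matches the 11% -> 1% and 12% -> 2% pattern above.

```python
import re

# Sample line as printed by smartctl (copied from the output above).
SAMPLE = "Percentage Used:                    12%"


def parse_wearout_buggy(line):
    """Hypothetical buggy parser: the greedy '.*' consumes the padding
    AND all leading digits, leaving only the last digit for the group."""
    match = re.search(r"Percentage Used:.*(\d+)%", line)
    return int(match.group(1)) if match else None


def parse_wearout_fixed(line):
    """Hypothetical fix: match only whitespace before the number,
    so the group captures every digit."""
    match = re.search(r"Percentage Used:\s*(\d+)%", line)
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    for used in (5, 11, 12):
        line = f"Percentage Used:                    {used}%"
        print(used, parse_wearout_buggy(line), parse_wearout_fixed(line))
```

Single-digit values come out the same in both versions, which would explain why the problem only becomes visible once the drive reaches 10% or more.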

Can anyone else confirm this?