[SOLVED] S.M.A.R.T ?

killmasta93

Renowned Member
Aug 13, 2017
Hi,
I was wondering if anyone else has encountered the non-medium error count.
In Proxmox, the disk status shows that everything is OK,
but I'm not sure what this line means:

Non-medium error count: 222

This is the full smartctl output:

root@prometheus4:~# smartctl -a /dev/sda
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-12-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor: HP
Product: MM2000JEFRC
Revision: HPD6
Compliance: SPC-4
User Capacity: 2,000,398,934,016 bytes [2.00 TB]
Logical block size: 512 bytes
Rotation Rate: 7200 rpm
Form Factor: 2.5 inches
Logical Unit id: 0x5000c5009e7873c3
Serial number: W46018510000K724K67T
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Wed Jul 22 00:12:23 2020 -05
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK

Current Drive Temperature: 31 C
Drive Trip Temperature: 60 C

Manufactured in week 01 of year 2017
Specified cycle count over device lifetime: 10000
Accumulated start-stop cycles: 49
Specified load-unload count over device lifetime: 300000
Accumulated load-unload cycles: 1206
Elements in grown defect list: 0

Vendor (Seagate) cache information
Blocks sent to initiator = 2664148568
Blocks received from initiator = 1268975600
Blocks read from cache and sent to initiator = 2284944291
Number of read and write commands whose size <= segment size = 738681331
Number of read and write commands whose size > segment size = 1445955

Vendor (Seagate/Hitachi) factory information
number of hours powered up = 28006.45
number of minutes until next internal SMART test = 41

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:          0        0         0         0          0     258649.766           0
write:         0        0         0         0          0      55679.375           0

Non-medium error count: 222

No self-tests have been logged

Each disk shows something different:

root@prometheus4:~# smartctl -a /dev/sdb
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-12-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor: HP
Product: EG1800JEHMD
Revision: HPD4
Compliance: SPC-4
User Capacity: 1,800,360,124,416 bytes [1.80 TB]
Logical block size: 512 bytes
Rotation Rate: 10520 rpm
Form Factor: 2.5 inches
Logical Unit id: 0x5000cca02c29eaf8
Serial number: 08GS1N8A
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Wed Jul 22 00:15:50 2020 -05
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK

Current Drive Temperature: 33 C
Drive Trip Temperature: 60 C

Manufactured in week 28 of year 2016
Specified cycle count over device lifetime: 50000
Accumulated start-stop cycles: 38
Specified load-unload count over device lifetime: 600000
Accumulated load-unload cycles: 875
Elements in grown defect list: 0

Vendor (Seagate) cache information
Blocks sent to initiator = 4615780398268416

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:          0     1475         0         2          0      44571.236           0
write:         0       30         0        30          0      31397.066           0

Non-medium error count: 144

No self-tests have been logged


root@prometheus4:~# smartctl -a /dev/sdd
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-12-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor: HP
Product: MM2000JEFRC
Revision: HPD6
Compliance: SPC-4
User Capacity: 2,000,398,934,016 bytes [2.00 TB]
Logical block size: 512 bytes
Rotation Rate: 7200 rpm
Form Factor: 2.5 inches
Logical Unit id: 0x5000c5009e754803
Serial number: W4600ASS0000K724703J
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Wed Jul 22 00:17:59 2020 -05
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK

Current Drive Temperature: 30 C
Drive Trip Temperature: 60 C

Manufactured in week 52 of year 2016
Specified cycle count over device lifetime: 10000
Accumulated start-stop cycles: 48
Specified load-unload count over device lifetime: 300000
Accumulated load-unload cycles: 1205
Elements in grown defect list: 0

Vendor (Seagate) cache information
Blocks sent to initiator = 3007409464
Blocks received from initiator = 232132432
Blocks read from cache and sent to initiator = 744370585
Number of read and write commands whose size <= segment size = 690604353
Number of read and write commands whose size > segment size = 1424016

Vendor (Seagate/Hitachi) factory information
number of hours powered up = 28006.48
number of minutes until next internal SMART test = 31

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:          0        0         0         0          0     245631.376           0
write:         0        0         0         0          0      46352.663           0

Non-medium error count: 179

No self-tests have been logged
 
Hi,
from the documentation:
non-medium error counter (only a single number displayed). This represents the number of recoverable events other than write, read or verify errors.

This means that these errors are not caused by problems with the disk itself, but maybe by a (minor) problem with cable/controller/firmware. All of these errors were recoverable and the count is rather low, so I wouldn't worry too much for the moment.
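To watch this counter across several disks, the value can be pulled out of the smartctl text. This is only a minimal sketch, assuming the SAS/SCSI output format shown in this thread; the function name parse_scsi_counters is made up for illustration and is not part of smartmontools.

```python
import re

# Labels exactly as they appear in the `smartctl -a` output posted in this thread.
_PATTERNS = {
    "non_medium_errors": re.compile(r"^Non-medium error count:\s*(\d+)", re.MULTILINE),
    "grown_defects": re.compile(r"^Elements in grown defect list:\s*(\d+)", re.MULTILINE),
}


def parse_scsi_counters(smartctl_text):
    """Return the counters found in the text; a missing counter maps to None."""
    result = {}
    for name, pattern in _PATTERNS.items():
        match = pattern.search(smartctl_text)
        result[name] = int(match.group(1)) if match else None
    return result


# Two lines copied from the /dev/sda output above.
sample = """\
Elements in grown defect list: 0
Non-medium error count: 222
"""
print(parse_scsi_counters(sample))
```

Saving the result once a day per disk would show whether the non-medium count is creeping up or staying flat.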
 
Thanks for the reply. So normally I could ignore those errors? What seems odd is a problem with the controller: it's a Dell H310 flashed to IT mode.
Even though the error count sometimes creeps up every day or week?
Thank you
 
Here is an example of a kind of error which increases the non-medium error counter. In that case, it's really just a minor thing.
Is there anything from S.M.A.R.T. or about the disks in your /var/log/syslog?
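One way to look is to filter the log for lines from smartd or lines that name one of the disks. A rough sketch, assuming a plain-text syslog and the device names from the first post; find_disk_messages is a hypothetical helper, not an existing tool.

```python
import re


def find_disk_messages(log_lines, devices=("sda", "sdb", "sdd")):
    """Return log lines that mention smartd or one of the given block devices."""
    # \b keeps "sda" from matching inside longer words; partitions such as
    # "sda1" are still matched through the optional trailing digits.
    device_re = re.compile(r"\b(?:%s)\d*\b" % "|".join(map(re.escape, devices)))
    hits = []
    for line in log_lines:
        if "smartd" in line or device_re.search(line):
            hits.append(line.rstrip("\n"))
    return hits
```

It accepts any iterable of lines, so an open file handle on /var/log/syslog (usually readable only as root) can be passed in directly.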
 
Thanks for the reply. So I should be worried if I get alerted about this line,
i.e. if it changes from 0 to any number?
Elements in grown defect list: 0

I haven't seen anything in the syslog.
 
Yes, that's one thing to look out for. If that number grows quickly, it's a pretty good indicator that the disk is approaching its end.
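What counts as "quickly" is a judgment call. As a sketch, two readings of the grown defect list could be compared like this; the threshold of 10 new defects per day is an arbitrary assumption for illustration, not a vendor figure, and grown_defect_alert is a made-up name.

```python
from datetime import datetime


def grown_defect_alert(prev_count, prev_time, curr_count, curr_time, max_per_day=10.0):
    """Classify the change in 'Elements in grown defect list' between two readings.

    Returns "ok" (no increase), "warn" (count rose) or "critical" (count rose
    faster than max_per_day, which is an arbitrary illustrative threshold).
    """
    added = curr_count - prev_count
    if added <= 0:
        return "ok"
    days = (curr_time - prev_time).total_seconds() / 86400
    # Guard against two readings taken at the same instant.
    rate = added / days if days > 0 else float("inf")
    return "critical" if rate > max_per_day else "warn"


yesterday = datetime(2020, 7, 21)
today = datetime(2020, 7, 22)
print(grown_defect_alert(0, yesterday, 0, today))   # unchanged
print(grown_defect_alert(0, yesterday, 3, today))   # first defects appear
print(grown_defect_alert(3, yesterday, 40, today))  # rapid growth
```

Any move away from 0 is reported as at least "warn", which matches the advice above to treat the first grown defects as something to look at.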
 