SMART errors with Megaraid

May 28, 2019
Every day since installing, I get this error:

This message was generated by the smartd daemon running on:

host name: sam
DNS domain: mclean.harvard.edu

The following warning/error was logged by the smartd daemon:

Device: /dev/bus/0 [megaraid_disk_11], Read SMART Self-Test Log Failed

Device info:
[HITACHI H106060SDSUN600G A3D0], lu id: 0x5000cca03c51d8ec, S/N: 001245PG0DND PZHG0DND, 600 GB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
The original message about this issue was sent at Fri May 24 15:35:48 2019 EDT
Another message will be sent in 24 hours if the problem persists.


But MegaCli64 begs to differ:

root@sam:~# /opt/MegaRAID/MegaCli/MegaCli64 -CfgDsply -aALL -nolog|grep ^State
State : Optimal
State : Optimal



It appears SMART isn't supported?

root@sam:~# smartctl -a /dev/sdb -d megaraid,22
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-12-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor: ORACLE
Product: CONCORD26
Revision: 0d00
Logical Unit id: 0x508002000000007d
Device type: enclosure
Transport protocol: SAS (SPL-3)
Local Time is: Tue May 28 10:13:30 2019 EDT
SMART support is: Unavailable - device lacks SMART capability.

=== START OF READ SMART DATA SECTION ===
Current Drive Temperature: 0 C
Drive Trip Temperature: 0 C

Error Counter logging not supported

Device does not support Self Test logging
root@sam:~#




So, what is the right thing to do here?
 
You can change which devices smartd scans in /etc/smartd.conf (the file is self-documented),
or you can disable smartd completely via systemd.
 
For my own, and others' future reference:

First, I checked what devices I'll need to use:

root@sam:~# storcli /c0 /eall /sall show
CLI Version = 007.0813.0000.0000 Dec 14, 2018
Operating system = Linux 4.15.18-12-pve
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.


Drive Information :
=================

----------------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp Type
----------------------------------------------------------------------------------
22:0 24 Onln 0 557.861 GB SAS HDD N N 512B H109060SESUN600G U -
22:1 25 Onln 0 557.861 GB SAS HDD N N 512B H109060SESUN600G U -
22:2 10 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:3 11 DHS 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:4 12 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:5 23 Onln 0 557.861 GB SAS HDD N N 512B ST960005SSUN600G U -
22:6 14 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:7 15 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:8 16 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:9 17 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:10 18 Onln 1 92.200 GB SATA SSD N N 512B INTEL SSDSA2BZ100G3S U -
22:11 19 Onln 1 92.200 GB SATA SSD N N 512B INTEL SSDSA2BZ100G3S U -
22:24 20 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
22:25 21 Onln 0 557.861 GB SAS HDD N N 512B H106060SDSUN600G U -
----------------------------------------------------------------------------------

EID-Enclosure Device ID|Slt-Slot No.|DID-Device ID|DG-DriveGroup
DHS-Dedicated Hot Spare|UGood-Unconfigured Good|GHS-Global Hotspare
UBad-Unconfigured Bad|Onln-Online|Offln-Offline|Intf-Interface
Med-Media Type|SED-Self Encryptive Drive|PI-Protection Info
SeSz-Sector Size|Sp-Spun|U-Up|D-Down/PowerSave|T-Transition|F-Foreign
UGUnsp-Unsupported|UGShld-UnConfigured shielded|HSPShld-Hotspare shielded
CFShld-Configured shielded|Cpybck-CopyBack|CBShld-Copyback Shielded


The column I need is 'DID', which is column 2 of each line that starts with 22:.
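Side note: the 22 in the earlier smartctl call looks like the enclosure ID (EID) from the table above, not a drive, which would explain the "Device type: enclosure" result. A minimal sketch for looking up the DID of a single slot follows; did_for_slot is a hypothetical helper name, not part of storcli, and it assumes the column layout shown above.

```shell
# Hypothetical helper: print the DID for one "EID:Slt" pair.
# Reads `storcli /c0 /eall /sall show` output on stdin and assumes
# column 1 is EID:Slt and column 2 is DID, as in the table above.
did_for_slot() {
    awk -v slot="$1" '$1 == slot { print $2 }'
}
```

With the table above, `storcli /c0 /eall /sall show | did_for_slot 22:3` should give 11, the drive named in the smartd mail, and that number can then be passed to smartctl as `-d megaraid,11`.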
Now I generate the lines I'm going to put into smartd.conf:

root@sam:~# for i in $(storcli /c0 /eall /sall show | grep "^22"|awk '{print $2}'); do echo "/dev/sda -a -d megaraid,$i -m root -M exec /usr/share/smartmontools/smartd-runner"; done
/dev/sda -a -d megaraid,24 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,25 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,10 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,11 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,12 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,23 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,14 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,15 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,16 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,17 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,18 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,19 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,20 -m root -M exec /usr/share/smartmontools/smartd-runner
/dev/sda -a -d megaraid,21 -m root -M exec /usr/share/smartmontools/smartd-runner
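The one-liner above hard-codes enclosure 22. A slightly more general sketch, which matches any row that starts with an EID:Slt pair, could look like the following. gen_smartd_lines is a hypothetical name; /dev/sda and the smartd-runner path are simply copied from my setup and may differ on other hosts.

```shell
# Hypothetical helper: emit one smartd.conf line per physical drive.
# Reads `storcli /c0 /eall /sall show` output on stdin and picks every
# row whose first column looks like "EID:Slt" (digits, colon, digits).
gen_smartd_lines() {
    awk '$1 ~ /^[0-9]+:[0-9]+$/ {
        # Column 2 is the DID, which smartd expects after "megaraid,".
        printf "/dev/sda -a -d megaraid,%s -m root -M exec /usr/share/smartmontools/smartd-runner\n", $2
    }'
}
```

Usage would be `storcli /c0 /eall /sall show | gen_smartd_lines`, which should print the same lines as above without depending on the enclosure number.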



I commented out the DEVICESCAN line, and added those instead.
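For anyone who would rather script that step than edit by hand, here is a rough sketch. disable_devicescan is a hypothetical helper name, and it assumes the DEVICESCAN directive starts its line (optionally after whitespace). As far as I understand the smartd.conf man page, an active DEVICESCAN entry makes smartd ignore the remaining lines, which is why it has to be commented out.

```shell
# Hypothetical helper: comment out every active DEVICESCAN directive
# in the given file, leaving a backup copy next to it with a .bak suffix.
disable_devicescan() {
    sed -i.bak 's/^[[:space:]]*DEVICESCAN/#&/' "$1"
}
```

I would try it on a copy of /etc/smartd.conf first, then add the megaraid lines and restart the daemon (on my system that would be something like `systemctl restart smartd`; the unit name may differ elsewhere).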
 
