Proxmox and HP RAID controller

Ramil

Nov 3, 2016
Hello. I updated the PVE kernel to version 2.6.32-48-pve and now I get these errors in the kernel log:

hpsa 0000:04:00.0: scsi 0:0:0:0 addition failed, device not added.
hpsa 0000:04:00.0: added scsi 0:0:0:0: RAID HP P410i controller SSDSmartPathCap- En- Exp=1 qd=1024
hpsa 0000:04:00.0: scsi 0:0:0:0 addition failed, device not added.
hpsa 0000:04:00.0: added scsi 0:0:0:0: RAID

The hpacucli utility also no longer works. Can anybody help me with this?

More information:

lspci | grep RAID
04:00.0 RAID bus controller: Hewlett-Packard Company Smart Array G6 controllers (rev 01)

lsmod | grep hp
hpwdt 7009 0
hpilo 7535 0
shpchp 29162 0
hpsa 95349 2
scsi_transport_sas 35463 1 hpsa

lsmod | grep sg
sg 29475 0
 
Which sources entry do you use for the HP tools, and which versions of hpacucli / hp-health do you have?
 
Code:
# /etc/apt/sources.list.d/hp.list
deb http://downloads.linux.HPE.com/SDR/downloads/MCP/Debian wheezy/current non-free

Code:
Package: hpacucli
Status: install ok installed
Priority: optional
Section: admin
Installed-Size: 18828
Maintainer: craig lamparter <Linux_SWdeliverables@external.groups.hp.com>
Architecture: amd64
Version: 9.40.1-1.
Depends: libc6 (>= 2.3.4), libgcc1 (>= 1:4.1.1), libstdc++6 (>= 4.1.1)
Description: HP Command Line Array Configuration Utility
The HP Command Line Array Configuration Utility is the disk
array configuration program for Array Controllers.

Package: hp-health
Status: deinstall ok config-files
Priority: optional
Section: admin
Installed-Size: 1886
Maintainer: bryan gartner <bryan.gartner@hp.com>
Architecture: amd64
Version: 9.4.0.1.7-5.
Config-Version: 9.4.0.1.7-5.
Depends: libc6 (>= 2.7), binutils, dmidecode, pciutils, libc6-i686 | lib32gcc1
Recommends: logrotate
Conffiles:
/etc/init.d/hp-asrd 052eab496f5fbfd1897336b85664e239
/etc/init.d/hp-health 750d79a7ec723cc5a86c435220a47f89
/etc/logrotate.d/hp-health d501c2186cc5a5b09a227f10f5c8a12e
Description: hp System Health Application and Command line Utility Package
This package contains the System Health Monitor for all hp Proliant systems
with ASM, ILO, & ILO2 embedded management asics.  Also contained are the
command line utilities.

This is taken from the CLI:
hpacucli - HP Array Configuration Utility CLI 9.40.12.0

Update:

Code:
05:00.0 RAID bus controller: Hewlett-Packard Company Smart Array G6 controllers (rev 01)
    Subsystem: Hewlett-Packard Company Smart Array P410i
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 64 bytes
    Interrupt: pin A routed to IRQ 28
    Region 0: Memory at fbc00000 (64-bit, non-prefetchable) [size=4M]
    Region 2: Memory at fbbf0000 (64-bit, non-prefetchable) [size=4K]
    Region 4: I/O ports at 4000 [size=256]
    [virtual] Expansion ROM at e7000000 [disabled] [size=512K]
    Capabilities: [40] Power Management version 3
        Flags: PMEClk- DSI- D1+ D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot-,D3cold-)
        Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
    Capabilities: [50] MSI: Enable- Count=1/1 Maskable- 64bit+
        Address: 0000000000000000  Data: 0000
    Capabilities: [70] Express (v2) Endpoint, MSI 00
        DevCap:    MaxPayload 256 bytes, PhantFunc 0, Latency L0s <1us, L1 <8us
            ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
        DevCtl:    Report errors: Correctable- Non-Fatal+ Fatal+ Unsupported-
            RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
            MaxPayload 256 bytes, MaxReadReq 4096 bytes
        DevSta:    CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
        LnkCap:    Port #0, Speed 5GT/s, Width x8, ASPM L0s, Latency L0 <512ns, L1 <64us
            ClockPM- Surprise- LLActRep- BwNot-
        LnkCtl:    ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
            ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
        LnkSta:    Speed 5GT/s, Width x4, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
        DevCap2: Completion Timeout: Not Supported, TimeoutDis+
        DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-
        LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-, Selectable De-emphasis: -6dB
             Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
             Compliance De-emphasis: -6dB
        LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
             EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
    Capabilities: [ac] MSI-X: Enable+ Count=16 Masked-
        Vector table: BAR=0 offset=001c2000
        PBA: BAR=0 offset=001c4000
    Capabilities: [100 v1] Advanced Error Reporting
        UESta:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
        UEMsk:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
        UESvrt:    DLP- SDES+ TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
        CESta:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
        CEMsk:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
        AERCap:    First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ ChkEn-
    Kernel driver in use: hpsa
 
Code:
06:00.0 RAID bus controller: Hewlett-Packard Company Smart Array G6 controllers (rev 01)
    Subsystem: Hewlett-Packard Company Smart Array P212
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 256 bytes
    Interrupt: pin A routed to IRQ 24
    Region 0: Memory at fb800000 (64-bit, non-prefetchable) [size=4M]
    Region 2: Memory at fbdff000 (64-bit, non-prefetchable) [size=4K]
    Region 4: I/O ports at d800 [size=256]
    Expansion ROM at fbd00000 [disabled] [size=512K]
    Capabilities: [40] Power Management version 3
        Flags: PMEClk- DSI- D1+ D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot-,D3cold-)
        Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
    Capabilities: [50] MSI: Enable- Count=1/1 Maskable- 64bit+
        Address: 0000000000000000  Data: 0000
    Capabilities: [70] Express (v2) Endpoint, MSI 00
        DevCap:    MaxPayload 256 bytes, PhantFunc 0, Latency L0s <1us, L1 <8us
            ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
        DevCtl:    Report errors: Correctable- Non-Fatal+ Fatal+ Unsupported-
            RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
            MaxPayload 256 bytes, MaxReadReq 4096 bytes
        DevSta:    CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend-
        LnkCap:    Port #0, Speed 5GT/s, Width x8, ASPM L0s, Latency L0 <512ns, L1 <64us
            ClockPM- Surprise- LLActRep- BwNot-
        LnkCtl:    ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
            ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
        LnkSta:    Speed 2.5GT/s, Width x8, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
        DevCap2: Completion Timeout: Not Supported, TimeoutDis+
        DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-
        LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-, Selectable De-emphasis: -6dB
             Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
             Compliance De-emphasis: -6dB
        LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete-, EqualizationPhase1-
             EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
    Capabilities: [ac] MSI-X: Enable+ Count=16 Masked-
        Vector table: BAR=0 offset=001c2000
        PBA: BAR=0 offset=001c4000
    Capabilities: [100 v1] Advanced Error Reporting
        UESta:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
        UEMsk:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt+ RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
        UESvrt:    DLP+ SDES+ TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
        CESta:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
        CEMsk:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
        AERCap:    First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ ChkEn-
    Kernel driver in use: hpsa
 
IIRC there were issues with some HP Smart Array controllers on older kernels (the 2.6.x series; as far as I remember, fixed starting with 2.6.37), where the hpsa driver overlapped with the cciss driver.

What you could do is add hpsa.hpsa_allow_any=1 to the boot (grub) params and see if it helps.
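
A minimal sketch of how that could be done on a Debian-style GRUB 2 setup, assuming the kernel command line lives in GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub. The function name add_kernel_param is made up for illustration, and it takes the file path as an argument so you can try it on a copy first:

```shell
#!/bin/sh
# Sketch only: append a kernel parameter to GRUB_CMDLINE_LINUX_DEFAULT.
# add_kernel_param is a hypothetical helper, not part of any package.
# Assumes GNU sed (-i) and a parameter without '|', '&' or '"' in it.
add_kernel_param() {
    grub_file=$1
    param=$2
    # Do nothing if the parameter is already on that line.
    if grep '^GRUB_CMDLINE_LINUX_DEFAULT=' "$grub_file" | grep -qF -- "$param"; then
        return 0
    fi
    # Insert the parameter just before the closing double quote.
    sed -i "s|^\(GRUB_CMDLINE_LINUX_DEFAULT=\"[^\"]*\)\"|\1 ${param}\"|" "$grub_file"
}

# Usage (as root), then regenerate the GRUB config and reboot:
#   cp /etc/default/grub /etc/default/grub.bak
#   add_kernel_param /etc/default/grub hpsa.hpsa_allow_any=1
#   update-grub
```

After the reboot, cat /proc/cmdline should show whether the parameter was actually passed to the kernel.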


In any case, it definitely works on 4.x kernels:

Code:
root@box:~# uname -a
Linux box 4.4.21-1-pve #1 SMP Thu Oct 27 09:31:44 CEST 2016 x86_64 GNU/Linux

root@box:~# hpssacli ctrl all show config | grep Smart
Smart Array P420i in Slot 0 (Embedded)
Smart Array P431 in Slot 1
Smart Array P431 in Slot 2


Also, HP has replaced hpacucli with hpssacli.
hpacucli doesn't always work with newer controllers.
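
If you are not sure which of the tools is installed on a given box, something like the following could pick one. This is only a sketch: find_hp_cli is a made-up helper name, ssacli is a further tool name that comes up later in this thread, and the order in which the names are tried is an assumption.

```shell
#!/bin/sh
# Sketch only: print the first HP array CLI found in PATH.
# find_hp_cli is a hypothetical helper; the search order is an assumption.
find_hp_cli() {
    for tool in ssacli hpssacli hpacucli; do
        if command -v "$tool" >/dev/null 2>&1; then
            printf '%s\n' "$tool"
            return 0
        fi
    done
    # None of the tools is installed.
    return 1
}

# Usage:
#   cli=$(find_hp_cli) && "$cli" ctrl all show config
```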
 
I've been hit by the same "addition failed, device not added" message on Red Hat, too, on a DL380 G5 machine. That's actually how I found this thread.

What I've discovered, by installing and uninstalling every version of the hpsa kernel module one by one, is that kmod-hpsa-3.4.10-120.rhel7u1.x86_64 is the last version that works okay. Note that 3.4.10 is the version of the hpsa kernel module, not the kernel version.

Newer versions still expose the logical volumes to the OS correctly, but hpacucli/hpssacli/ssacli no longer 'sees' the arrays/controllers. So you can use the storage, but you cannot query or modify the controller.

So check in your dmesg which version hpsa reports.

Here it looks like this:

[ 1.430705] HP HPSA Driver (v 3.4.10-120)
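
A small sketch for pulling just the version out of the kernel log, assuming the banner has the same "HP HPSA Driver (v ...)" format as the line above (other driver versions may print it differently). The function name hpsa_version_from_log is made up for illustration:

```shell
#!/bin/sh
# Sketch only: extract the hpsa driver version from kernel log text on stdin.
# hpsa_version_from_log is a hypothetical helper; it assumes the banner
# format "HP HPSA Driver (v X.Y.Z-N)" and prints nothing if no line matches.
hpsa_version_from_log() {
    sed -n 's/.*HP HPSA Driver (v \([^)]*\)).*/\1/p' | head -n 1
}

# Usage:
#   dmesg | hpsa_version_from_log
#   modinfo -F version hpsa    # alternative: ask the module itself
```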
 
"what I've discovered by installing/uninstalling one by one each and every version of hpsa kernel module is that version kmod-hpsa-3.4.10-120.rhel7u1.x86_64 is the last that works okay. notice that 3.4.10 is the version of the hpsa kernel module, not the kernel version."

Not sure about RHEL, but in general the whole 4.x kernel tree works just fine.

For example, on an old ProLiant DL360 G5 (4.10.8 kernel, openSUSE Leap) with hpsa 3.4.16, which is the latest ( https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable.git/tree/drivers/scsi/hpsa.c ):

Code:
box:~ # dmidecode | grep Name
        Product Name: ProLiant DL360 G5


box:~ # modinfo hpsa
filename:       /lib/modules/4.10.8-2.gea9dcd4-default/kernel/drivers/scsi/hpsa.ko
license:        GPL
version:        3.4.16-0
description:    Driver for HP Smart Array Controller version 3.4.16-0
author:         Hewlett-Packard Company


box:~ # hpssacli ctrl all show config
Smart Array E200i in Slot 0 (Embedded)
..

I haven't seen any issues on G5, G6, G8 or G9, so Proxmox with the current 4.4.x kernel shouldn't have any problems either.
 
