HP PROLIANT DL180 GEN9 AND PCI Express error

ufuk

Renowned Member
Dec 4, 2015
5
0
66
Hi,
My HP HP Smart Array P440 Controller on PCI Express Slot1 and i am getting below error



Uncorrectable PCI Express Error (Slot 1, Bus 0, Device 3, Function 0, Error status 0x00000020)
Unrecoverable System Error (NMI) has occurred. System Firmware will log additional details in a separate IML entry if possible
PCI Bus Error (Slot 0, Bus 0, Device 3, Function 0)

i want to know that is that about proxmox driver or other than this...

need any advice...

thanks
 
What says the IML log? You can read this easy with the HPtools.

deb http://downloads.linux.hp.com/SDR/downloads/MCP jessie/current non-free
deb http://downloads.linux.hp.com/SDR/downloads/MCP/debian jessie/current non-free

apt-key adv --recv-keys --keyserver keyserver.ubuntu.com 527BC53A2689B887
apt-key adv --recv-keys --keyserver keyserver.ubuntu.com FADD8D64B1275EA3
apt-get update
apt-get dist-upgrade
apt-get install hp-health hpssacli hponcfg

Or you can read important things in ILO.

Best Regards
 
thanks for your reply...

i wrote what ILO said...

150
icon_status_01_critical.gif
Critical PCI Bus 12/29/2015 18:15 12/29/2015 18:15 1 Uncorrectable PCI Express Error (Slot 1, Bus 0, Device 3, Function 0, Error status 0x00000020)
149
icon_status_01_critical.gif
Critical System Error 12/29/2015 18:15 12/29/2015 18:15 1 Unrecoverable System Error (NMI) has occurred. System Firmware will log additional details in a separate IML entry if possible
148
icon_status_01_critical.gif
Critical PCI Bus 12/29/2015 18:15 12/29/2015 18:15 1 PCI Bus Error (Slot 0, Bus 0, Device 3, Function 0)

above lines from my HP DL180 GEN9 ILO...
 
Sorry, your 3 images are not visible in your post. Is in the ILO generally an HW Error? When yes. You should contact HP Support for warranty. Can you post Screenshot from ILO status and IML?

Does the system is running normaly or is only this message in IML your problem? Or you are not able to install the system?
 
No image inmy post!?

my system works perfectly, but some times 4-5 days period, proxmox and also al my Virtual Apliance down, theni chack ILO i see these error linesa my P440 Smart Array controller and its pci controller

PCI-E Slot 1 HP Smart Array P440 Controller 749797-001 PDNMF0ARH7U2BX B 3.52

error is :
Critical PCI Bus 12/29/2015 18:15 12/29/2015 18:15 1 Uncorrectable PCI Express Error (Slot 1, Bus 0, Device 3, Function 0, Error status 0x00000020)
Critical System Error 12/29/2015 18:15 12/29/2015 18:15 1 Unrecoverable System Error (NMI) has occurred. System Firmware will log additional details in a separate IML entry if possible
Critical PCI Bus 12/29/2015 18:15 12/29/2015 18:15 1 PCI Bus Error (Slot 0, Bus 0, Device 3, Function 0)

P440 is on this PCI slot...
 
Hmm, ok yes, looks like an HW fault. For the first contact HP Support. They send you an script. With this you can generate an full systemreport from proxmox. I've done this a lot of times. And you can also send an report from ILO.
 
I got the same on both P420, and P420i(Integrated to systemboard)
My symptoms were an unusually hot drive controller 65°C, and similar errors. "Uncorrectable PCI Express Error (Slot 5, Bus 32, Device 2, Function 2, Error status 0x00000000)"

I pulled out the cache module and it solved all the problems.
The first time it happened I thought it was a band cache, but it happened to me on different HW.