Strange kernel messages

Patschi

Member
Jul 23, 2013
51
0
6
29
Austria
pkern.at
Some days ago I got strange kernel messages, but the system still runs without noticeable problems. But anyway, I'd like to know why this happens.

Message from syslogd@pve at Apr 6 20:04:02 ...
kernel:Uhhuh. NMI received for unknown reason 00 on CPU 1.

Message from syslogd@pve at Apr 6 20:04:02 ...
kernel:Do you have a strange power saving mode enabled?

Message from syslogd@pve at Apr 6 20:04:02 ...
kernel:Dazed and confused, but trying to continue

My setup is a PX60 server from Hetzner in Germany:
Intel® Xeon® E3-1270 v3
32 GB ECC RAM (4x 8 GB Micron Technology 18KSF1G72AZ-1G6E1)
2 x 2 TB SATA 6 Gb/s 7200 rpm (WD2000FYYZ-01UL1B1)
Intel i210 Gigabit Ethernet Card

Thanks for any suggestions!
 
Just received the nearly the same messages again (but now with a other reason):
Message from syslogd@pve at Apr 10 13:52:19 ...
kernel:Uhhuh. NMI received for unknown reason 30 on CPU 0.

Message from syslogd@pve at Apr 10 13:52:19 ...
kernel:Do you have a strange power saving mode enabled?

Message from syslogd@pve at Apr 10 13:52:19 ...
kernel:Dazed and confused, but trying to continue
 
Could be here that the kernel gets an event from a monitoring/fencing device on your motherboard ? Probably you should ask your hoster about this.
 
And always disable power management on servers, this has lead to a big pile of problems in the past and I don't think all bugs are fixed yet.
 
Dec 9 08:40:01 hp1 kernel: php[4188]: segfault at f7777524 ip f7777524 sp fff868e8 error 15 in ld-2.11.2.so[f7777000+1000]
Dec 9 09:40:01 hp1 kernel: php[4485]: segfault at f77f5524 ip f77f5524 sp ffee1df8 error 15 in ld-2.11.2.so[f77f5000+1000]
Dec 9 10:40:01 hp1 kernel: php[4503]: segfault at f7722524 ip f7722524 sp ff91b108 error 15 in ld-2.11.2.so[f7722000+1000]
Dec 9 14:40:01 hp1 kernel: php[13679]: segfault at f77af524 ip f77af524 sp ffac23c8 error 15 in ld-2.11.2.so[f77af000+1000]
 
That is php and has nothing to do with Proxmox VE. Where is php running in your system? Are you using some LXC container with php in it?
 
Yes. Squeeze is running on LXC client container.

This system does not have any LTS anymore. Update and the problems are probably gone. Maybe they are attacks on your PHP to gain more privileges?

Still it's not happy making to see such errors on Host log.

I would not be happy to see the errors at all :-D
Please consider updating to Jessie (6->8)
 
Clone it, upgrade it with another IP, unplug virtual network cable, change IP back to original then poweroff, replug cable, stop the old and start the new. Downtime of a few seconds.
 
You could use the clone to test everything and time the upgrade time, downtime on the production server is still a must.

If you do not want to take it down in general, please consider to migrate to a non-single-point-of-failure setup. Multiple servers for everything with load balancing etc. Downtime is normal and even 99.99 % uptime is almost an hour per year planed downtime (https://uptime.is/99.99). Works perfect for me. Every company should allow this or has a multi million budget for ha-setup :-D
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!