Node auto reboot.

DuyQuy

Member
Apr 26, 2022
17
0
6
Hi all.

We are having a problem with 1 node in our cluster that reboots automatically and we can't find the cause yet.

We are running with ProLiant DL360 Gen9. E5-2680v4, 196GB RAM.

we have checked the whole hardware and it is no problem and updated the ILO and raid firmware to the latest version.

System Health: OK

1678274742743.png

Messages log doesn't give us too much information. It just shows the system is rebooted and nothing. I have attached dmesg and pveversion at the bottom. Can anyone give me a hint on what to do?

Thanks for everything
 

Attachments

  • dmesg.txt
    107.9 KB · Views: 2
  • pveversion.txt
    1.4 KB · Views: 1
Hello,

Thank you for the dmesg and pveversion outputs!

Can you please also post/attach the Syslog from the mentioned node and another node from your cluster at the time when the server got rebooted? You can sort the syslog using journalctl tool as the following command (you may change the time/date):

Bash:
journalctl --since "2023-03-08 00:00" --until "2023-03-08 08:00" > /tmp/Syslog.log
 
Hi.

Thanks for your repply.

I've attached the syslog of the failed node as well as that of another node in the cluster below. Hope it helps.
 

Attachments

  • anothernode.txt
    20.2 KB · Views: 2
  • nodeproblem.txt
    196.4 KB · Views: 5
This is corosync config. My cluster has been running stable for several months since I upgraded it from pve 6 to 7
 

Attachments

  • corosync.txt
    1.9 KB · Views: 2
Hi.

I have an additional update that. This reboot happens randomly when I do some operation on proxmox gui for example start vm. or when I close any process on the VM it can also reboot our node.
 
Hi,

Thank you for the corosync config and the additional information!

Can you test the memtest on the mentioned server? I would also check both the power supplies and any components on the motherboard that handles power.
 
hello,

Almost the same thing is happening to me.

My server is randomly rebooting and I'm not being able to find the error. I also have proxmox v7.4.



Could you solve it?

Greetings
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!