Fencing and cluster state logging

Jun 8, 2016
344
69
68
47
Johannesburg, South Africa
We had a node fence itself as the IPMI reset counter reached zero (Motherboard logs) but we can't locate logging information leading up to this event.

Surely corosync or pvecm logs events such as losing quorum, occasional heartbeat messages being lost or discarded?

We previously appear to have identified an issue with a node being fenced after systemd-timesync jumped time due to a problem with one or more members of pool.ntp.org. We now subsequently run ntp which drifts to correct time instead of jumping and discards answers from ntp servers which are not in quorum with the majority...
 
Surely corosync or pvecm logs events such as losing quorum, occasional heartbeat messages being lost or discarded?
They are logged in the syslog, but the last messages before a reset might not be written to the log anymore.

We previously appear to have identified an issue with a node being fenced after systemd-timesync jumped time due to a problem with one or more members of pool.ntp.org. We now subsequently run ntp which drifts to correct time instead of jumping and discards answers from ntp servers which are not in quorum with the majority...
For a stable time, best use a local ntp server for the cluster, then the time is only synced from one source, lose to the cluster.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!