Marking ringid 0 interface FAULTY

itvietnam

Renowned Member
Aug 11, 2015
132
4
83
Hi,

Our corosync is usually said this and recover after 1 second. may i know how to debug?

Test omping in 5 minutes and ok. No packet lost.

2018-10-28_17-41-27.png

This is the test on ringid 1, they are also has the same problem with ringid 0.

2018-10-28_17-49-07.png

Error of ringid 1 (10.20.30.0/24)

2018-10-28_18-00-08.png
 
Last edited:
The `omping` command invocations are explained in our documentation.

For the tcpdump, I'd do something like
`tcpdump -s0 -w corosyncproblem.pcap -ni $IFACE`, where `$IFACE` is replaced by the interface your corosync network is bound on.

after gathering enough data (the log saying that the ring became faulty and ok again) - stop tcpdump with Ctrl-C, take the resulting corosyncproblem.pcap and open it in wireshark.
 
You could try to set corosync to log debug messages - and see if the logs show anything with more information