Each one of the machines are on their own switch together with their own connectionsThe classic suspicion is that your corosync network is weak, has high latency or is not separate from a (congested) "main" network cable. Remember that VLANs do NOT have separate wires per LAN ;-)
Then the obvious recommendation would be to establish physically one (or two) independent cable-networks exclusively for corosync...
Disclaimer: pure guessing...
Okay, great!Each one of the machines are on their own switch together with their own connections
~# journalctl --grep fenc
-- Boot 2052e14b8b124d3e8cca747c9c998d64 --
-- Boot 8a67ec26cb82418592d5b22a0cd97c3d --
-- Boot 46c01a76137346028f356e323f5f8bcd --
Dec 18 19:03:27 pvem pve-ha-crm[2232]: node 'pvei': state changed from 'unknown' => 'fence'
Dec 18 19:04:28 pvem pve-ha-crm[2232]: fencing: acknowledged - got agent lock for node 'pvei'
Dec 18 19:04:28 pvem pve-ha-crm[2232]: node 'pvei': state changed from 'fence' => 'unknown'
~# corosync-cfgtool -n
Local node ID 10, transport knet
nodeid: 2 reachable
LINK: 0 udp (10.3.16.13->10.3.16.9) enabled connected mtu: 1397
LINK: 1 udp (10.11.16.13->10.11.16.9) enabled connected mtu: 1397
...
We use essential cookies to make this site work, and optional cookies to enhance your experience.