I built a three-node cluster, and everything was working correctly.
All networking was tested and working, including failover using dual NICs across two switches.
Node status:
PVE-HYP-04P10 (warning)
PVE-HYP-05P10 (red/offline)
PVE-HYP-06P10 (red/offline)
pvecm status from PVE-HYP-04P10:
Nodes: 1
Quorate: No
Activity blocked
Members: 0x00000003 172.20.34.4 (local) only
Ping results:
ping 172.20.34.5 → 100% packet loss
ping 172.20.34.6 → 0% packet loss
corosync-cfgtool -s:
nodeid: 1 disconnected
nodeid: 2 connected
nodeid: 3 localhost
This shows 04P10 could reach 06P10 but not 05P10, and the cluster had lost quorum overnight.
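For context, here is my understanding of why a single visible node ends up with "Activity blocked". This is only a rough sketch of the majority rule, assuming the default of one vote per node and no QDevice; the function names `quorum_needed` and `is_quorate` are made up for illustration and are not part of any Proxmox or corosync tool.

```shell
# Votes needed for a majority when every node has one vote: floor(n / 2) + 1
quorum_needed() {
    echo $(( $1 / 2 + 1 ))
}

# Succeeds (exit status 0) when the visible votes reach the majority.
# $1 = expected votes in the cluster, $2 = votes this node can currently see
is_quorate() {
    [ "$2" -ge "$(quorum_needed "$1")" ]
}

# Three-node cluster where pvecm reports "Nodes: 1"
echo "votes needed: $(quorum_needed 3)"
if is_quorate 3 1; then
    echo "quorate"
else
    echo "not quorate"
fi
```

With three expected votes the majority is two, so a node that only counts itself as a member cannot be quorate.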
What is even weirder is that the corosync network is a VLAN on the same trunk that carries three other VLANs, all of which were working and could ping each other.
But on the corosync VLAN, 04P10 and 05P10 could not reach each other.
Any ideas?