HI,
it took me some time, but I finally understood that is going on.
Before I open bugs on bugzilla, I want to make sure I get things right.
1. BUG: Proxmox docs state, that PM 6 cluster does not require multicast.
"Note: Proxmox VE 6.0 uses corosync 3 which switched out the underlying transport stack with Kronosnet (knet). Kronosnet currently only supports unicast."
https://pve.proxmox.com/wiki/Multicast_notes
However during a joining of a new node, it actually sends multicast packets! So docs should clearly state that!
Which brings us to bug numero 2.
2. Proxmox bridge blocks multicast by default.
As Proxmox uses multicast packets for joining node into the cluster, it should by default allow them.
If one set's Proxmox cluster network on a Linux bridge, it will not join cluster, because Linux bridge will block multicast traffic.
If the same node (after rollback) is added again, but this time using a physical interface, it joins perfectly.
Yeah and I know what igmp snooping is and it is disabled on my switches for cluster communication.
It guess it has even been disabled by default in linux bridge in first PM 6.* releases, however somewhere along the line, you changed that default behaviour and it blocks now.
So I would suggest that for 1. fix the docs and for 2. enable querier by default or disable snooping by default or fix the docs and say, PM clusters are not supported when cluster ring is on a bridge.
Should I open bug reports now?
it took me some time, but I finally understood that is going on.
Before I open bugs on bugzilla, I want to make sure I get things right.
1. BUG: Proxmox docs state, that PM 6 cluster does not require multicast.
"Note: Proxmox VE 6.0 uses corosync 3 which switched out the underlying transport stack with Kronosnet (knet). Kronosnet currently only supports unicast."
https://pve.proxmox.com/wiki/Multicast_notes
However during a joining of a new node, it actually sends multicast packets! So docs should clearly state that!
Which brings us to bug numero 2.
2. Proxmox bridge blocks multicast by default.
As Proxmox uses multicast packets for joining node into the cluster, it should by default allow them.
If one set's Proxmox cluster network on a Linux bridge, it will not join cluster, because Linux bridge will block multicast traffic.
If the same node (after rollback) is added again, but this time using a physical interface, it joins perfectly.
Yeah and I know what igmp snooping is and it is disabled on my switches for cluster communication.
It guess it has even been disabled by default in linux bridge in first PM 6.* releases, however somewhere along the line, you changed that default behaviour and it blocks now.
Code:
(0.53) #show igmpsnooping
Admin Mode..................................... Disable
Multicast Control Frame Count.................. 0
IGMP Router-Alert check........................ Disabled
Interfaces Enabled for IGMP Snooping........... None
VLANs enabled for IGMP snooping................ None
root@p35:~# cat /sys/devices/virtual/net/vmbr0/bridge/multicast_snooping
1
root@p35:~# cat /sys/devices/virtual/net/vmbr0/bridge/multicast_querier
0
So I would suggest that for 1. fix the docs and for 2. enable querier by default or disable snooping by default or fix the docs and say, PM clusters are not supported when cluster ring is on a bridge.
Should I open bug reports now?
Last edited: