Hah... that really sounds strange... and interesting
I will be honest: I used an AI to analyse the information you provided. Here is what I found (that is, what the AI recommends checking):
1. ---
"
Partner Mac Address: 00:00:00:00:00:00
Partner Churn State: churned
In a healthy LACP (802.3ad) setup, the "Partner Mac Address" should be the MAC of your physical switch. The fact that it is all zeros means your server has not received a single LACP Control Protocol Data Unit (PDU) from the switch."
IMHO this means you should check the switch-side configuration (LACP is probably not configured properly there).
2. ---
"
Number of ports: 1
Active Aggregator ID: 2 (only eno2 is active)
Even though you have four interfaces (eno1 through eno4), they are all in different Aggregator IDs (1, 2, 3, and 4). Because they can't see a common partner, the bonding driver is forced to pick just one interface to keep the network alive, rather than "bonding" them into a single 4Gbps pipe."
IMHO this points in the same direction: the configuration on the switch.
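If you want to check these two symptoms mechanically rather than by eye, here is a minimal Python sketch. The sample text is something I typed to mimic the layout of /proc/net/bonding/bond0 (it is not your real output), and the function names `aggregator_ids` and `partner_is_missing` are just names I picked for illustration.

```python
import re
from collections import defaultdict

# Illustrative sample only: shaped like /proc/net/bonding/bond0, values invented.
SAMPLE = """\
Active Aggregator Info:
\tAggregator ID: 2
\tNumber of ports: 1
\tPartner Mac Address: 00:00:00:00:00:00

Slave Interface: eno1
Aggregator ID: 1

Slave Interface: eno2
Aggregator ID: 2
"""

def aggregator_ids(text):
    """Map each aggregator ID to the slave interfaces that joined it."""
    groups = defaultdict(list)
    slave = None
    for raw in text.splitlines():
        line = raw.strip()
        m = re.match(r"Slave Interface:\s*(\S+)", line)
        if m:
            slave = m.group(1)
            continue
        m = re.match(r"Aggregator ID:\s*(\d+)", line)
        # Ignore the "Active Aggregator Info" entry, which appears before any slave.
        if m and slave is not None:
            groups[int(m.group(1))].append(slave)
    return dict(groups)

def partner_is_missing(text):
    """True if the first reported partner MAC is all zeros."""
    m = re.search(r"Partner Mac Address:\s*(\S+)", text)
    return m is not None and m.group(1) == "00:00:00:00:00:00"

print(aggregator_ids(SAMPLE))
print("no LACP partner seen:", partner_is_missing(SAMPLE))
```

In a healthy bond I would expect all slaves to be listed under a single aggregator ID. To run this against the real host, replace SAMPLE with the contents of /proc/net/bonding/bond0.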
3. ---
"
Critical Typo:
In your configuration, you have:
bond-lcap-rate fast
It should be:
bond-lacp-rate fast (The 'a' and 'c' are swapped)."
And the recommended change is:
Code:
auto bond0
iface bond0 inet manual
    bond-slaves eno1 eno2 eno3 eno4
    bond-mode 802.3ad
    bond-miimon 100
    # Fixed the typo here
    bond-lacp-rate fast
    bond-xmit-hash-policy layer2+3
    # Ensures the bond stays up if at least 1 link is alive
    bond-min-links 1
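Since a swapped letter like this is easy to miss, here is a small Python sketch that flags unknown bond-* option names and suggests the closest known one. The KNOWN_OPTIONS list is only a partial list of options I am aware of (an assumption on my side, extend it as needed), and `suspicious_options` is a hypothetical helper name.

```python
import difflib

# Partial list of bonding option names; an assumption, not a complete reference.
KNOWN_OPTIONS = [
    "bond-slaves", "bond-mode", "bond-miimon", "bond-lacp-rate",
    "bond-xmit-hash-policy", "bond-min-links", "bond-primary",
    "bond-updelay", "bond-downdelay",
]

def suspicious_options(stanza):
    """Return (found, suggestion) pairs for unknown bond-* option names."""
    problems = []
    for line in stanza.splitlines():
        words = line.split()
        if not words or not words[0].startswith("bond-"):
            continue
        name = words[0]
        if name not in KNOWN_OPTIONS:
            close = difflib.get_close_matches(name, KNOWN_OPTIONS, n=1)
            problems.append((name, close[0] if close else None))
    return problems

# The stanza with the typo, as quoted above.
stanza = """\
auto bond0
iface bond0 inet manual
    bond-slaves eno1 eno2 eno3 eno4
    bond-mode 802.3ad
    bond-lcap-rate fast
"""
print(suspicious_options(stanza))
```

An empty result only means that every bond-* name is on my list, not that the values are correct.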
Last but not least: when things work, eno1 shows
...
details partner lacp pdu:
...
system mac address: f4:74:70:3b:e2:53
....
while on all others it is:
....
details partner lacp pdu:
...
system mac address: 00:00:00:00:00:00
....
which again points me to a switch-side misconfiguration.
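To compare the partner MAC per interface without scrolling through the whole output, something like the following Python sketch could help. The sample mimics the "details partner lacp pdu" section of /proc/net/bonding/bond0. The partner MACs are the two values quoted above, while the actor MAC and the names `partner_macs` and `silent_ports` are made up by me for illustration.

```python
# Illustrative sample: partner MACs as quoted above, actor MAC invented.
SAMPLE = """\
Slave Interface: eno1
details actor lacp pdu:
    system mac address: aa:bb:cc:dd:ee:01
details partner lacp pdu:
    system mac address: f4:74:70:3b:e2:53

Slave Interface: eno2
details actor lacp pdu:
    system mac address: aa:bb:cc:dd:ee:01
details partner lacp pdu:
    system mac address: 00:00:00:00:00:00
"""

def partner_macs(text):
    """Map each slave interface to the partner system MAC it reports."""
    macs = {}
    slave = None
    in_partner = False
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("Slave Interface:"):
            slave = line.split(":", 1)[1].strip()
            in_partner = False
        elif line.startswith("details actor lacp pdu"):
            in_partner = False
        elif line.startswith("details partner lacp pdu"):
            in_partner = True
        elif in_partner and slave and line.startswith("system mac address:"):
            macs[slave] = line.split(":", 1)[1].strip()
    return macs

def silent_ports(text):
    """Slaves whose partner MAC is all zeros, i.e. no LACP PDU seen from the switch."""
    return [s for s, mac in partner_macs(text).items()
            if mac == "00:00:00:00:00:00"]

print(partner_macs(SAMPLE))
print("ports without a partner:", silent_ports(SAMPLE))
```

Every interface listed by `silent_ports` is one whose switch port I would look at first.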
I would proceed as follows:
- On the host side, I would leave the bond with only one interface (let's say eno1). I would comment out the others and, if I had the chance, I would even unplug their cables.
- On the switch itself, I would configure an IP address from the same network (192.168.3.x/24).
In this configuration (Linux host + bond 802.3ad + vmbr0 + eno1 (only) <--> switch with 4 ports in LACP), I would first verify that everything works at least between the host and the switch.
Once I am sure of that, I would add the remaining 3 interfaces one by one, and after adding each interface, I would make sure that things continue to work.
Only when all 4 interfaces are in the host's bond, and I am convinced that the connection to the switch works, would I start investigating problems with the connection to the gateway (if there are still any).
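Just to make the staged approach concrete, here is a tiny Python sketch that prints the bond-slaves line I would use at each stage. The helper name `staged_slave_lines` is hypothetical, and the order of the interfaces is my assumption. The connectivity check after each stage still has to be done by hand.

```python
def staged_slave_lines(interfaces):
    """Yield the bond-slaves line for each stage, adding one interface per stage."""
    for count in range(1, len(interfaces) + 1):
        yield "bond-slaves " + " ".join(interfaces[:count])

stages = staged_slave_lines(["eno1", "eno2", "eno3", "eno4"])
for stage, line in enumerate(stages, start=1):
    print(f"stage {stage}: {line}")
```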
I'm looking forward to your feedback.
Best,
NT