We have a VLAN that has connection issues a couple of times per week. VLAN 3 carries most of our server VMs, such as LDAP, DHCP, Nextcloud and 20 others.
We can run traceroute to addresses on all VLANs except VLAN 3. Examples follow:
Code:
# vlan 3
# traceroute mail
traceroute to mail (10.1.3.14), 30 hops max, 60 byte packets
1 * * *
2 * * *
...
25 * * *
26 * * *
27 * * *
28 * * *
29 * * *
30 * * *
# OK
# traceroute pve2
traceroute to pve2 (10.1.10.2), 30 hops max, 60 byte packets
1 _gateway (10.1.3.1) 0.808 ms 1.223 ms 2.888 ms
2 pve2.fantinibakery.com (10.1.10.2) 0.166 ms 0.139 ms 0.112 ms
# traceroute alex
traceroute to alex (10.1.37.81), 30 hops max, 60 byte packets
1 _gateway (10.1.3.1) 1.365 ms 1.618 ms 1.932 ms
2 alex.fantinibakery.com (10.1.37.81) 0.458 ms 0.514 ms 0.479 ms
I know it is impossible to debug our issue without the switch and cluster network config.
All VLANs are tagged at the Netgear M5300.
We use a bridged network on PVE over an LACP bond.
On PVE we use just vmbr0:
Code:
auto vmbr0
iface vmbr0 inet static
    address 10.1.10.3/24
    gateway 10.1.10.1
    bridge-ports bond0
    bridge-stp off
    bridge-fd 0
    bridge-vlan-aware yes
    bridge-vids 2-4094

auto vmbr3
iface vmbr3 inet static
    address 10.1.130.3/24
    bridge-ports bond3
    bridge-stp off
    bridge-fd 0
    bridge-vlan-aware yes
    bridge-vids 2-4094
    mtu 9000
We do not have separate entries per VLAN, like vmbr0.3. Perhaps those are needed?
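For reference, the kind of per-VLAN entry I mean would look roughly like the sketch below. The address 10.1.3.3/24 is a made-up placeholder and not something from our config. As far as I understand, on a VLAN-aware bridge the guests get their tag from the VLAN Tag setting on their virtual NIC, so an entry like this would only give the PVE host itself an address in VLAN 3.
Code:
# hypothetical example, the address is a placeholder
auto vmbr0.3
iface vmbr0.3 inet static
    address 10.1.3.3/24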
Any suggestions on a direction to take to try to fix this?