Network issues with 5.4.114-1 kernel and openvswitch LAG

e100

We use openvswitch and tagged VLANs.
After rebooting into kernel 5.4.114-1 we started having network issues:
SSH connections would break and live migrations failed.
Eventually networking stopped entirely.

Rebooted with kernel 5.4.106-1 and everything works fine again.

Intel 10G network card

Not sure what other information would be helpful, but this seems like an issue with the kernel.

# pveversion -v
Code:
proxmox-ve: 6.4-1 (running kernel: 5.4.106-1-pve)
pve-manager: 6.4-6 (running version: 6.4-6/be2fa32c)
pve-kernel-5.4: 6.4-2
pve-kernel-helper: 6.4-2
pve-kernel-5.4.114-1-pve: 5.4.114-1
pve-kernel-5.4.106-1-pve: 5.4.106-1
pve-kernel-5.4.78-2-pve: 5.4.78-2
pve-kernel-5.4.34-1-pve: 5.4.34-2
ceph: 14.2.20-pve1
ceph-fuse: 14.2.20-pve1
corosync: 3.1.2-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.20-pve1
libproxmox-acme-perl: 1.1.0
libproxmox-backup-qemu0: 1.0.3-1
libpve-access-control: 6.4-1
libpve-apiclient-perl: 3.1-3
libpve-common-perl: 6.4-3
libpve-guest-common-perl: 3.1-5
libpve-http-server-perl: 3.2-2
libpve-storage-perl: 6.4-1
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.6-2
lxcfs: 4.0.6-pve1
novnc-pve: 1.1.0-1
openvswitch-switch: 2.12.3-1
proxmox-backup-client: 1.1.6-2
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.5-4
pve-cluster: 6.4-1
pve-container: 3.3-5
pve-docs: 6.4-2
pve-edk2-firmware: 2.20200531-1
pve-firewall: 4.1-3
pve-firmware: 3.2-3
pve-ha-manager: 3.1-1
pve-i18n: 2.3-1
pve-qemu-kvm: 5.2.0-6
pve-xtermjs: 4.7.0-3
pve-zsync: 2.1-1
qemu-server: 6.4-2
smartmontools: 7.2-pve2
spiceterm: 3.1-1
vncterm: 1.6-2
zfsutils-linux: 2.0.4-pve1
Code:
allow-vmbr0 enp8s0f0
iface enp8s0f0 inet manual

allow-vmbr0 enp8s0f1
iface enp8s0f1 inet manual

allow-vmbr0 bond0
iface bond0 inet manual
    ovs_bridge vmbr0
    ovs_type OVSBond
    ovs_bonds enp8s0f0 enp8s0f1
    ovs_options bond_mode=balance-tcp lacp=active other_config:lacp-time=fast

allow-ovs vmbr0
iface vmbr0 inet manual
    ovs_type OVSBridge
    ovs_ports bond0 vlan9 vlan6 vlan7

# mgmt lan
allow-vmbr0 vlan9
iface vlan9 inet static
    ovs_type OVSIntPort
    ovs_bridge vmbr0
    ovs_options tag=9
    ovs_extra set interface ${IFACE} external-ids:iface-id=$(hostname -s)-${IFACE}-vif
    address x.x.x.x
    netmask 255.255.255.0
    gateway x.x.x.x
 
After rebooting into kernel 5.4.114-1 we started having network issues:
SSH connections would break and live migrations failed.
Eventually networking stopped entirely.
Roughly how long did it take to go from working, at least initially, to complete breakage?

Not sure what other information would be helpful, but this seems like an issue with the kernel.
Is there anything suspicious in the kernel log (dmesg), such as NIC link flapping or other seemingly network-related errors?
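
One possible way to check for link flapping is sketched below. It is only a rough helper, not an official diagnostic: the function name `count_link_events` is made up for this example, and the grep pattern assumes the NIC driver logs "NIC Link is Up" / "NIC Link is Down" (as Intel's ixgbe and i40e drivers do), which may not match the driver in use here.

```shell
#!/bin/sh
# Count NIC link up/down messages in kernel log text read from stdin.
# Assumption: the driver logs "NIC Link is Up" / "NIC Link is Down";
# adjust the pattern if your driver words these messages differently.
count_link_events() {
    # grep -c prints the number of matching lines; "|| true" keeps the
    # exit status clean when there are no matches (count 0).
    grep -ciE 'NIC Link is (Up|Down)' || true
}

# Typical use on the affected node (as root), while running the bad kernel:
#   dmesg | count_link_events
#   journalctl -k -b -1 | count_link_events   # previous boot, needs a persistent journal
#
# The OVS side of the bond from the config above can be inspected with:
#   ovs-appctl bond/show bond0
#   ovs-appctl lacp/show bond0
```

A count that keeps growing on 5.4.114-1 but stays flat on 5.4.106-1 would point at the links themselves, while a stable link with a broken LACP state in the `ovs-appctl` output would point more towards the bond.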
 
