Hello community!
I have a problem on one of my Proxmox hosts. The NIC to which my main network bridge is connected goes DOWN and UP again every few seconds.
Communication between the VMs is not interrupted; only communication between the VMs and the “outside world” is affected.
The problem does not occur constantly, but every few days. Even a restart did not always solve the problem.
To be honest, I don't know exactly how I fixed the error the last few times ...
At first, I thought it was a hardware problem. Then I put the bridge on the other 10G NIC. After a short time, the error occurred there too.
Now I have temporarily placed the bridge on one of the 1G NICs. For the time being, everything is working again, although more slowly of course.
Here are some brief details about the host:
The server was ordered in this configuration, with the following components:
- Mainboard H12DSi-NT6 with 2x10G Broadcom BCM57416 (these NICs are the ones affected by the error)
- 1x BCM95719A1904AC PCIe 4x 1G NICs (these work for the time being)
I don't think the rest of the data is important. I can provide it if needed.
This is a short section of the dmesg output:
Code:
[ 890.223359] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Down
[ 890.225092] vmbr1: port 1(eno1np0) entered disabled state
[ 892.723143] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Up, 10000 Mbps full duplex, Flow control: ON - receive & transmit
[ 892.723152] bnxt_en 0000:02:00.0 eno1np0: EEE is not active
[ 892.723154] bnxt_en 0000:02:00.0 eno1np0: FEC autoneg off encoding: None
[ 892.723184] vmbr1: port 1(eno1np0) entered blocking state
[ 892.723199] vmbr1: port 1(eno1np0) entered forwarding state
[ 895.722180] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Down
[ 895.724029] vmbr1: port 1(eno1np0) entered disabled state
[ 897.972793] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Up, 10000 Mbps full duplex, Flow control: ON - receive & transmit
[ 897.972805] bnxt_en 0000:02:00.0 eno1np0: EEE is not active
[ 897.972807] bnxt_en 0000:02:00.0 eno1np0: FEC autoneg off encoding: None
[ 897.972855] vmbr1: port 1(eno1np0) entered blocking state
[ 897.972875] vmbr1: port 1(eno1np0) entered forwarding state
[ 900.231984] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Down
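To put numbers on the flapping, a minimal sketch like the following could be used. It assumes the log lines have exactly the format of the excerpt above; `link_events` and `down_durations` are hypothetical helper names chosen for illustration, not part of any existing tool.

```python
import re

# Matches bnxt_en link-state lines in the format shown in the dmesg excerpt.
# Assumption: other drivers or kernel versions may word these lines differently.
LINK_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+bnxt_en\s+\S+\s+(?P<iface>\S+): "
    r"NIC Link is (?P<state>Up|Down)"
)


def link_events(lines):
    """Return (timestamp, interface, state) tuples for every link-state line."""
    events = []
    for line in lines:
        match = LINK_RE.search(line)
        if match:
            events.append(
                (float(match.group("ts")), match.group("iface"), match.group("state"))
            )
    return events


def down_durations(events):
    """Return (interface, seconds) for each Down that is followed by an Up."""
    last_down = {}
    durations = []
    for ts, iface, state in events:
        if state == "Down":
            last_down[iface] = ts
        elif iface in last_down:
            durations.append((iface, round(ts - last_down.pop(iface), 3)))
    return durations


# A few lines copied from the excerpt above, used as sample input.
SAMPLE = """\
[ 890.223359] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Down
[ 890.225092] vmbr1: port 1(eno1np0) entered disabled state
[ 892.723143] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Up, 10000 Mbps full duplex, Flow control: ON - receive & transmit
[ 895.722180] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Down
[ 897.972793] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Up, 10000 Mbps full duplex, Flow control: ON - receive & transmit
[ 900.231984] bnxt_en 0000:02:00.0 eno1np0: NIC Link is Down
"""

events = link_events(SAMPLE.splitlines())
for iface, seconds in down_durations(events):
    print(f"{iface}: link was down for {seconds} s")
```

Run against the full dmesg output instead of `SAMPLE`, this would show whether the link stays down for a roughly constant time on each flap, which may help when comparing the two 10G ports.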
This is my /etc/network/interfaces:
Code:
auto lo
iface lo inet loopback

auto eno1np0
iface eno1np0 inet manual
#to switch (does not work either)

auto eno2np1
iface eno2np1 inet manual
#defective (not defective after all)

auto enp161s0f1
iface enp161s0f1 inet manual

auto enp161s0f2
iface enp161s0f2 inet manual

auto enp161s0f3
iface enp161s0f3 inet manual

auto enp1s0f1
iface enp1s0f1 inet manual

auto enp1s0f2
iface enp1s0f2 inet manual

auto enp1s0f3
iface enp1s0f3 inet manual
#to modem

auto enp161s0f0
iface enp161s0f0 inet manual

iface enxbe3af2b6059f inet manual

auto enp1s0f0
iface enp1s0f0 inet manual

auto vmbr0
iface vmbr0 inet manual
    bridge-ports enp1s0f3
    bridge-stp off
    bridge-fd 0
#External (to modem)

auto vmbr1
iface vmbr1 inet static
    address 10.10.10.111/16
    gateway 10.10.10.254
    bridge-ports eno1np0 enp1s0f2
    bridge-stp off
    bridge-fd 0
#Internal
eno1np0 and eno2np1 are the 10G NICs that cause the problems.
This is the current configuration with the 1G NICs that works.
I'm at my wit's end.
Does anyone have any ideas on how I can find and solve the problem?
Many, many thanks
Andreas