Hi,
Networking seems all fine (ping google, ssh connect) but host totally disconnect from time to time (quite every 2 days ... )and we have to reboot it, and networking.service fail to start ....
I'm running 4.15.18-27-pve with this kind of network config :
root@bks:~# ip link
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: eno3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
3: eno4: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr1 state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
4: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
5: vmbr1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
6: vmbr2: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN mode DEFAULT group default qlen 1000
link/ether 96:ed:50:b4:f2:c4 brd ff:ff:ff:ff:ff:ff
root@bks:~# ip address
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: eno3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
3: eno4: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr1 state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
4: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
inet 111.222.111.222/24 brd 111.222.333.255 scope global vmbr0
valid_lft forever preferred_lft forever
inet6 2001:2222:3333:4444::/64 scope global
valid_lft forever preferred_lft forever
inet6 fe80::ec4:7aff:fec3:53a6/64 scope link
valid_lft forever preferred_lft forever
5: vmbr1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
inet 10.1.0.3/24 brd 10.1.0.255 scope global vmbr1
valid_lft forever preferred_lft forever
inet6 fe80::ec4:7aff:fec3:53a7/64 scope link
valid_lft forever preferred_lft forever
6: vmbr2: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN group default qlen 1000
link/ether 96:ed:50:b4:f2:c4 brd ff:ff:ff:ff:ff:ff
inet 10.2.0.254/16 brd 10.2.255.255 scope global vmbr2
valid_lft forever preferred_lft forever
inet6 fe80::94ed:50ff:feb4:f2c4/64 scope link
valid_lft forever preferred_lft forever
root@bks:~# cat /etc/network/interfaces
auto lo
iface lo inet loopback
iface eno3 inet manual
iface eno4 inet manual
auto vmbr0
iface vmbr0 inet static
address 111.222.111.222
netmask 24
gateway 111.222.111.254
bridge-ports eno3
bridge-stp off
bridge-fd 0
auto vmbr1
iface vmbr1 inet static
address 10.1.0.3
netmask 24
bridge-ports eno4
bridge-stp off
bridge-fd 0
auto vmbr2
iface vmbr2 inet static
address 10.2.0.254
netmask 16
bridge-ports none
bridge-stp off
bridge-fd 0
post-up echo 1 > /proc/sys/net/ipv4/ip_forward
post-up iptables -t nat -A POSTROUTING -s '10.2.0.0/16' -o vmbr0 -j MASQUERADE
post-down iptables -t nat -D POSTROUTING -s '10.2.0.0/16' -o vmbr0 -j MASQUERADE
but networking.serivces is like that :
root@bks:~# systemctl status networking.service
networking.service - Raise network interfaces
Loaded: loaded (/lib/systemd/system/networking.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Fri 2020-04-17 15:20:33 CEST; 27min ago
Docs: man:interfaces(5)
Process: 1907 ExecStart=/sbin/ifup -a --read-environment (code=exited, status=1/FAILURE)
Process: 1740 ExecStartPre=/bin/sh -c [ "$CONFIGURE_INTERFACES" != "no" ] && [ -n "$(ifquery --read-environment --list --exclude=lo)" ] && udevadm settle (code=exited, status=0/SUCCESS)
Main PID: 1907 (code=exited, status=1/FAILURE)
CPU: 427ms
avril 17 15:20:31 bks systemd[1]: Starting Raise network interfaces...
avril 17 15:20:32 bks ifup[1907]: Waiting for vmbr0 to get ready (MAXWAIT is 2 seconds).
avril 17 15:20:32 bks ifup[1907]: RTNETLINK answers: File exists
avril 17 15:20:32 bks ifup[1907]: ifup: failed to bring up vmbr0
avril 17 15:20:33 bks ifup[1907]: Waiting for vmbr1 to get ready (MAXWAIT is 2 seconds).
avril 17 15:20:33 bks ifup[1907]: Waiting for vmbr2 to get ready (MAXWAIT is 2 seconds).
avril 17 15:20:33 bks systemd[1]: networking.service: Main process exited, code=exited, status=1/FAILURE
avril 17 15:20:33 bks systemd[1]: Failed to start Raise network interfaces.
avril 17 15:20:34 bks systemd[1]: networking.service: Unit entered failed state.
avril 17 15:20:34 bks systemd[1]: networking.service: Failed with result 'exit-code'.
I rebooted without vmbr2 and its masquerading without success ....
would you have any idea of what could go wrong ?
Jean-Michel
Networking seems all fine (ping google, ssh connect) but host totally disconnect from time to time (quite every 2 days ... )and we have to reboot it, and networking.service fail to start ....
I'm running 4.15.18-27-pve with this kind of network config :
root@bks:~# ip link
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: eno3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
3: eno4: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr1 state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
4: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
5: vmbr1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP mode DEFAULT group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
6: vmbr2: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN mode DEFAULT group default qlen 1000
link/ether 96:ed:50:b4:f2:c4 brd ff:ff:ff:ff:ff:ff
root@bks:~# ip address
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: eno3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
3: eno4: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr1 state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
4: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a6 brd ff:ff:ff:ff:ff:ff
inet 111.222.111.222/24 brd 111.222.333.255 scope global vmbr0
valid_lft forever preferred_lft forever
inet6 2001:2222:3333:4444::/64 scope global
valid_lft forever preferred_lft forever
inet6 fe80::ec4:7aff:fec3:53a6/64 scope link
valid_lft forever preferred_lft forever
5: vmbr1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 0c:c4:7a:c3:53:a7 brd ff:ff:ff:ff:ff:ff
inet 10.1.0.3/24 brd 10.1.0.255 scope global vmbr1
valid_lft forever preferred_lft forever
inet6 fe80::ec4:7aff:fec3:53a7/64 scope link
valid_lft forever preferred_lft forever
6: vmbr2: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN group default qlen 1000
link/ether 96:ed:50:b4:f2:c4 brd ff:ff:ff:ff:ff:ff
inet 10.2.0.254/16 brd 10.2.255.255 scope global vmbr2
valid_lft forever preferred_lft forever
inet6 fe80::94ed:50ff:feb4:f2c4/64 scope link
valid_lft forever preferred_lft forever
root@bks:~# cat /etc/network/interfaces
auto lo
iface lo inet loopback
iface eno3 inet manual
iface eno4 inet manual
auto vmbr0
iface vmbr0 inet static
address 111.222.111.222
netmask 24
gateway 111.222.111.254
bridge-ports eno3
bridge-stp off
bridge-fd 0
auto vmbr1
iface vmbr1 inet static
address 10.1.0.3
netmask 24
bridge-ports eno4
bridge-stp off
bridge-fd 0
auto vmbr2
iface vmbr2 inet static
address 10.2.0.254
netmask 16
bridge-ports none
bridge-stp off
bridge-fd 0
post-up echo 1 > /proc/sys/net/ipv4/ip_forward
post-up iptables -t nat -A POSTROUTING -s '10.2.0.0/16' -o vmbr0 -j MASQUERADE
post-down iptables -t nat -D POSTROUTING -s '10.2.0.0/16' -o vmbr0 -j MASQUERADE
but networking.serivces is like that :
root@bks:~# systemctl status networking.service
networking.service - Raise network interfaces
Loaded: loaded (/lib/systemd/system/networking.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Fri 2020-04-17 15:20:33 CEST; 27min ago
Docs: man:interfaces(5)
Process: 1907 ExecStart=/sbin/ifup -a --read-environment (code=exited, status=1/FAILURE)
Process: 1740 ExecStartPre=/bin/sh -c [ "$CONFIGURE_INTERFACES" != "no" ] && [ -n "$(ifquery --read-environment --list --exclude=lo)" ] && udevadm settle (code=exited, status=0/SUCCESS)
Main PID: 1907 (code=exited, status=1/FAILURE)
CPU: 427ms
avril 17 15:20:31 bks systemd[1]: Starting Raise network interfaces...
avril 17 15:20:32 bks ifup[1907]: Waiting for vmbr0 to get ready (MAXWAIT is 2 seconds).
avril 17 15:20:32 bks ifup[1907]: RTNETLINK answers: File exists
avril 17 15:20:32 bks ifup[1907]: ifup: failed to bring up vmbr0
avril 17 15:20:33 bks ifup[1907]: Waiting for vmbr1 to get ready (MAXWAIT is 2 seconds).
avril 17 15:20:33 bks ifup[1907]: Waiting for vmbr2 to get ready (MAXWAIT is 2 seconds).
avril 17 15:20:33 bks systemd[1]: networking.service: Main process exited, code=exited, status=1/FAILURE
avril 17 15:20:33 bks systemd[1]: Failed to start Raise network interfaces.
avril 17 15:20:34 bks systemd[1]: networking.service: Unit entered failed state.
avril 17 15:20:34 bks systemd[1]: networking.service: Failed with result 'exit-code'.
I rebooted without vmbr2 and its masquerading without success ....
would you have any idea of what could go wrong ?
Jean-Michel