More head scratching
So I have got the Ceph network back on server1 and can now ping all the other 10.107 hosts.
However, there is some weird stuff. The physical onboard Ethernet port, which isn't used, had the 192.168.107.55 address that was mentioned in the heartbeat failure.
server1 and server2 are identical machines. Their interfaces files were identical, but the ip a output is not.
This is the interfaces file from server2:
auto lo
iface lo inet loopback

iface enp4s0f0 inet manual

iface enp0s31f6 inet manual

iface enp4s0f1 inet manual

auto vmbr0
iface vmbr0 inet static
        address 192.168.107.2/24
        gateway 192.168.107.254
        bridge-ports enp4s0f0
        bridge-stp off
        bridge-fd 0
        bridge-vlan-aware yes
        bridge-vids 2-4094

auto vmbr1
iface vmbr1 inet static
        address 10.107.0.2/16
        bridge-ports enp4s0f1
        bridge-stp off
        bridge-fd 0
        bridge-vlan-aware yes
        bridge-vids 2-4094
Output from ip a on server2:
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: enp0s31f6: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 6c:2b:59:cd:5d:b8 brd ff:ff:ff:ff:ff:ff
3: enp4s0f0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP group default qlen 1000
link/ether 2c:27:d7:4f:8f:f0 brd ff:ff:ff:ff:ff:ff
4: enp4s0f1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr1 state UP group default qlen 1000
link/ether 2c:27:d7:4f:8f:f4 brd ff:ff:ff:ff:ff:ff
5: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 2c:27:d7:4f:8f:f0 brd ff:ff:ff:ff:ff:ff
inet 192.168.107.2/24 scope global vmbr0
valid_lft forever preferred_lft forever
inet6 fe80::2e27:d7ff:fe4f:8ff0/64 scope link
valid_lft forever preferred_lft forever
6: vmbr1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 2c:27:d7:4f:8f:f4 brd ff:ff:ff:ff:ff:ff
inet 10.107.0.2/16 scope global vmbr1
valid_lft forever preferred_lft forever
inet6 fe80::2e27:d7ff:fe4f:8ff4/64 scope link
valid_lft forever preferred_lft forever
This is what I would expect. server1, however, shows this:
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: enp0s31f6: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast state DOWN group default qlen 1000
link/ether d8:9e:f3:3b:97:9a brd ff:ff:ff:ff:ff:ff
3: enp4s0f0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP group default qlen 1000
link/ether c4:34:6b:cc:7b:70 brd ff:ff:ff:ff:ff:ff
inet 192.168.107.44/24 brd 192.168.107.255 scope global noprefixroute enp4s0f0
valid_lft forever preferred_lft forever
4: enp4s0f1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr1 state UP group default qlen 1000
link/ether c4:34:6b:cc:7b:74 brd ff:ff:ff:ff:ff:ff
inet 192.168.107.55/24 brd 192.168.107.255 scope global noprefixroute enp4s0f1
valid_lft forever preferred_lft forever
5: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether c4:34:6b:cc:7b:70 brd ff:ff:ff:ff:ff:ff
inet 192.168.107.1/24 scope global vmbr0
valid_lft forever preferred_lft forever
inet 192.168.107.44/24 brd 192.168.107.255 scope global secondary noprefixroute vmbr0
valid_lft forever preferred_lft forever
inet6 fe80::1c99:1e0f:68d5:62cd/64 scope link
valid_lft forever preferred_lft forever
6: vmbr1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether c4:34:6b:cc:7b:74 brd ff:ff:ff:ff:ff:ff
inet 10.107.0.1/16 scope global vmbr1
valid_lft forever preferred_lft forever
inet 192.168.107.55/24 brd 192.168.107.255 scope global noprefixroute vmbr1
valid_lft forever preferred_lft forever
inet6 fe80::e64f:3af4:b429:26cb/64 scope link
valid_lft forever preferred_lft forever
I have no idea why the additional IP addresses have attached themselves to the NICs and bridges.
Don't forget I have two other 'ghost' servers which are communicating fine.
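To make the comparison less error-prone than reading the two dumps by eye, here is a rough Python sketch that lists every IPv4 address present in the ip a output but not declared in the interfaces file, and notes whether it carries the noprefixroute flag. The function names are made up for illustration. SAMPLE_INTERFACES is an assumption (server1's file is not shown above, so it mirrors server2's layout with the .1 addresses seen in the ip a output), and SAMPLE_IP_A is a trimmed excerpt of the server1 output. As far as I know, noprefixroute is usually set by a DHCP client or network manager such as dhcpcd or NetworkManager rather than by ifupdown, so flagged addresses may point at a second service configuring the NICs, but that is a guess for this machine, not a diagnosis.

```python
import re

def declared_addresses(interfaces_text):
    """Map each iface stanza to the set of 'address' values declared under it."""
    declared = {}
    current = None
    for line in interfaces_text.splitlines():
        parts = line.split()
        if not parts:
            continue
        if parts[0] == "iface" and len(parts) >= 2:
            current = parts[1]
            declared.setdefault(current, set())
        elif parts[0] == "address" and current is not None and len(parts) >= 2:
            declared[current].add(parts[1])
    return declared

def live_addresses(ip_a_text):
    """Return (iface, cidr, has_noprefixroute) for each IPv4 line in `ip a` output."""
    live = []
    current = None
    for line in ip_a_text.splitlines():
        header = re.match(r"(\d+):\s+([^:@\s]+)[:@]", line.strip())
        if header:
            current = header.group(2)
            continue
        parts = line.split()
        if parts and parts[0] == "inet" and current is not None and len(parts) >= 2:
            live.append((current, parts[1], "noprefixroute" in parts))
    return live

def unexpected_addresses(interfaces_text, ip_a_text):
    """IPv4 addresses that are live but not declared (loopback is skipped)."""
    declared = declared_addresses(interfaces_text)
    return [
        (iface, cidr, flagged)
        for iface, cidr, flagged in live_addresses(ip_a_text)
        if iface != "lo" and cidr not in declared.get(iface, set())
    ]

# Assumed content of server1's interfaces file (not shown in the post).
SAMPLE_INTERFACES = """\
iface enp4s0f0 inet manual
auto vmbr0
iface vmbr0 inet static
address 192.168.107.1/24
bridge-ports enp4s0f0
"""

# Trimmed excerpt of the server1 `ip a` output from the post.
SAMPLE_IP_A = """\
3: enp4s0f0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP group default qlen 1000
link/ether c4:34:6b:cc:7b:70 brd ff:ff:ff:ff:ff:ff
inet 192.168.107.44/24 brd 192.168.107.255 scope global noprefixroute enp4s0f0
5: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
inet 192.168.107.1/24 scope global vmbr0
inet 192.168.107.44/24 brd 192.168.107.255 scope global secondary noprefixroute vmbr0
"""

if __name__ == "__main__":
    for iface, cidr, flagged in unexpected_addresses(SAMPLE_INTERFACES, SAMPLE_IP_A):
        note = "noprefixroute" if flagged else "no noprefixroute flag"
        print(f"{iface}: {cidr} is not in the interfaces file ({note})")
```

To run it against the real data, paste the contents of /etc/network/interfaces and the full ip a output into the two strings. On server2 the same check should come back empty, since every address there matches the file.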