Hi,
do an upgrade today (2/3 nodes). Due the new openvswitch 2.6.0-2 the network-connectivity for all VMs was lost for an short time (app. 2 min.) on the first node.
And the upgrade was disruped due ssh-access via ovs-bridge
But on the second node, I use an IP which isn't controlled by ovs... and the whole node wasn't accessible.
Then I went to the console and I see, the nodes was rebooted (can be due ha-feature) - but all ovs-networks are down (restart openvswitch-service don't help).
I move some important VMs to another node and start the VMs there.
But one important VM use local storage - I try to use vzdump, but vzdump can't backup an VM in emergency situations. First, can't get lock (ok "pvesm expected 1" helps) but then the VM was tried to start, which failed, because the bridge wasn't there.
I think an -emergeny flag for vzdump was an good thing! (Without lock and starting VM - can work with shutdown client only I think).
I use "qm move_disk" to migrate to ceph and than move the config and start the VM on the other node.
Long story short - another reboot bring the ovs-bridge up again.
So be carefully by this update (perhaps disable HA before)
Udo
do an upgrade today (2/3 nodes). Due the new openvswitch 2.6.0-2 the network-connectivity for all VMs was lost for an short time (app. 2 min.) on the first node.
And the upgrade was disruped due ssh-access via ovs-bridge
But on the second node, I use an IP which isn't controlled by ovs... and the whole node wasn't accessible.
Then I went to the console and I see, the nodes was rebooted (can be due ha-feature) - but all ovs-networks are down (restart openvswitch-service don't help).
I move some important VMs to another node and start the VMs there.
But one important VM use local storage - I try to use vzdump, but vzdump can't backup an VM in emergency situations. First, can't get lock (ok "pvesm expected 1" helps) but then the VM was tried to start, which failed, because the bridge wasn't there.
I think an -emergeny flag for vzdump was an good thing! (Without lock and starting VM - can work with shutdown client only I think).
I use "qm move_disk" to migrate to ceph and than move the config and start the VM on the other node.
Long story short - another reboot bring the ovs-bridge up again.
So be carefully by this update (perhaps disable HA before)
Udo