Hello. I've set up a cluster with a physically separate cluster network, but it is not doing what I expect it to do.
I simulated a failure of the "primary network" (10.1.0.0/16) on node1, so that only the cluster network (172.16.0.0/24) remained working.
Please advise.
Here is my lab:
Code:
[node1 ~]# pvecm status
Cluster information
-------------------
Name:             clustest
Config Version:   3
Transport:        knet
Secure auth:      on

Quorum information
------------------
Date:             Fri Jun 19 11:08:54 2020
Quorum provider:  corosync_votequorum
Nodes:            3
Node ID:          0x00000001
Ring ID:          1.8b
Quorate:          Yes

Votequorum information
----------------------
Expected votes:   3
Highest expected: 3
Total votes:      3
Quorum:           2
Flags:            Quorate

Membership information
----------------------
    Nodeid      Votes Name
0x00000001          1 172.16.0.1 (local)
0x00000002          1 172.16.0.2
0x00000003          1 172.16.0.3
[node1 ~]#
[node1 ~]# cat /etc/pve/corosync.conf
logging {
  debug: off
  to_syslog: yes
}

nodelist {
  node {
    name: one
    nodeid: 1
    quorum_votes: 1
    ring0_addr: 172.16.0.1
    ring1_addr: 10.1.0.1
  }
  node {
    name: two
    nodeid: 2
    quorum_votes: 1
    ring0_addr: 172.16.0.2
    ring1_addr: 10.1.0.2
  }
  node {
    name: three
    nodeid: 3
    quorum_votes: 1
    ring0_addr: 172.16.0.3
    ring1_addr: 10.1.0.3
  }
}

quorum {
  provider: corosync_votequorum
}

totem {
  cluster_name: clustest
  config_version: 3
  interface {
    linknumber: 0
  }
  interface {
    linknumber: 1
  }
  ip_version: ipv4-6
  link_mode: passive
  secauth: on
  version: 2
}
[node1 ~]#
[node1 ~]# cat /etc/pve/datacenter.cfg
console: vv
keyboard: en-us
migration: network=10.1.0.0/16
[node1 ~]#
But, to my surprise, no resources were moved to other nodes.
In this situation the cluster thinks everything is OK (it can still communicate through the cluster network) and doesn't even try to migrate anything.
How should I configure an HA cluster so that, on such a failure, resources are moved to other nodes via the cluster network?
Preferably with a bandwidth limit, so that the migration traffic does not disrupt cluster communication on the cluster network.
Something like this in datacenter.cfg?
Code:
fallback_migration: network=172.16.0.0/24
fallback_bwlimit: migration=100000
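For comparison, a manual migration can apparently already be pointed at a specific network with a bandwidth cap. The sketch below only prints the command instead of running it. The options are the ones I understand `qm migrate` to accept; VMID 100, the helper name build_migrate_cmd and node "two" as the target are placeholders. It is untested with the primary network down, and it does not cover the automatic part of the question.

```shell
#!/bin/sh
# Hypothetical helper: print (do not execute) a qm command that would
# live-migrate one VM over the cluster network with a bandwidth cap.
build_migrate_cmd() {
    vmid="$1"                    # VM to move (placeholder value below)
    target="$2"                  # target node name as listed in corosync.conf
    net="${3:-172.16.0.0/24}"    # cluster network from the lab above
    bw="${4:-100000}"            # limit in KiB/s, same number as in the proposal
    echo "qm migrate $vmid $target --online --migration_network $net --bwlimit $bw"
}

# Dry run: show what would be executed for VM 100 moving to node "two".
build_migrate_cmd 100 two
```

What I am after is the same behaviour, but triggered by HA itself when the primary network fails.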