Hi All
I have a 4 blade cluster I'm trying to setup and a I can not get past the first step after install, creating the cluster.
hosts are:
blade1 - 10.5.1.201
blade3 - 10.5.1.203
blade5 - 10.5.1.205
blade7 - 10.5.1.207
I've added these as host records in the hosts file and I'm able to ssh between the systems and they all have access to the internet and NTP is setup to sync the time. I have purchased and install the enterprise licenses and done the upgrades.
I ran the create cluster command
pvecm create Jazmin-Adl
Let that finished then ran
pvecm add blade7
This took many mins to finish.
The problem is that from the CLI the two nodes have never shown up
There is no Quoratum
Attempting to add another node just gives
I have no idea what to try next. There does seem to be traffic between the nodes shown via tcpdump. The logs below show some sort of issue.
I have a 4 blade cluster I'm trying to setup and a I can not get past the first step after install, creating the cluster.
hosts are:
blade1 - 10.5.1.201
blade3 - 10.5.1.203
blade5 - 10.5.1.205
blade7 - 10.5.1.207
I've added these as host records in the hosts file and I'm able to ssh between the systems and they all have access to the internet and NTP is setup to sync the time. I have purchased and install the enterprise licenses and done the upgrades.
I ran the create cluster command
pvecm create Jazmin-Adl
Let that finished then ran
pvecm add blade7
This took many mins to finish.
The problem is that from the CLI the two nodes have never shown up
Code:
root@blade7:~# pvecm nodes
Membership information
----------------------
Nodeid Votes Name
1 1 blade7 (local)
root@blade5:~# pvecm nodes
Membership information
----------------------
Nodeid Votes Name
2 1 blade5 (local)
root@blade5:~#
There is no Quoratum
Code:
root@blade7:~# pvecm status
Quorum information
------------------
Date: Fri Nov 13 12:06:33 2015
Quorum provider: corosync_votequorum
Nodes: 1
Node ID: 0x00000001
Ring ID: 1580
Quorate: No
Votequorum information
----------------------
Expected votes: 2
Highest expected: 2
Total votes: 1
Quorum: 2 Activity blocked
Flags:
Membership information
----------------------
Nodeid Votes Name
0x00000001 1 10.1.5.207 (local)
Attempting to add another node just gives
Code:
root@blade1:~# pvecm add blade7
root@blade7's password:
unable to copy ssh ID
I have no idea what to try next. There does seem to be traffic between the nodes shown via tcpdump. The logs below show some sort of issue.
Code:
Nov 13 11:57:08 blade5 corosync[1171]: [MAIN ] Corosync Cluster Engine ('2.3.5'): started and ready to provide service.
Nov 13 11:57:08 blade5 corosync[1171]: [MAIN ] Corosync built-in features: augeas systemd pie relro bindnow
Nov 13 11:57:08 blade5 corosync[1172]: [TOTEM ] Initializing transport (UDP/IP Multicast).
Nov 13 11:57:08 blade5 corosync[1172]: [TOTEM ] Initializing transmit/receive security (NSS) crypto: aes256 hash: sha1
Nov 13 11:57:08 blade5 corosync[1172]: [TOTEM ] The network interface [10.1.5.205] is now up.
Nov 13 11:57:08 blade5 corosync[1172]: [SERV ] Service engine loaded: corosync configuration map access [0]
Nov 13 11:57:08 blade5 corosync[1172]: [QB ] server name: cmap
Nov 13 11:57:08 blade5 corosync[1172]: [SERV ] Service engine loaded: corosync configuration service [1]
Nov 13 11:57:08 blade5 corosync[1172]: [QB ] server name: cfg
Nov 13 11:57:08 blade5 corosync[1172]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 [2]
Nov 13 11:57:08 blade5 corosync[1172]: [QB ] server name: cpg
Nov 13 11:57:08 blade5 corosync[1172]: [SERV ] Service engine loaded: corosync profile loading service [4]
Nov 13 11:57:08 blade5 corosync[1172]: [QUORUM] Using quorum provider corosync_votequorum
Nov 13 11:57:08 blade5 corosync[1172]: [SERV ] Service engine loaded: corosync vote quorum service v1.0 [5]
Nov 13 11:57:08 blade5 corosync[1172]: [QB ] server name: votequorum
Nov 13 11:57:08 blade5 corosync[1172]: [SERV ] Service engine loaded: corosync cluster quorum service v0.1 [3]
Nov 13 11:57:08 blade5 corosync[1172]: [QB ] server name: quorum
Nov 13 11:57:08 blade5 corosync[1172]: [TOTEM ] A new membership (10.1.5.205:1572) was formed. Members joined: 2
Nov 13 11:57:08 blade5 corosync[1172]: [QUORUM] Members[1]: 2
Nov 13 11:57:08 blade5 corosync[1172]: [MAIN ] Completed service synchronization, ready to provide service.
Nov 13 11:57:08 blade5 corosync[1172]: [TOTEM ] Digest does not match
Nov 13 11:57:08 blade5 corosync[1172]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:57:08 blade5 corosync[1172]: [TOTEM ] Invalid packet data
Nov 13 11:57:09 blade5 corosync[1165]: Starting Corosync Cluster Engine (corosync): [ OK ]
Nov 13 11:57:09 blade5 corosync[1172]: [TOTEM ] Digest does not match
Nov 13 11:57:09 blade5 corosync[1172]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:57:09 blade5 corosync[1172]: [TOTEM ] Invalid packet data
Nov 13 11:57:09 blade5 corosync[1172]: [TOTEM ] Digest does not match
....
Nov 13 12:02:35 blade5 corosync[1172]: [TOTEM ] A new membership (10.1.5.205:1576) was formed. Members joined: 1
Nov 13 12:02:35 blade5 corosync[1172]: [QUORUM] This node is within the primary component and will provide service.
Nov 13 12:02:35 blade5 corosync[1172]: [QUORUM] Members[2]: 2 1
Nov 13 12:02:35 blade5 corosync[1172]: [MAIN ] Completed service synchronization, ready to provide service.
Nov 13 12:02:35 blade5 corosync[1172]: [TOTEM ] Digest does not match
Nov 13 12:02:35 blade5 corosync[1172]: [TOTEM ] Received message has invalid digest... ignoring.
....
Nov 13 12:02:46 blade5 corosync[1172]: [TOTEM ] Digest does not match
Nov 13 12:02:46 blade5 corosync[1172]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 12:02:46 blade5 corosync[1172]: [TOTEM ] Invalid packet data
Nov 13 12:03:07 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d
Nov 13 12:03:07 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d
Nov 13 12:03:07 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d 2e
Nov 13 12:03:07 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d 2e
Nov 13 12:03:07 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d 2e
Nov 13 12:03:07 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d 2e
Nov 13 12:03:07 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d 2e
....
Nov 13 12:03:08 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d 2e 2f
Nov 13 12:03:08 blade5 corosync[1172]: [TOTEM ] Retransmit List: 2c 2d 2e 2f
Nov 13 12:03:09 blade5 corosync[1172]: [TOTEM ] A processor failed, forming new configuration.
Nov 13 12:03:10 blade5 corosync[1172]: [TOTEM ] A new membership (10.1.5.205:1580) was formed. Members left: 1
Nov 13 12:03:10 blade5 corosync[1172]: [TOTEM ] Failed to receive the leave message. failed: 1
Nov 13 12:03:10 blade5 corosync[1172]: [QUORUM] This node is within the non-primary component and will NOT provide any services.
Nov 13 12:03:10 blade5 corosync[1172]: [QUORUM] Members[1]: 2
Nov 13 12:03:10 blade5 corosync[1172]: [MAIN ] Completed service synchronization, ready to provide service.
Code:
Nov 13 11:31:00 blade7 corosync[1300]: [MAIN ] Corosync Cluster Engine ('2.3.5'): started and ready to provide service.
Nov 13 11:31:00 blade7 corosync[1300]: [MAIN ] Corosync built-in features: augeas systemd pie relro bindnow
Nov 13 11:31:00 blade7 corosync[1301]: [TOTEM ] Initializing transport (UDP/IP Multicast).
Nov 13 11:31:00 blade7 corosync[1301]: [TOTEM ] Initializing transmit/receive security (NSS) crypto: aes256 hash: sha1
Nov 13 11:31:00 blade7 corosync[1301]: [TOTEM ] The network interface [10.1.5.207] is now up.
Nov 13 11:31:00 blade7 corosync[1301]: [SERV ] Service engine loaded: corosync configuration map access [0]
Nov 13 11:31:00 blade7 corosync[1301]: [QB ] server name: cmap
Nov 13 11:31:00 blade7 corosync[1301]: [SERV ] Service engine loaded: corosync configuration service [1]
Nov 13 11:31:00 blade7 corosync[1301]: [QB ] server name: cfg
Nov 13 11:31:00 blade7 corosync[1301]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 [2]
Nov 13 11:31:00 blade7 corosync[1301]: [QB ] server name: cpg
Nov 13 11:31:00 blade7 corosync[1301]: [SERV ] Service engine loaded: corosync profile loading service [4]
Nov 13 11:31:00 blade7 corosync[1301]: [QUORUM] Using quorum provider corosync_votequorum
Nov 13 11:31:00 blade7 corosync[1301]: [SERV ] Service engine loaded: corosync vote quorum service v1.0 [5]
Nov 13 11:31:00 blade7 corosync[1301]: [QB ] server name: votequorum
Nov 13 11:31:00 blade7 corosync[1301]: [SERV ] Service engine loaded: corosync cluster quorum service v0.1 [3]
Nov 13 11:31:00 blade7 corosync[1301]: [QB ] server name: quorum
Nov 13 11:31:00 blade7 corosync[1301]: [TOTEM ] A new membership (10.1.5.207:436) was formed. Members joined: 1
Nov 13 11:31:00 blade7 corosync[1301]: [QUORUM] Members[1]: 1
Nov 13 11:31:00 blade7 corosync[1301]: [MAIN ] Completed service synchronization, ready to provide service.
Nov 13 11:31:00 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:00 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:00 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:01 blade7 corosync[1294]: Starting Corosync Cluster Engine (corosync): [ OK ]
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:01 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:02 blade7 corosync[1301]: [TOTEM ] A new membership (10.1.5.207:440) was formed. Members
….
Nov 13 11:31:09 blade7 corosync[1301]: [MAIN ] Completed service synchronization, ready to provide service.
Nov 13 11:31:09 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:09 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:09 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] A new membership (10.1.5.207:464) was formed. Members
Nov 13 11:31:10 blade7 corosync[1301]: [QUORUM] Members[1]: 1
Nov 13 11:31:10 blade7 corosync[1301]: [MAIN ] Completed service synchronization, ready to provide service.
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:10 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:31:11 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:31:12 blade7 corosync[1301]: [TOTEM ] A new membership (10.1.5.207:468) was formed. Members
Nov 13 11:31:12 blade7 corosync[1301]: [QUORUM] Members[1]: 1
Nov 13 11:31:12 blade7 corosync[1301]: [MAIN ] Completed service synchronization, ready to provide service.
….
Nov 13 11:32:55 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:32:55 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:32:55 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:32:56 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:32:57 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:32:57 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:32:57 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:32:57 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:32:57 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:32:57 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:32:58 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:32:58 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:32:58 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:32:58 blade7 corosync[1301]: [TOTEM ] Digest does not match
….
Nov 13 11:33:06 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:33:06 blade7 corosync[1301]: [TOTEM ] Digest does not match
Nov 13 11:33:06 blade7 corosync[1301]: [TOTEM ] Received message has invalid digest... ignoring.
Nov 13 11:33:06 blade7 corosync[1301]: [TOTEM ] Invalid packet data
Nov 13 11:33:08 blade7 corosync[1301]: [TOTEM ] FAILED TO RECEIVE
Nov 13 11:33:09 blade7 corosync[1301]: [TOTEM ] A new membership (10.1.5.207:1580) was formed. Members left: 2
Nov 13 11:33:09 blade7 corosync[1301]: [TOTEM ] Failed to receive the leave message. failed: 2
Nov 13 11:33:09 blade7 corosync[1301]: [QUORUM] This node is within the non-primary component and will NOT provide any services.
Nov 13 11:33:09 blade7 corosync[1301]: [QUORUM] Members[1]: 1
Nov 13 11:33:09 blade7 corosync[1301]: [MAIN ] Completed service synchronization, ready to provide service.