Just created a new 4.0 test cluster with boxes that had authority keys in place and password-less ssh root access between clean installed+patched boxes.
On first node I created a new cluster with 'pvecm create clustername' and adding 2. node seemed to take like forever and eventually I interrupted that,
wonder how to recover from this 'Activity blocked' state and my nodes?
On node 2 I got this:
This is found in /var/log/daemon.log:
After a reboot n2 says:
Any hints appreciated, TIA
On first node I created a new cluster with 'pvecm create clustername' and adding 2. node seemed to take like forever and eventually I interrupted that,
wonder how to recover from this 'Activity blocked' state and my nodes?
Code:
root@n1:~# pvecm status
Quorum information
------------------
Date: Wed Sep 2 09:49:13 2015
Quorum provider: corosync_votequorum
Nodes: 1
Node ID: 0x00000001
Ring ID: 4
Quorate: No
Votequorum information
----------------------
Expected votes: 2
Highest expected: 2
Total votes: 1
Quorum: 2 Activity blocked
Flags:
Membership information
----------------------
Nodeid Votes Name
0x00000001 1 10.3.0.1 (local)
On node 2 I got this:
Code:
root@n2:~# pvecm add n1
copy corosync auth key
stopping pve-cluster service
backup old database
waiting for quorum...^C
root@n2:~# pvecm add 10.3.0.1
can't create shared ssh key database '/etc/pve/priv/authorized_keys'
authentication key already exists
root@n2:~# ping n1
PING n1 (10.3.0.1) 56(84) bytes of data.
64 bytes from n1 (10.3.0.1): icmp_seq=1 ttl=64 time=0.171 ms
64 bytes from n1 (10.3.0.1): icmp_seq=2 ttl=64 time=0.158 ms
^C
This is found in /var/log/daemon.log:
Code:
Sep 2 09:42:22 n2 pmxcfs[1406]: [main] notice: teardown filesystem
Sep 2 09:42:24 n2 pmxcfs[1406]: [main] notice: exit proxmox configuration filesystem (0)
Sep 2 09:42:24 n2 pmxcfs[3204]: [quorum] crit: quorum_initialize failed: 2
Sep 2 09:42:24 n2 pmxcfs[3204]: [quorum] crit: can't initialize service
Sep 2 09:42:24 n2 pmxcfs[3204]: [confdb] crit: cmap_initialize failed: 2
Sep 2 09:42:24 n2 pmxcfs[3204]: [confdb] crit: can't initialize service
Sep 2 09:42:24 n2 pmxcfs[3204]: [dcdb] crit: cpg_initialize failed: 2
Sep 2 09:42:24 n2 pmxcfs[3204]: [dcdb] crit: can't initialize service
Sep 2 09:42:24 n2 pmxcfs[3204]: [status] crit: cpg_initialize failed: 2
Sep 2 09:42:24 n2 pmxcfs[3204]: [status] crit: can't initialize service
Sep 2 09:42:24 n2 pve-ha-crm[1800]: ipcc_send_rec failed: Transport endpoint is not connected
Sep 2 09:42:24 n2 pve-ha-crm[1800]: ipcc_send_rec failed: Connection refused
Sep 2 09:42:24 n2 pve-ha-crm[1800]: ipcc_send_rec failed: Connection refused
Sep 2 09:42:24 n2 pve-ha-lrm[1810]: ipcc_send_rec failed: Transport endpoint is not connected
Sep 2 09:42:24 n2 pve-ha-lrm[1810]: ipcc_send_rec failed: Connection refused
Sep 2 09:42:24 n2 pve-ha-lrm[1810]: ipcc_send_rec failed: Connection refused
Sep 2 09:42:25 n2 corosync[3220]: [MAIN ] Corosync Cluster Engine ('2.3.4.22-8252'): started and ready to provide service.
Sep 2 09:42:25 n2 corosync[3220]: [MAIN ] Corosync built-in features: augeas systemd pie relro bindnow
Sep 2 09:42:25 n2 corosync[3222]: [TOTEM ] Initializing transport (UDP/IP Multicast).
Sep 2 09:42:25 n2 corosync[3222]: [TOTEM ] Initializing transmit/receive security (NSS) crypto: aes256 hash: sha1
Sep 2 09:42:25 n2 corosync[3222]: [TOTEM ] The network interface [10.3.0.2] is now up.
Sep 2 09:42:25 n2 corosync[3222]: [SERV ] Service engine loaded: corosync configuration map access [0]
Sep 2 09:42:25 n2 corosync[3222]: [QB ] server name: cmap
Sep 2 09:42:25 n2 corosync[3222]: [SERV ] Service engine loaded: corosync configuration service [1]
Sep 2 09:42:25 n2 corosync[3222]: [QB ] server name: cfg
Sep 2 09:42:25 n2 corosync[3222]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 [2]
Sep 2 09:42:25 n2 corosync[3222]: [QB ] server name: cpg
Sep 2 09:42:25 n2 corosync[3222]: [SERV ] Service engine loaded: corosync profile loading service [4]
Sep 2 09:42:25 n2 corosync[3222]: [QUORUM] Using quorum provider corosync_votequorum
Sep 2 09:42:25 n2 corosync[3222]: [SERV ] Service engine loaded: corosync vote quorum service v1.0 [5]
Sep 2 09:42:25 n2 corosync[3222]: [QB ] server name: votequorum
Sep 2 09:42:25 n2 corosync[3222]: [SERV ] Service engine loaded: corosync cluster quorum service v0.1 [3]
Sep 2 09:42:25 n2 corosync[3222]: [QB ] server name: quorum
Sep 2 09:42:25 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:4) was formed. Members joined: 2
Sep 2 09:42:25 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:25 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:25 n2 corosync[3214]: Starting Corosync Cluster Engine (corosync): [ OK ]
Sep 2 09:42:26 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:8) was formed. Members
Sep 2 09:42:26 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:26 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:28 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:12) was formed. Members
Sep 2 09:42:28 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:28 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:29 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:16) was formed. Members
Sep 2 09:42:29 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:29 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:30 n2 pmxcfs[3204]: [status] notice: update cluster info (cluster name test-pmx, version = 2)
Sep 2 09:42:31 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:20) was formed. Members
Sep 2 09:42:31 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:31 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:31 n2 pmxcfs[3204]: [dcdb] notice: members: 2/3204
Sep 2 09:42:31 n2 pmxcfs[3204]: [dcdb] notice: all data is up to date
Sep 2 09:42:31 n2 pmxcfs[3204]: [status] notice: members: 2/3204
Sep 2 09:42:31 n2 pmxcfs[3204]: [status] notice: all data is up to date
Sep 2 09:42:32 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:24) was formed. Members
Sep 2 09:42:32 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:32 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:34 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:28) was formed. Members
Sep 2 09:42:34 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:34 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:35 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:32) was formed. Members
Sep 2 09:42:35 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:35 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
Sep 2 09:42:36 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:36) was formed. Members
Sep 2 09:42:36 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:42:36 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
...
Sep 2 09:46:46 n2 corosync[3222]: [TOTEM ] A new membership (10.3.0.2:732) was formed. Members
Sep 2 09:46:46 n2 corosync[3222]: [QUORUM] Members[1]: 2
Sep 2 09:46:46 n2 corosync[3222]: [MAIN ] Completed service synchronization, ready to provide service.
root@n2:~# grep -c 'Members\[1\]: 2' /var/log/daemon.log
183
After a reboot n2 says:
Code:
from daemon.log:
...
Sep 2 10:27:31 n2 networking[1159]: done.
Sep 2 10:27:31 n2 pmxcfs[1416]: [quorum] crit: quorum_initialize failed: 2
Sep 2 10:27:31 n2 pmxcfs[1416]: [quorum] crit: can't initialize service
Sep 2 10:27:31 n2 pmxcfs[1416]: [confdb] crit: cmap_initialize failed: 2
Sep 2 10:27:31 n2 pmxcfs[1416]: [confdb] crit: can't initialize service
Sep 2 10:27:31 n2 pmxcfs[1416]: [dcdb] crit: cpg_initialize failed: 2
Sep 2 10:27:31 n2 pmxcfs[1416]: [dcdb] crit: can't initialize service
Sep 2 10:27:31 n2 pmxcfs[1416]: [status] crit: cpg_initialize failed: 2
Sep 2 10:27:31 n2 pmxcfs[1416]: [status] crit: can't initialize service
...
root@n2:~# pvecm status
Quorum information
------------------
Date: Wed Sep 2 10:28:39 2015
Quorum provider: corosync_votequorum
Nodes: 1
Node ID: 0x00000002
Ring ID: 892
Quorate: No
Votequorum information
----------------------
Expected votes: 2
Highest expected: 2
Total votes: 1
Quorum: 2 Activity blocked
Flags:
Membership information
----------------------
Nodeid Votes Name
0x00000002 1 10.3.0.2 (local)
Any hints appreciated, TIA