Hi,
I have a 13-node cluster running PVE 5.1. I have just rebuilt a server to add to the cluster, but adding it fails every time:
Code:
[11:45 root@pve14:~]# pvecm add pve1 -f
can't create shared ssh key database '/etc/pve/priv/authorized_keys'
copy corosync auth key
stopping pve-cluster service
backup old database
delete old backup '/var/lib/pve-cluster/backup/config-1516010417.sql.gz'
Job for corosync.service failed because the control process exited with error code.
See "systemctl status corosync.service" and "journalctl -xe" for details.
waiting for quorum...
This then hangs forever. Corosync refuses to start every time:
Code:
[11:50 root@pve14:~]# systemctl status corosync.service
● corosync.service - Corosync Cluster Engine
Loaded: loaded (/lib/systemd/system/corosync.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Mon 2018-01-15 11:48:41 GMT; 1min 27s ago
Docs: man:corosync
man:corosync.conf
man:corosync_overview
Process: 6135 ExecStart=/usr/sbin/corosync -f $COROSYNC_OPTIONS (code=exited, status=20)
Main PID: 6135 (code=exited, status=20)
CPU: 72ms
Jan 15 11:48:41 pve14 corosync[6135]: info [WD ] no resources configured.
Jan 15 11:48:41 pve14 corosync[6135]: notice [SERV ] Service engine loaded: corosync watchdog service [7]
Jan 15 11:48:41 pve14 corosync[6135]: notice [QUORUM] Using quorum provider corosync_votequorum
Jan 15 11:48:41 pve14 corosync[6135]: crit [QUORUM] Quorum provider: corosync_votequorum failed to initialize.
Jan 15 11:48:41 pve14 corosync[6135]: error [SERV ] Service engine 'corosync_quorum' failed to load for reason 'configuration error: nodelist or quorum.expected_votes must be configured!'
Jan 15 11:48:41 pve14 corosync[6135]: error [MAIN ] Corosync Cluster Engine exiting with status 20 at service.c:356.
Jan 15 11:48:41 pve14 systemd[1]: corosync.service: Main process exited, code=exited, status=20/n/a
Jan 15 11:48:41 pve14 systemd[1]: Failed to start Corosync Cluster Engine.
Jan 15 11:48:41 pve14 systemd[1]: corosync.service: Unit entered failed state.
Jan 15 11:48:41 pve14 systemd[1]: corosync.service: Failed with result 'exit-code'.
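The error above says corosync wants either a nodelist or quorum.expected_votes in its config. For anyone wanting to check the same thing on their own node, here is a minimal sketch of a rough textual check against the contents of a corosync.conf. The function name and the sample strings are my own for illustration and are not part of any Proxmox or corosync tooling. It only looks for the relevant keys and is no substitute for corosync's own parser.

```python
import re


def has_votequorum_config(conf_text: str) -> bool:
    """Rough check: does the text define a nodelist with at least one
    node entry, or an expected_votes setting?"""
    # Ignore comment lines so commented-out settings do not count.
    lines = [ln for ln in conf_text.splitlines()
             if not ln.strip().startswith("#")]
    text = "\n".join(lines)

    # A "nodelist {" section that contains at least one "node {" entry.
    has_nodelist = (
        re.search(r"^\s*nodelist\s*\{", text, re.MULTILINE) is not None
        and re.search(r"^\s*node\s*\{", text, re.MULTILINE) is not None
    )
    # An "expected_votes: N" line (normally inside the quorum section).
    has_expected = (
        re.search(r"^\s*expected_votes\s*:", text, re.MULTILINE) is not None
    )
    return has_nodelist or has_expected


# Hypothetical sample configs, made up for illustration only.
sample_with_nodelist = """
nodelist {
  node {
    name: pve1
    nodeid: 1
    quorum_votes: 1
    ring0_addr: 10.168.3.1
  }
}
quorum {
  provider: corosync_votequorum
}
"""

sample_without = """
quorum {
  provider: corosync_votequorum
}
totem {
  version: 2
}
"""

print(has_votequorum_config(sample_with_nodelist))
print(has_votequorum_config(sample_without))
```

To run it against a real node, pass in the text of the local config file (on PVE this is normally /etc/corosync/corosync.conf) instead of the sample strings.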
Cluster status:
Code:
[11:50 root@pve1:~]# pvecm status
Quorum information
------------------
Date: Mon Jan 15 11:50:56 2018
Quorum provider: corosync_votequorum
Nodes: 13
Node ID: 0x00000001
Ring ID: 1/2176
Quorate: Yes
Votequorum information
----------------------
Expected votes: 13
Highest expected: 13
Total votes: 13
Quorum: 7
Flags: Quorate
Membership information
----------------------
Nodeid Votes Name
0x00000001 1 10.168.3.1 (local)
0x00000004 1 10.168.3.2
0x00000005 1 10.168.3.3
0x00000006 1 10.168.3.4
0x00000007 1 10.168.3.5
0x00000009 1 10.168.3.6
0x00000008 1 10.168.3.7
0x0000000d 1 10.168.3.8
0x0000000a 1 10.168.3.9
0x00000002 1 10.168.3.10
0x00000003 1 10.168.3.11
0x0000000b 1 10.168.3.12
0x0000000c 1 10.168.3.13
Any ideas how I can get this box to join the cluster?