Add node

Neb

Well-Known Member
Apr 27, 2017
35
0
46
30
Hi,

I've a proxmox cluster of 3 nodes with ceph and i would like to add a new node into this. However when i try to run "pvecm add <ip-address>" on the node that i want to add within my cluster, corosync does not work and it does not start.

Look like this :

PHP:
17:24:03 ~ # pvecm add 10.51.0.11                                                        root@px-node-4
node px-node-4 already defined
copy corosync auth key
stopping pve-cluster service
backup old database
Job for corosync.service failed. See 'systemctl status corosync.service' and 'journalctl -xn' for details.
waiting for quorum...

Why ? It's a fresh install.
syslog does not say more informations.

Result of "pveversion" from a node within my cluster:

PHP:
17:24:02 ~ # pveversion -v                                                                root@px-node-3
proxmox-ve: 4.4-88 (running kernel: 4.4.62-1-pve)
pve-manager: 4.4-13 (running version: 4.4-13/7ea56165)
pve-kernel-4.4.35-1-pve: 4.4.35-77
pve-kernel-4.4.59-1-pve: 4.4.59-87
pve-kernel-4.4.62-1-pve: 4.4.62-88
lvm2: 2.02.116-pve3
corosync-pve: 2.4.2-2~pve4+1
libqb0: 1.0.1-1
pve-cluster: 4.0-50
qemu-server: 4.0-110
pve-firmware: 1.1-11
libpve-common-perl: 4.0-95
libpve-access-control: 4.0-23
libpve-storage-perl: 4.0-76
pve-libspice-server1: 0.12.8-2
vncterm: 1.3-2
pve-docs: 4.4-4
pve-qemu-kvm: 2.7.1-4
pve-container: 1.0-100
pve-firewall: 2.0-33
pve-ha-manager: 1.0-40
ksm-control-daemon: 1.2-1
glusterfs-client: 3.5.2-2+deb8u3
lxc-pve: 2.0.7-4
lxcfs: 2.0.6-pve1
criu: 1.6.0-1
novnc-pve: 0.5-9
smartmontools: 6.5+svn4324-1~pve80
zfsutils: 0.6.5.9-pve15~bpo80
ceph: 10.2.7-1~bpo80+1


Result of "pveversion" that i want to include in my existant cluster :

PHP:
17:34:44 ~ # pveversion -v                                                               root@px-node-4
proxmox-ve: 4.4-88 (running kernel: 4.4.62-1-pve)
pve-manager: 4.4-13 (running version: 4.4-13/7ea56165)
pve-kernel-4.4.62-1-pve: 4.4.62-88
lvm2: 2.02.116-pve3
corosync-pve: 2.4.2-2~pve4+1
libqb0: 1.0.1-1
pve-cluster: 4.0-50
qemu-server: 4.0-110
pve-firmware: 1.1-11
libpve-common-perl: 4.0-95
libpve-access-control: 4.0-23
libpve-storage-perl: 4.0-76
pve-libspice-server1: 0.12.8-2
vncterm: 1.3-2
pve-docs: 4.4-4
pve-qemu-kvm: 2.7.1-4
pve-container: 1.0-100
pve-firewall: 2.0-33
pve-ha-manager: 1.0-40
ksm-control-daemon: 1.2-1
glusterfs-client: 3.5.2-2+deb8u3
lxc-pve: 2.0.7-4
lxcfs: 2.0.6-pve1
criu: 1.6.0-1
novnc-pve: 0.5-9
smartmontools: 6.5+svn4324-1~pve80
ceph: 10.2.7-1~bpo80+1

Moreover, NTP is configured on each node. I don't see what is wrong. Any idea please ?

Thanks you
 
> Any idea please ?
Maybe you should start with the hints these command gave you ?

pvecm add 10.51.0.11 root@px-node-4
--> node px-node-4 already defined

coult it be a wrong ip adress here ?

systemctl status corosync.service ?
 
>
pvecm add 10.51.0.11 root@px-node-4
--> node px-node-4 already defined

coult it be a wrong ip adress here ?

This is the good ip address. There is this message because I executed this command before and I forget to delete it then

>systemctl status corosync.service ?

Code:
● corosync.service - Corosync Cluster Engine
   Loaded: loaded (/lib/systemd/system/corosync.service; enabled)
   Active: failed (Result: exit-code) since mar. 2017-06-06 17:25:18 CEST; 18h ago
  Process: 5136 ExecStart=/usr/share/corosync/corosync start (code=exited, status=1/FAILURE)

juin 06 17:24:18 px-node-4 corosync[5146]: [MAIN  ] No configured qb.ipc_type. Using native ipc
juin 06 17:24:18 px-node-4 corosync[5146]: [QB    ] server name: cpg
juin 06 17:24:18 px-node-4 corosync[5146]: [SERV  ] Service engine loaded: corosync profile loading service [4]
juin 06 17:24:18 px-node-4 corosync[5146]: [MAIN  ] NOT Initializing IPC on pload [4]
juin 06 17:24:18 px-node-4 corosync[5146]: [QUORUM] Using quorum provider corosync_votequorum
juin 06 17:25:18 px-node-4 corosync[5136]: Starting Corosync Cluster Engine (corosync): [FAILED]
juin 06 17:25:18 px-node-4 systemd[1]: corosync.service: control process exited, code=exited status=1
juin 06 17:25:18 px-node-4 systemd[1]: Failed to start Corosync Cluster Engine.
juin 06 17:25:18 px-node-4 systemd[1]: Unit corosync.service entered failed state.
juin 06 17:43:45 px-node-4 systemd[1]: Stopped Corosync Cluster Engine.
------------------------------------------------------------