Error adding Node to the cluster

havanna

New Member
Nov 11, 2011
14
0
1
I installed Proxmox VE 2.0 beta on 2 boxes described in this post http://pve.proxmox.com/wiki/Install_Proxmox_VE_on_Debian_Squeeze what works fine. Installing drbd also works fine. Then i created a cluster on box11(192.168.0.11) described in this post http://pve.proxmox.com/wiki/Proxmox_VE_2.0_Cluster with the following command

pvecm create mycluster

what also works fine. Now i added a second node box12(192.168.0.12) to the cluster using putty box12, issuing the command

pvecm add 192.168.0.12

and i get the following error message:

root@box12:~/pve-kernel-2.6.32# pvecm add 192.168.0.12
Generating public/private rsa key pair.
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
3f:cb:d0:09:85:53:dc:8e:79:22:87:59:88:77:f5:47 root@box12
The key's randomart image is:
+--[ RSA 2048]----+
| . o.o. E|
| o |
+-----------------+
The authenticity of host '192.168.0.12 (192.168.0.12)' can't be established.
RSA key fingerprint is 7a:1b:52:33:18:80:f9:f1:34:ed:fd:dc:8e:91:10:1d.
Are you sure you want to continue connecting (yes/no)? yes
root@192.168.0.12's password:
I/O warning : failed to load external entity "/etc/pve/cluster.conf"
ccs_tool: Error: unable to parse requested configuration file

command 'ccs_tool lsnode -c /etc/pve/cluster.conf' failed: exit code 1
unable to add node: command failed (ssh 192.168.0.12 -o BatchMode=yes pvecm addnode box12 --force 1)
root@box12:~/#


What goes worng here? What did i do wrong?

Thank you in advance for your help.
 
Last edited:
yes, i also set up 2 debian hosts in oracle virtual box with the same error when adding a node to the cluster, but i prefer my 2 real boxes.

on box11 in the browser on services tab all services are running.
on box12 in the browser on services tab the service CMan is stopped and cannot be started.

I don't know what went wrong
 
root@box12:~# ls -l /etc/pve
insgesamt 2
-rw-r----- 1 root www-data 451 10. Nov 18:48 authkey.pub
-rw-r----- 1 root www-data 26 10. Nov 19:04 datacenter.cfg
lrwxr-x--- 1 root www-data 0 1. Jan 1970 local -> nodes/box12
drwxr-x--- 2 root www-data 0 10. Nov 18:48 nodes
lrwxr-x--- 1 root www-data 0 1. Jan 1970 openvz -> nodes/box12/openvz
drwx------ 2 root www-data 0 10. Nov 18:48 priv
-rw-r----- 1 root www-data 1533 10. Nov 18:48 pve-root-ca.pem
-rw-r----- 1 root www-data 1679 10. Nov 18:48 pve-www.key
lrwxr-x--- 1 root www-data 0 1. Jan 1970 qemu-server -> nodes/box12/qemu-server
root@box12:~#

root@box12:~# cat /etc/pve/cluster.conf
cat: /etc/pve/cluster.conf: Datei oder Verzeichnis nicht gefunden
root@box12:~#

Which package is in charge to install the cluster.conf file?
 
no file, the file does not exist on box12 what should be added to the cluster. on the promoted master (box11) the file exists with an entry for box11.
 
how can i debug pvecm? i have no idea. the 'pvecm add box12' command gives the following output:

root@box12:~# pvecm add box12
The authenticity of host 'box12 (192.168.0.12)' can't be established.
RSA key fingerprint is 7a:1b:52:33:18:80:f9:f1:34:ed:fd:dc:8e:91:10:1d.
Are you sure you want to continue connecting (yes/no)? yes
I/O warning : failed to load external entity "/etc/pve/cluster.conf"
ccs_tool: Error: unable to parse requested configuration file

command 'ccs_tool lsnode -c /etc/pve/cluster.conf' failed: exit code 1
unable to add node: command failed (ssh box12 -o BatchMode=yes pvecm addnode box12 --force 1)
root@box12:~#

cat /var/log/syslog says:

Nov 14 10:08:51 box12 pvestatd[1775]: start status update
Nov 14 10:08:51 box12 pvestatd[1775]: status update finished (0.005 seconds)
Nov 14 10:09:01 box12 /USR/SBIN/CRON[2086]: (root) CMD ( [ -x /usr/lib/php5/maxlifetime ] && [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -delete)
Nov 14 10:09:01 box12 pvestatd[1775]: start status update
Nov 14 10:09:01 box12 pvestatd[1775]: status update finished (0.005 seconds)

directory 'ls -l /var/lib/php5' is empty, 'cat /var/log/messages' also stays empty.
 
Last edited:
Please tell me how to debug 'pvecm'. if it helps i can give you a login so you can use putty to connect to my servers to look around what went wrong. you may also contact me by skype.

For me looks strange the bold text:
command 'ccs_tool lsnode -c /etc/pve/cluster.conf' failed: exit code 1
unable to add node: command failed (ssh box12 -o BatchMode=yes pvecm addnode box12 --force 1)
root@box12:~#

when i am on box12 with putty does it make sense to do a ssh to the box i am still on?

Thanks again for helping me.
 
Last edited:
I installed Proxmox VE 2.0 beta on 2 boxes described in this post http://pve.proxmox.com/wiki/Install_Proxmox_VE_on_Debian_Squeeze what works fine. Installing drbd also works fine. Then i created a cluster on box11(192.168.0.11) described in this post http://pve.proxmox.com/wiki/Proxmox_VE_2.0_Cluster with the following command

pvecm create mycluster

what also works fine. Now i added a second node box12(192.168.0.12) to the cluster using putty box12, issuing the command

pvecm add 192.168.0.12

You add the new node to the cluster by typing
pvecm add 192.168.0.11

I made the same mistake a while ago. It's a little counterintuitive that you need to "add" the original node and not the new one. Maybe that point should be clarified in the documentation.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!