Hi,
I have a Proxmox 2.1 cluster up and running for 2 Months. The last days I added a few nodes, and now the last one (the 11th) is not able to join the cluster. Network is fine, system clocks match, the other ten are running flawless. They're all blades devided up in 2 bladecenters which are connected to the same managed switches. I've already tried rebooting, removing the node and rejoining, joining at another node, etc.
When I'm trying to add the node I get this:
I've waited for a really long time (>30 min), but nothing happens any more at this point.
Here is what pveversion -v sais:
The only useful lines in the syslog are:
Maybe one has an idea, google and I don't :-(
Best regards
I have a Proxmox 2.1 cluster up and running for 2 Months. The last days I added a few nodes, and now the last one (the 11th) is not able to join the cluster. Network is fine, system clocks match, the other ten are running flawless. They're all blades devided up in 2 bladecenters which are connected to the same managed switches. I've already tried rebooting, removing the node and rejoining, joining at another node, etc.
When I'm trying to add the node I get this:
Code:
can't create shared ssh key database '/etc/pve/priv/authorized_keys'
copy corosync auth key
stopping pve-cluster service
Stopping pve cluster filesystem: pve-cluster.
backup old database
Starting pve cluster filesystem : pve-cluster.
Starting cluster:
Checking if cluster has been disabled at boot... [ OK ]
Checking Network Manager... [ OK ]
Global setup... [ OK ]
Loading kernel modules... [ OK ]
Mounting configfs... [ OK ]
Starting cman... [ OK ]
Waiting for quorum... Timed-out waiting for cluster
[FAILED]
waiting for quorum...
Here is what pveversion -v sais:
Code:
pve-manager: 2.1-14 (pve-manager/2.1/f32f3f46)
running kernel: 2.6.32-14-pve
proxmox-ve-2.6.32: 2.1-74
pve-kernel-2.6.32-11-pve: 2.6.32-66
pve-kernel-2.6.32-14-pve: 2.6.32-74
lvm2: 2.02.95-1pve2
clvm: 2.02.95-1pve2
corosync-pve: 1.4.3-1
openais-pve: 1.1.4-2
libqb: 0.10.1-2
redhat-cluster-pve: 3.1.92-3
resource-agents-pve: 3.9.2-3
fence-agents-pve: 3.1.8-1
pve-cluster: 1.0-27
qemu-server: 2.0-49
pve-firmware: 1.0-18
libpve-common-perl: 1.0-30
libpve-access-control: 1.0-24
libpve-storage-perl: 2.0-31
vncterm: 1.0-3
vzctl: 3.0.30-2pve5
vzprocps: 2.0.11-2
vzquota: 3.0.12-3
pve-qemu-kvm: 1.1-8
ksm-control-daemon: 1.1-1
Code:
Aug 30 12:28:56 proxmox-14 pmxcfs[1930]: [main] notice: teardown filesystem
Aug 30 12:28:58 proxmox-14 pmxcfs[1930]: [main] notice: exit proxmox configuration filesystem (0)
Aug 30 12:28:59 proxmox-14 pmxcfs[2367]: [status] notice: update cluster info (cluster name xyz, version = 19)
Aug 30 12:28:59 proxmox-14 pmxcfs[2367]: [dcdb] notice: members: 11/2367
Aug 30 12:28:59 proxmox-14 pmxcfs[2367]: [dcdb] notice: all data is up to date
Aug 30 12:28:59 proxmox-14 pmxcfs[2367]: [dcdb] notice: members: 11/2367
Aug 30 12:28:59 proxmox-14 pmxcfs[2367]: [dcdb] notice: all data is up to date
Aug 30 12:29:05 proxmox-14 pvestatd[1762]: WARNING: ipcc_send_rec failed: Transport endpoint is not connected
Best regards