Hello,
I have a 4 node proxmox cluster. Currently I've tried to add a fourth node to no avail. I thought perhaps the issue was related to multicast, but I can do a multicast ping between all nodes in the cluster. I can mount the /etc/pve fileystem just fine. The fourth node also shows up in the UI but with a red light. I'm at a loss of where else to check or how to continue to troubleshoot this.
Below are some commands that I've run to try to troubleshoot the issue:
When I restart 'cman' on the fourth node to join the cluster I get:
The /etc/pve filesystem mounts fine as well:
When I do 'pvecm nodes' on one of the working nodes I get:
When I issue the same command on the fourth (not working) node I get:
I'm at a loss on how to continue to troubleshoot this. I thought if all the nodes could communicate with multicast they should be able to form the quorum. If anyone can point me in the right direction, it would be greatly appreciated!
I have a 4 node proxmox cluster. Currently I've tried to add a fourth node to no avail. I thought perhaps the issue was related to multicast, but I can do a multicast ping between all nodes in the cluster. I can mount the /etc/pve fileystem just fine. The fourth node also shows up in the UI but with a red light. I'm at a loss of where else to check or how to continue to troubleshoot this.
Below are some commands that I've run to try to troubleshoot the issue:
Code:
root@virt4-atl:/# asmping 239.192.37.213 virt2-atl
asmping joined (S,G) = (*,239.192.37.234)
pinging 10.10.155.11 from 10.10.155.13
unicast from 10.10.155.11, seq=1 dist=0 time=1.518 ms
multicast from 10.10.155.11, seq=1 dist=0 time=1.543 ms
unicast from 10.10.155.11, seq=2 dist=0 time=0.257 ms
multicast from 10.10.155.11, seq=2 dist=0 time=0.266 ms
unicast from 10.10.155.11, seq=3 dist=0 time=0.237 ms
multicast from 10.10.155.11, seq=3 dist=0 time=0.250 ms
--- 10.10.155.11 statistics ---
3 packets transmitted, time 2828 ms
unicast:
3 packets received, 0% packet loss
rtt min/avg/max/std-dev = 0.237/0.670/1.518/0.599 ms
multicast:
3 packets received, 0% packet loss since first mc packet (seq 1) recvd
rtt min/avg/max/std-dev = 0.250/0.686/1.543/0.606 ms
When I restart 'cman' on the fourth node to join the cluster I get:
Code:
starting cluster:
Checking if cluster has been disabled at boot... [ OK ]
Checking Network Manager... [ OK ]
Global setup... [ OK ]
Loading kernel modules... [ OK ]
Mounting configfs... [ OK ]
Starting cman... [ OK ]
Waiting for quorum... Timed-out waiting for cluster
The /etc/pve filesystem mounts fine as well:
Code:
/dev/fuse on /etc/pve type fuse (rw,nosuid,nodev,default_permissions,allow_other)
When I do 'pvecm nodes' on one of the working nodes I get:
Code:
root@virt2-atl:~# pvecm nodes
Node Sts Inc Joined Name
1 M 1148 2013-02-27 16:29:38 virt3-atl
2 M 1148 2013-02-27 16:29:38 virt2-atl
3 M 1148 2013-02-27 16:29:38 virt1-atl
4 X 0 virt4-atl
When I issue the same command on the fourth (not working) node I get:
Code:
root@virt4-atl:/# pvecm nodes
Node Sts Inc Joined Name
1 X 0 virt3-atl
2 X 0 virt2-atl
3 X 0 virt1-atl
4 M 96 2013-02-27 16:37:00 virt4-atl
I'm at a loss on how to continue to troubleshoot this. I thought if all the nodes could communicate with multicast they should be able to form the quorum. If anyone can point me in the right direction, it would be greatly appreciated!