I have a cluster of 3 Proxmox servers that repeatedly seem to lose contact with one another. The switch they are connected to supports multicast, and it is indeed enabled.
When I attempt to log on, I get this:
Note that it will not let me select a Realm to log on with.
Only one of the three servers lets me log on to the web interface, and it does not show communication to the other two servers.
If I reboot the cluster, the nodes work fine for a while, but then lose contact with one another after a random amount of time.
The VMs still appear to be running.
What is my next step in troubleshooting this?
# pveversion -v
proxmox-ve: 4.4-78 (running kernel: 4.4.35-2-pve)
pve-manager: 4.4-5 (running version: 4.4-5/c43015a5)
pve-kernel-4.4.35-1-pve: 4.4.35-77
pve-kernel-4.4.35-2-pve: 4.4.35-78
lvm2: 2.02.116-pve3
corosync-pve: 2.4.0-1
libqb0: 1.0-1
pve-cluster: 4.0-48
qemu-server: 4.0-102
pve-firmware: 1.1-10
libpve-common-perl: 4.0-85
libpve-access-control: 4.0-19
libpve-storage-perl: 4.0-71
pve-libspice-server1: 0.12.8-1
vncterm: 1.2-1
pve-docs: 4.4-1
pve-qemu-kvm: 2.7.1-1
pve-container: 1.0-90
pve-firewall: 2.0-33
pve-ha-manager: 1.0-38
ksm-control-daemon: 1.2-1
glusterfs-client: 3.5.2-2+deb8u3
lxc-pve: 2.0.6-5
lxcfs: 2.0.5-pve2
criu: 1.6.0-1
novnc-pve: 0.5-8
smartmontools: 6.5+svn4324-1~pve80
zfsutils: 0.6.5.8-pve13~bpo80