I tried adding a node to our (existing) cluster of currently four machines.
We tried to do this through GUI.
GUI on new node stopped responding after it was restarting pve-cluster...something. (didn't grab a screenshot)
GUI didn't come back. Server is still reachable over SSH.
Node has been restarted, but this didn't fix.
Node seems to be added at least visually in existing cluster, but cluster shows new node as offline. Edit (17:47: This is only true for the node that i used to add it to.)
Cluster gui is also extremely slow or even times out until I stop corosync on new offending node.
I also had headaches adding the third and fourth machine to the cluster in a according to my memory similiar fashion, but after runnning pvecm updatecerts on these machines they came online and have worked flawlessly since. (at least as far as i've noticed)
This server refuses to run this command due to lack of quorum.
Pveproxy also complained in logs about not finding a certificate, so i looked deeper, found a topic on this forum that suggesting lowering quorum to 1 and then running the same command with -f might do something.
This seems to have worked, pveproxy also seems happy now, but the server still doesn't come online within the cluster.
(rebooted after this)
Gui of the new node does work now, however after logging it only gives errror: Connection error 401: permission denied - invalid PVE ticket.
Corosync on offending node logs these seemingly indefinately:
Corosync on node that i used to add the new node logs this:
Also this looks weird to me:
Is seems to see only itself?
They are in the same network.
The only thing i can feasibly think of performance wise is that the server is currently in parity procedure on RAID and it's therefore somewhat slower.
I'm running out of idea's of things to try and would appreciate some pointers.
We tried to do this through GUI.
GUI on new node stopped responding after it was restarting pve-cluster...something. (didn't grab a screenshot)
GUI didn't come back. Server is still reachable over SSH.
Node has been restarted, but this didn't fix.
Node seems to be added at least visually in existing cluster, but cluster shows new node as offline. Edit (17:47: This is only true for the node that i used to add it to.)
Cluster gui is also extremely slow or even times out until I stop corosync on new offending node.
I also had headaches adding the third and fourth machine to the cluster in a according to my memory similiar fashion, but after runnning pvecm updatecerts on these machines they came online and have worked flawlessly since. (at least as far as i've noticed)
This server refuses to run this command due to lack of quorum.
Pveproxy also complained in logs about not finding a certificate, so i looked deeper, found a topic on this forum that suggesting lowering quorum to 1 and then running the same command with -f might do something.
This seems to have worked, pveproxy also seems happy now, but the server still doesn't come online within the cluster.
(rebooted after this)
Gui of the new node does work now, however after logging it only gives errror: Connection error 401: permission denied - invalid PVE ticket.
Corosync on offending node logs these seemingly indefinately:
Code:
Oct 05 17:23:35 PX6PVE5 corosync[9250]: [TOTEM ] A new membership (5.4377) was formed. Members
Oct 05 17:23:35 PX6PVE5 corosync[9250]: [QUORUM] Members[1]: 5
Oct 05 17:23:35 PX6PVE5 corosync[9250]: [MAIN ] Completed service synchronization, ready to provide service.
Oct 05 17:23:42 PX6PVE5 corosync[9250]: [TOTEM ] A new membership (5.438b) was formed. Members
Oct 05 17:23:42 PX6PVE5 corosync[9250]: [QUORUM] Members[1]: 5
Oct 05 17:23:42 PX6PVE5 corosync[9250]: [MAIN ] Completed service synchronization, ready to provide service.
Oct 05 17:23:49 PX6PVE5 corosync[9250]: [TOTEM ] A new membership (5.439f) was formed. Members
Oct 05 17:23:49 PX6PVE5 corosync[9250]: [QUORUM] Members[1]: 5
Oct 05 17:23:49 PX6PVE5 corosync[9250]: [MAIN ] Completed service synchronization, ready to provide service.
Corosync on node that i used to add the new node logs this:
Code:
Oct 05 17:24:07 px6pve1 corosync[32918]: [TOTEM ] Token has not been received in 4516 ms
Oct 05 17:24:10 px6pve1 corosync[32918]: [TOTEM ] A new membership (1.43db) was formed. Members
Oct 05 17:24:12 px6pve1 corosync[32918]: [TOTEM ] Token has not been received in 2213 ms
Oct 05 17:24:15 px6pve1 corosync[32918]: [TOTEM ] Token has not been received in 4515 ms
Oct 05 17:24:17 px6pve1 corosync[32918]: [TOTEM ] A new membership (1.43ef) was formed. Members
Also this looks weird to me:
Code:
Oct 05 17:24:17 px6pve1 corosync[32918]: [QUORUM] Members[4]: 1 2 3 4
vs on the new node:
Oct 05 17:23:42 PX6PVE5 corosync[9250]: [QUORUM] Members[1]: 5
Is seems to see only itself?
They are in the same network.
The only thing i can feasibly think of performance wise is that the server is currently in parity procedure on RAID and it's therefore somewhat slower.
I'm running out of idea's of things to try and would appreciate some pointers.
Last edited: