I've been trawling the forums, but my search-fu seems to be lacking today. Here's the problem:
We had a nicely functional Proxmox VE 6 cluster. Identical machines, nice and new, with both local (lvm-thin) and NFS storage. Let's call them NODE1 and NODE2. Unfortunately, NODE2 ran into some really unusual hardware problems (it's a fairly new platform), and the manufacturer ended up replacing the whole system, disks and all. To make a shipping deadline, I did what I thought was necessary to remove the node, then scrubbed the disks and shipped it out. It took some time to get replaced, and when the new machine arrived, we were (and still are) shut down for COVAD-19 safety.
However, since we are an essential industry, I was called in this weekend for another matter, and managed to get the "replacement" NODE2 in the rack and remote access configured. Once I could, I remotely set it up, installed Proxmox VE 6, relicensed it, and then tried to add it back to the cluster. No love.
Establishing API connection with host '172.19.68.211'
Login succeeded.
Request addition of this node
TASK ERROR: 500 cluster not ready - no quorum?
I believe the old system is still showing up as part of the cluster - it's there in the list when I log into NODE1 with an "x" on its icon. If I click on NODE2 and try to get the status, I get:
tls_process_server_certificate: certificate verify failed (596)
If I go to the datacenter view, the cluster status shows as NODE2 being offline, and shows Quorate: no.
So, the question is: can I get NODE2 to come back into the cluster?
-or-
Since it's a two-node cluster, can I safely destroy it and re-create it?
Thanks for any help, and if this *is* an FAQ and I missed it, I apologize.
We had a nicely functional Proxmox VE 6 cluster. Identical machines, nice and new, with both local (lvm-thin) and NFS storage. Let's call them NODE1 and NODE2. Unfortunately, NODE2 ran into some really unusual hardware problems (it's a fairly new platform), and the manufacturer ended up replacing the whole system, disks and all. To make a shipping deadline, I did what I thought was necessary to remove the node, then scrubbed the disks and shipped it out. It took some time to get replaced, and when the new machine arrived, we were (and still are) shut down for COVAD-19 safety.
However, since we are an essential industry, I was called in this weekend for another matter, and managed to get the "replacement" NODE2 in the rack and remote access configured. Once I could, I remotely set it up, installed Proxmox VE 6, relicensed it, and then tried to add it back to the cluster. No love.
Establishing API connection with host '172.19.68.211'
Login succeeded.
Request addition of this node
TASK ERROR: 500 cluster not ready - no quorum?
I believe the old system is still showing up as part of the cluster - it's there in the list when I log into NODE1 with an "x" on its icon. If I click on NODE2 and try to get the status, I get:
tls_process_server_certificate: certificate verify failed (596)
If I go to the datacenter view, the cluster status shows as NODE2 being offline, and shows Quorate: no.
So, the question is: can I get NODE2 to come back into the cluster?
-or-
Since it's a two-node cluster, can I safely destroy it and re-create it?
Thanks for any help, and if this *is* an FAQ and I missed it, I apologize.