[SOLVED] cluster reinstall and rejoin nodes

informant

Jan 31, 2012
Hi all, I have to reinstall a cluster. I have 6 nodes with many running VMs. How can I rejoin the nodes to the newly installed cluster without removing the VMs on the nodes? Any ideas?
I have the config files (corosync ...) as a backup.
https://pve.proxmox.com/pve-docs/pve-admin-guide.html#pvecm_separate_node_without_reinstall
^^ I have already done this on all nodes.
But I think a rejoin / re-add is the better way... I hope you have ideas.
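For readers following along, the linked section of the admin guide ("Separate a Node Without Reinstalling") boils down to a short command sequence. A minimal sketch of it is below. The `run` wrapper and the `DRY_RUN` switch are my own additions (not part of Proxmox) so that the script only prints the commands unless you explicitly disable the dry run.

```shell
#!/bin/sh
# Sketch of the "separate a node without reinstalling" steps from the admin guide.
# DRY_RUN is a hypothetical safety switch: with the default of 1, commands are
# only printed, nothing is executed.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '+ %s\n' "$*"
    else
        "$@"
    fi
}

separate_node() {
    run systemctl stop pve-cluster
    run systemctl stop corosync
    run pmxcfs -l                        # start the cluster filesystem in local mode
    run rm /etc/pve/corosync.conf
    run sh -c 'rm -r /etc/corosync/*'    # the glob is expanded by the inner shell
    run killall pmxcfs
    run systemctl start pve-cluster      # back to a normal, standalone pmxcfs
}
```

Calling `separate_node` prints the seven steps. Running them for real (`DRY_RUN=0 separate_node`, as root on the node) is destructive, so check the guide for your PVE version first.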
When I re-add / rejoin a node to the new cluster I get this error:
Code:
detected the following error(s):
* this host already contains virtual guests
TASK ERROR: Check if node may join a cluster failed!
After cleaning up with
Code:
rm /var/lib/corosync/*
I get this error message on rejoin:
Code:
Please enter superuser (root) password for 'myclusterip': myclusterpw
Establishing API connection with host 'myclusterip'
500 Can't connect myclusterip:8006 (hostname verification failed)
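One thing that can be tried for the "hostname verification failed" message is to pass the cluster certificate's fingerprint to `pvecm add` explicitly. A minimal sketch follows. It only prints the two commands. The function name `join_commands` and the example values are placeholders I made up, and this does not get around the "host already contains virtual guests" check.

```shell
#!/bin/sh
# Print the commands for a join with an explicit certificate fingerprint.
# Nothing is executed. Arguments: <cluster_ip> <fingerprint>
join_commands() {
    cluster_ip=$1
    fingerprint=$2
    # On a node of the new cluster: show the SHA-256 fingerprint of its API certificate.
    printf '%s\n' "openssl x509 -in /etc/pve/local/pve-ssl.pem -noout -fingerprint -sha256"
    # On the joining node: hand that fingerprint to pvecm instead of relying on
    # hostname verification.
    printf '%s\n' "pvecm add $cluster_ip --fingerprint $fingerprint"
}
```

For example, `join_commands myclusterip AA:BB:...` prints the `openssl` call to run on the cluster and the matching `pvecm add` call for the node.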
best regards.
 
I need help please. All VMs are running, but when I connect to each node separately by IP, my VMs are not shown :(
I have no idea how to get it working again.
I need the help of the members here, please.
Regards
 
Hi Informant,
I'm afraid it is not possible to join a new cluster with running VMs on your node.
I am currently planning the same kind of operation, and my understanding is that I need to back up these VMs with PBS first and then restore them onto the new node.
Regards
 
Backing up all VMs would take many weeks, so that's not a good idea. Why can't I add a node to a new cluster with the same IP and the same name? It is the same server, only with a new HDD and a freshly installed cluster. Can I restore files to re-add the nodes with their VMs? I have the cluster's /etc/pve directory as a backup.
 
Hmm, let me point you to the documentation (Proxmox VE Administration Guide):

5.4 Adding Nodes to the Cluster

Caution

All existing configuration in /etc/pve is overwritten when joining a cluster. In particular, a joining
node cannot hold any guests, since guest IDs could otherwise conflict, and the node will inherit the
cluster’s storage configuration. To join a node with existing guest, as a workaround, you can create
a backup of each guest (using vzdump) and restore it under a different ID after joining. If the
node’s storage layout differs, you will need to re-add the node’s storages, and adapt each storage’s
node restriction to reflect on which nodes the storage is actually available.
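To make the "guest IDs could otherwise conflict" part concrete, here is a small sketch that lists which guest IDs from a node's config backup are already used in the new cluster. It assumes the usual layout of `qemu-server/<vmid>.conf` and `lxc/<vmid>.conf` below each node directory. The function name and the two directory arguments are my own choices for illustration.

```shell
#!/bin/sh
# List guest IDs that exist both in a node's config backup and in the new cluster.
# Usage: conflicting_vmids <backup_node_dir> <cluster_nodes_dir>
conflicting_vmids() {
    backup_dir=$1      # e.g. a saved copy of /etc/pve/nodes/<node>
    cluster_dir=$2     # e.g. /etc/pve/nodes on the new cluster
    for conf in "$backup_dir"/qemu-server/*.conf "$backup_dir"/lxc/*.conf; do
        [ -e "$conf" ] || continue            # skip patterns that matched nothing
        vmid=$(basename "$conf" .conf)
        # A guest ID has to be unique across all nodes, VMs and containers alike.
        for used in "$cluster_dir"/*/qemu-server/"$vmid".conf \
                    "$cluster_dir"/*/lxc/"$vmid".conf; do
            if [ -e "$used" ]; then
                echo "$vmid"
                break
            fi
        done
    done
}
```

Any ID it prints would be a candidate for the workaround quoted above, i.e. a backup with `vzdump` and a restore under a different ID.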

Regards
 
I know, but I have a backup of /etc/pve.
But I can't add nodes to the cluster, see the error messages above ^^
At the moment I also can't add nodes to the new cluster. If files are overwritten that is no problem, I have backups, but I can't add the nodes. Normally I can add / rejoin them, put the backed-up config back in, and everything works fine again.
 
I’m not sure I fully understand:
In your new cluster, can you add a new node (different from your existing nodes), or does the issue arise only with your existing nodes ?
 
You might try to migrate the VMs one by one to a node in the new cluster (qm remote-migrate command). Shared storage (NFS) can help with this task:
* move the volume to the NFS storage
* qm remote-migrate
* migrate back to local storage
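The three steps above could look roughly like the sketch below. It only prints the commands. The function name and all argument values are placeholders, `qm remote-migrate` is still marked experimental as far as I know, and on older versions the disk move is spelled `qm move-disk`, so please check `man qm` on your version before relying on this.

```shell
#!/bin/sh
# Print the commands for: move disk to shared storage, remote-migrate, move back.
# Nothing is executed.
# Arguments: <vmid> <disk> <shared_storage> <endpoint> <bridge> <local_storage>
remote_migrate_commands() {
    vmid=$1; disk=$2; shared=$3; endpoint=$4; bridge=$5; local_storage=$6
    # 1. On the old node: move the volume to the shared (NFS) storage.
    printf '%s\n' "qm disk move $vmid $disk $shared"
    # 2. On the old node: migrate to the new cluster, keeping the same VMID.
    printf '%s\n' "qm remote-migrate $vmid $vmid '$endpoint' --target-bridge $bridge --target-storage $shared --online"
    # 3. On the new node: move the volume back to local storage.
    printf '%s\n' "qm disk move $vmid $disk $local_storage"
}
```

The endpoint is a string of the form `apitoken=PVEAPIToken=<user>@<realm>!<token>=<secret>,host=<address>,fingerprint=<fingerprint>`, so an API token has to be created on the new cluster first.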
 
@Fabxienx and all others...
I have to reinstall a cluster because the SSD died. I have a backup of /etc/pve.
I have also installed a new cluster on the old cluster server, on a new HDD, with the same IP and name.
I would like to add the old nodes to the new cluster,
but so far I haven't found a solution that works.
I have tested:
1. https://pve.proxmox.com/pve-docs/pve-admin-guide.html#pvecm_separate_node_without_reinstall
then rejoining the nodes: error
2. the same way, but following https://forum.proxmox.com/threads/joining-a-cluster-with-already-created-guests-vm.81064/post-388684
The problem is SSL. I also copied the cluster's SSL files to the nodes (the 2 SSL files under <node>) and then restarted the services.
This shows all old nodes in the new cluster, but offline and unknown, with the VMs greyed out. It is the same after restarting the cluster and the services, and the cluster GUI is very slow and the system hangs.
3. I copied corosync.conf to /etc/corosync and /etc/pve so that both are the same. Same issue after a reboot: everything is slow and not reachable in the GUI.
4. Joining the nodes on the command line with the fingerprint: same issue, all offline/unknown and slow.
I have no more ideas after about 3 days of testing. Normally it should work, but I don't know why it doesn't.
I hope you have more ideas to fix it.

PS: if I open the nodes with ip:8006 I can currently log in, but the VMs are only shown if I copy the old /etc/pve/nodes/<node> back to the node and restart the services. That is expected, because I removed it earlier in point 2.
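For anyone in the same situation, copying the guest configs back from a backup can be sketched like this. It assumes the backup has the same `qemu-server/` and `lxc/` subdirectories as the live node directory. The function name is made up, and it deliberately never overwrites a config that already exists.

```shell
#!/bin/sh
# Copy guest config files from a backup into the node's directory,
# skipping any file that already exists at the destination.
# Usage: restore_guest_configs <backup_node_dir> <live_node_dir>
restore_guest_configs() {
    backup_dir=$1     # saved copy of /etc/pve/nodes/<node>
    node_dir=$2       # live /etc/pve/nodes/<node>
    for kind in qemu-server lxc; do
        for conf in "$backup_dir/$kind"/*.conf; do
            [ -e "$conf" ] || continue        # no configs of this kind in the backup
            target="$node_dir/$kind/$(basename "$conf")"
            if [ -e "$target" ]; then
                echo "skip $target (already exists)"
            else
                mkdir -p "$node_dir/$kind"
                cp "$conf" "$target"
                echo "restored $target"
            fi
        done
    done
}
```

After that the services still need a restart, for example `systemctl restart pve-cluster pvedaemon pveproxy pvestatd`, before the guests show up in the GUI again.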

The best way for me would be to get all nodes working standalone again and then rejoin them with their VMs, but I haven't found a way in the last few days.

Regards
 
OK, I have found a solution to bring all nodes, with all VMs still running, into the new cluster without any problems. It uses the information above, but step by step and with clean configs.
 