Hello,
I have searched the forums for a resolution but haven't found a solution.
Problem: unable to add a node to the cluster after an update.
Situation: an update to one of the nodes in the cluster (node 2) changed the kernel to 6.17, and network services no longer functioned on that node; node 1 is functioning fine. Node 2 is older hardware with legacy BIOS. Searched the forums and found an issue with 6.17 and a recommendation to pin a different kernel. RESOLVED the network issue by using proxmox-boot-tool to pin kernel 6.14. The node now functions in standalone mode.
However, when trying to join the cluster, I get an error that the host can't be resolved.
Prior to joining:
- cleared node 2 of all VMs/LXCs (clean build w/ updates and kernel pinned).
- both nodes running pve-manager 9.1.1
- both nodes can ping each other and the router
- can ssh into both nodes
- verified /etc/pve/nodes only has "local" node
- corosync.conf has no duplicate nodes and is created when the cluster is started.
- verified /etc/hosts file has correct local host entry on each node (pve1 xxx.xxx.xx.5 or pve2 xxx.xxx.xx.6):
127.0.0.1 localhost.localdomain localhost
192.168.86.5 pve1.lan pve1
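For anyone who wants to repeat the /etc/hosts check above, here is a minimal sketch. It assumes that what matters is the node name mapping to at least one non-loopback address; the helper names `resolve_from_hosts` and `has_usable_entry` are made up for illustration, and the sample text is just the two lines shown above. It only parses hosts-file text and does not query DNS or touch the cluster.

```python
import ipaddress


def resolve_from_hosts(hosts_text, name):
    """Return the IP addresses that a hosts-file text maps `name` to."""
    addresses = []
    for raw in hosts_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        fields = line.split()
        ip, names = fields[0], fields[1:]
        if name in names:
            addresses.append(ip)
    return addresses


def has_usable_entry(hosts_text, name):
    """True if `name` maps to at least one non-loopback address."""
    return any(
        not ipaddress.ip_address(ip).is_loopback
        for ip in resolve_from_hosts(hosts_text, name)
    )


# Sample taken from the pve1 hosts file shown above.
SAMPLE = """\
127.0.0.1 localhost.localdomain localhost
192.168.86.5 pve1.lan pve1
"""

print(has_usable_entry(SAMPLE, "pve1"))    # name listed with a LAN address
print(has_usable_entry(SAMPLE, "pve-c2"))  # name taken from the error message
```

On a real node the text would come from reading /etc/hosts, and the name to test would be whatever `hostname` prints on that node.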
Both nodes function in standalone mode, but I can't add node 2 to node 1 without getting this error:
TASK ERROR: 500 Can't connect to xx.xx.xx.xxx:8006
'/etc/pve/nodes/pve-c2/pve-ssl.pem' does not exist! (500)
Have tried 'pvecm updatecerts' on both nodes, but that hasn't worked.
Have been testing a 2-node cluster for a few months. Was able to update both nodes to PVE 9 (with some hiccups). Have been through failed-to-join-cluster errors before, but can't resolve this one.
Any help is appreciated.
Thank you