[SOLVED] the cluster does not see the node

Maksimus

Member
May 16, 2022
78
3
13
The cluster consists of 2 nodes, one node is not active, an error is generated
hostname lookup 'Host018' failed - failed to get address info for: Host018: Name or service not known (500)
in fact, the node is turned on and working, there is access via ssh
The name of the node did not change, the ip did not change, the dns record did not change either.
In the files, the node name is correct
etc\hosts
etc\hostname
PVE on both nodes updated to the latest version
 
I forgot to say that there is a VM on the node, which worked successfully before restarting the node.

systemctl status corosync
● corosync.service - Corosync Cluster Engine
Loaded: loaded (/lib/systemd/system/corosync.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Thu 2022-08-25 12:21:18 MSK; 1h 11min ago
Docs: man:corosync
man:corosync.conf
man:corosync_overview
Process: 1216 ExecStart=/usr/sbin/corosync -f $COROSYNC_OPTIONS (code=exited, status=0/SUCCESS)
Process: 1270 ExecStop=/usr/sbin/corosync-cfgtool -H --force (code=exited, status=1/FAILURE)
Main PID: 1216 (code=exited, status=0/SUCCESS)
CPU: 89ms

Aug 25 12:21:17 Host018 corosync[1216]: [QB ] withdrawing server sockets
Aug 25 12:21:17 Host018 corosync[1216]: [SERV ] Service engine unloaded: corosync cluster closed process group service v1.01
Aug 25 12:21:17 Host018 corosync[1216]: [QB ] withdrawing server sockets
Aug 25 12:21:17 Host018 corosync[1216]: [SERV ] Service engine unloaded: corosync cluster quorum service v0.1
Aug 25 12:21:17 Host018 corosync[1216]: [SERV ] Service engine unloaded: corosync profile loading service
Aug 25 12:21:17 Host018 corosync[1216]: [SERV ] Service engine unloaded: corosync resource monitoring service
Aug 25 12:21:17 Host018 corosync[1216]: [SERV ] Service engine unloaded: corosync watchdog service
Aug 25 12:21:18 Host018 corosync[1216]: [MAIN ] Corosync Cluster Engine exiting normally
Aug 25 12:21:18 Host018 systemd[1]: corosync.service: Control process exited, code=exited, status=1/FAILURE
Aug 25 12:21:18 Host018 systemd[1]: corosync.service: Failed with result 'exit-code'.
 
Last edited:
found an error Cannot initialize CMAP service googled it and found a solution how to fix my cluster.
 
Please share your solution and mark the thread as solved, so others with the same problem know what to do.