I found some issue with corosync after node restart. When you reboot machine corosync is failing to start on one of nodes. Not sure why this is happening. Can somebody help me out with narrowing down where problem is?
systemctl status corosync
● corosync.service - Corosync Cluster Engine
Loaded: loaded (/lib/systemd/system/corosync.service; enabled)
Active: failed (Result: exit-code) since Fri 2016-06-03 14:42:28 IST; 17min ago
Process: 1946 ExecStart=/usr/share/corosync/corosync start (code=exited, status=1/FAILURE)
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [SERV ] Service engine loaded: corosync configuration service [1]
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [QB ] server name: cfg
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 [2]
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [QB ] server name: cpg
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [SERV ] Service engine loaded: corosync profile loading service [4]
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [QUORUM] Using quorum provider corosync_votequorum
Jun 03 14:42:28 proxmoxn2 corosync[1946]: Starting Corosync Cluster Engine (corosync): [FAILED]
Jun 03 14:42:28 proxmoxn2 systemd[1]: corosync.service: control process exited, code=exited status=1
Jun 03 14:42:28 proxmoxn2 systemd[1]: Failed to start Corosync Cluster Engine.
Jun 03 14:42:28 proxmoxn2 systemd[1]: Unit corosync.service entered failed state.
journalctl -xn
-- Logs begin at Fri 2016-06-03 14:41:15 IST, end at Fri 2016-06-03 14:59:50 IST. --
Jun 03 14:59:38 proxmoxn2 pmxcfs[1931]: [dcdb] crit: cpg_initialize failed: 2
Jun 03 14:59:38 proxmoxn2 pmxcfs[1931]: [status] crit: cpg_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [quorum] crit: quorum_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [confdb] crit: cmap_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [dcdb] crit: cpg_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [status] crit: cpg_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [quorum] crit: quorum_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [confdb] crit: cmap_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [dcdb] crit: cpg_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [status] crit: cpg_initialize failed: 2
pvecm status
Cannot initialize CMAP service
pvecm nodes
Cannot initialize CMAP service
root@proxmoxn2:~# cat /etc/hosts
127.0.0.1 localhost.localdomain localhost
192.168.106.234 proxmoxn2.servergate.local proxmoxn2 pvelocalhost
192.168.106.235 proxmoxn1.servergate.local proxmoxn1
192.168.106.231 proxmoxn3.servergate.local proxmoxn3
Thanks
Wojtek
systemctl status corosync
● corosync.service - Corosync Cluster Engine
Loaded: loaded (/lib/systemd/system/corosync.service; enabled)
Active: failed (Result: exit-code) since Fri 2016-06-03 14:42:28 IST; 17min ago
Process: 1946 ExecStart=/usr/share/corosync/corosync start (code=exited, status=1/FAILURE)
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [SERV ] Service engine loaded: corosync configuration service [1]
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [QB ] server name: cfg
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 [2]
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [QB ] server name: cpg
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [SERV ] Service engine loaded: corosync profile loading service [4]
Jun 03 14:41:27 proxmoxn2 corosync[1957]: [QUORUM] Using quorum provider corosync_votequorum
Jun 03 14:42:28 proxmoxn2 corosync[1946]: Starting Corosync Cluster Engine (corosync): [FAILED]
Jun 03 14:42:28 proxmoxn2 systemd[1]: corosync.service: control process exited, code=exited status=1
Jun 03 14:42:28 proxmoxn2 systemd[1]: Failed to start Corosync Cluster Engine.
Jun 03 14:42:28 proxmoxn2 systemd[1]: Unit corosync.service entered failed state.
journalctl -xn
-- Logs begin at Fri 2016-06-03 14:41:15 IST, end at Fri 2016-06-03 14:59:50 IST. --
Jun 03 14:59:38 proxmoxn2 pmxcfs[1931]: [dcdb] crit: cpg_initialize failed: 2
Jun 03 14:59:38 proxmoxn2 pmxcfs[1931]: [status] crit: cpg_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [quorum] crit: quorum_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [confdb] crit: cmap_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [dcdb] crit: cpg_initialize failed: 2
Jun 03 14:59:44 proxmoxn2 pmxcfs[1931]: [status] crit: cpg_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [quorum] crit: quorum_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [confdb] crit: cmap_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [dcdb] crit: cpg_initialize failed: 2
Jun 03 14:59:50 proxmoxn2 pmxcfs[1931]: [status] crit: cpg_initialize failed: 2
pvecm status
Cannot initialize CMAP service
pvecm nodes
Cannot initialize CMAP service
root@proxmoxn2:~# cat /etc/hosts
127.0.0.1 localhost.localdomain localhost
192.168.106.234 proxmoxn2.servergate.local proxmoxn2 pvelocalhost
192.168.106.235 proxmoxn1.servergate.local proxmoxn1
192.168.106.231 proxmoxn3.servergate.local proxmoxn3
Thanks
Wojtek