I changed the IP of one node in the cluster as instructed here and rebooted. Now the node is out of the cluster and refuses to bring up any VM or CT.
The steps I've done are:
- Backed up all VMs/CTs.
- Set the VLAN tag on all the CTs/VMs. It was a VLAN + IP change.
- Updated /etc/network/interfaces with the new IP and the new VLAN settings.
- Changed the IP in /etc/hosts.
- Changed the IP in /etc/pve/corosync.conf and bumped the config version (config_version) by 1.
- Powered off.
- Changed the switch port to a trunk port (it was previously an untagged, fixed-VLAN port).
- Booted the node again.
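For anyone following along, the corosync.conf step can be sketched roughly as below. This is only an illustration: `bump_config_version` is a hypothetical helper name, not a Proxmox tool, and it assumes the file has a single `config_version: N` line inside the totem section. It prints the modified file instead of editing it in place, so the result can be reviewed first.

```shell
#!/bin/sh
# Sketch: print a corosync.conf with config_version increased by 1.
# bump_config_version is a hypothetical helper, not part of Proxmox or corosync.
bump_config_version() {
    awk '
        # Match a line such as "  config_version: 6" and rewrite only the number.
        /^[ \t]*config_version:[ \t]*[0-9]+[ \t]*$/ {
            match($0, /[0-9]+/)
            printf "%s%d\n", substr($0, 1, RSTART - 1), substr($0, RSTART, RLENGTH) + 1
            next
        }
        { print }   # every other line passes through unchanged
    ' "$1"
}

# Possible usage (assumption: write to a .new file, review it, then move it into place):
#   bump_config_version /etc/pve/corosync.conf > /etc/pve/corosync.conf.new
```

The IP change itself (the ring0_addr entry of the node) still has to be made by hand in the same file.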
Perhaps there is some new, simple step required in newer versions that I missed.
Can anyone please give me a hand to fix it?
Edit:
The changes made in corosync.conf have propagated to the other nodes in the cluster, but there is still no communication.
Bash:
# pvecm status
Cluster information
-------------------
Name: rainland
Config Version: 7
Transport: knet
Secure auth: on
Quorum information
------------------
Date: Tue Dec 20 21:57:30 2022
Quorum provider: corosync_votequorum
Nodes: 1
Node ID: 0x00000002
Ring ID: 2.6d0
Quorate: No
Votequorum information
----------------------
Expected votes: 4
Highest expected: 4
Total votes: 1
Quorum: 3 Activity blocked
Flags:
Membership information
----------------------
Nodeid Votes Name
0x00000002 1 10.1.3.22 (local)
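To spell out why activity is blocked in the output above: with 4 expected votes, votequorum needs a majority, and this node only sees its own vote. A minimal sketch of that arithmetic, where `votes_needed` and `is_quorate` are hypothetical helper names used only for illustration:

```shell
#!/bin/sh
# votes_needed: majority of the expected votes (integer division, then +1).
votes_needed() {
    echo $(( $1 / 2 + 1 ))
}

# is_quorate TOTAL EXPECTED: succeeds only if TOTAL reaches the majority.
is_quorate() {
    [ "$1" -ge "$(votes_needed "$2")" ]
}

votes_needed 4                                            # prints 3
is_quorate 1 4 && echo "quorate" || echo "not quorate"    # prints "not quorate"
```

So with "Total votes: 1" and "Expected votes: 4", the node stays below the required 3 votes until it can reach at least two other nodes.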
Log (in the last lines I overrode the quorum so I could start the services):
Bash:
Dec 20 20:16:29 core pmxcfs[1156]: [dcdb] notice: data verification successful
Dec 20 20:27:58 core pmxcfs[1156]: [status] notice: node lost quorum
Dec 20 20:27:58 core pmxcfs[1156]: [dcdb] notice: members: 2/1156
Dec 20 20:27:58 core pmxcfs[1156]: [status] notice: members: 2/1156
Dec 20 20:27:58 core pmxcfs[1156]: [dcdb] crit: received write while not quorate - trigger resync
Dec 20 20:27:58 core pmxcfs[1156]: [dcdb] crit: leaving CPG group
Dec 20 20:27:59 core pmxcfs[1156]: [dcdb] notice: start cluster connection
Dec 20 20:27:59 core pmxcfs[1156]: [dcdb] crit: cpg_join failed: 14
Dec 20 20:27:59 core pmxcfs[1156]: [dcdb] crit: can't initialize service
Dec 20 20:28:05 core pmxcfs[1156]: [dcdb] notice: members: 2/1156
Dec 20 20:28:05 core pmxcfs[1156]: [dcdb] notice: all data is up to date
Dec 20 20:37:22 core pmxcfs[1156]: [dcdb] notice: data verification successful
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: members: 1/894, 2/1156, 3/843, 4/2503
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: starting data syncronisation
Dec 20 20:41:13 core pmxcfs[1156]: [status] notice: members: 1/894, 2/1156, 3/843, 4/2503
Dec 20 20:41:13 core pmxcfs[1156]: [status] notice: starting data syncronisation
Dec 20 20:41:13 core pmxcfs[1156]: [status] notice: node has quorum
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: received sync request (epoch 1/894/00000012)
Dec 20 20:41:13 core pmxcfs[1156]: [status] notice: received sync request (epoch 1/894/0000000E)
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: received all states
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: leader is 1/894
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: synced members: 1/894, 3/843, 4/2503
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: waiting for updates from leader
Dec 20 20:41:13 core pmxcfs[1156]: [status] notice: received all states
Dec 20 20:41:13 core pmxcfs[1156]: [status] notice: all data is up to date
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: update complete - trying to commit (got 4 inode updates)
Dec 20 20:41:13 core pmxcfs[1156]: [dcdb] notice: all data is up to date
Dec 20 20:51:04 core pmxcfs[1156]: [status] notice: received log
Dec 20 21:06:04 core pmxcfs[1156]: [status] notice: received log
Dec 20 21:08:42 core pmxcfs[1156]: [status] notice: received log
Dec 20 21:08:42 core pmxcfs[1156]: [status] notice: received log
Dec 20 21:08:44 core pmxcfs[1156]: [status] notice: received log
Dec 20 21:09:49 core pmxcfs[1156]: [dcdb] notice: wrote new corosync config '/etc/corosync/corosync.conf' (version = 7)
Dec 20 21:09:50 core pmxcfs[1156]: [dcdb] crit: corosync-cfgtool -R failed with exit code 7#010
Dec 20 21:12:39 core pmxcfs[1156]: [confdb] crit: cmap_dispatch failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [quorum] crit: quorum_dispatch failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [status] notice: node lost quorum
Dec 20 21:12:39 core pmxcfs[1156]: [dcdb] crit: cpg_dispatch failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [dcdb] crit: cpg_leave failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [status] crit: cpg_dispatch failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [status] crit: cpg_leave failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [quorum] crit: quorum_initialize failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [quorum] crit: can't initialize service
Dec 20 21:12:39 core pmxcfs[1156]: [confdb] crit: cmap_initialize failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [confdb] crit: can't initialize service
Dec 20 21:12:39 core pmxcfs[1156]: [dcdb] notice: start cluster connection
Dec 20 21:12:39 core pmxcfs[1156]: [dcdb] crit: cpg_initialize failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [dcdb] crit: can't initialize service
Dec 20 21:12:39 core pmxcfs[1156]: [status] notice: start cluster connection
Dec 20 21:12:39 core pmxcfs[1156]: [status] crit: cpg_initialize failed: 2
Dec 20 21:12:39 core pmxcfs[1156]: [status] crit: can't initialize service
Dec 20 21:12:40 core systemd[1]: Stopping The Proxmox VE cluster filesystem...
Dec 20 21:12:40 core pmxcfs[1156]: [main] notice: teardown filesystem
Dec 20 21:12:41 core pmxcfs[1156]: [quorum] crit: quorum_finalize failed: 9
Dec 20 21:12:41 core pmxcfs[1156]: [confdb] crit: cmap_track_delete nodelist failed: 9
Dec 20 21:12:41 core pmxcfs[1156]: [confdb] crit: cmap_track_delete version failed: 9
Dec 20 21:12:41 core pmxcfs[1156]: [confdb] crit: cmap_finalize failed: 9
Dec 20 21:12:41 core pmxcfs[1156]: [main] notice: exit proxmox configuration filesystem (0)
Dec 20 21:12:41 core systemd[1]: pve-cluster.service: Succeeded.
Dec 20 21:12:41 core systemd[1]: Stopped The Proxmox VE cluster filesystem.
Dec 20 21:12:41 core systemd[1]: pve-cluster.service: Consumed 6min 29.513s CPU time.
-- Boot ddeb7fdd72f24a028bdeff71910fe3b4 --
Dec 20 21:13:30 core systemd[1]: Starting The Proxmox VE cluster filesystem...
Dec 20 21:13:30 core pmxcfs[1195]: [quorum] crit: quorum_initialize failed: 2
Dec 20 21:13:30 core pmxcfs[1195]: [quorum] crit: can't initialize service
Dec 20 21:13:30 core pmxcfs[1195]: [confdb] crit: cmap_initialize failed: 2
Dec 20 21:13:30 core pmxcfs[1195]: [confdb] crit: can't initialize service
Dec 20 21:13:30 core pmxcfs[1195]: [dcdb] crit: cpg_initialize failed: 2
Dec 20 21:13:30 core pmxcfs[1195]: [dcdb] crit: can't initialize service
Dec 20 21:13:30 core pmxcfs[1195]: [status] crit: cpg_initialize failed: 2
Dec 20 21:13:30 core pmxcfs[1195]: [status] crit: can't initialize service
Dec 20 21:13:31 core systemd[1]: Started The Proxmox VE cluster filesystem.
Dec 20 21:13:36 core pmxcfs[1195]: [status] notice: update cluster info (cluster name rainland, version = 7)
Dec 20 21:13:36 core pmxcfs[1195]: [dcdb] notice: members: 2/1195
Dec 20 21:13:36 core pmxcfs[1195]: [dcdb] notice: all data is up to date
Dec 20 21:13:36 core pmxcfs[1195]: [status] notice: members: 2/1195
Dec 20 21:13:36 core pmxcfs[1195]: [status] notice: all data is up to date
Dec 20 21:34:12 core pmxcfs[1195]: [status] notice: node has quorum
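Since the log is long, a small filter that keeps only the pmxcfs lines logged at "crit" may make it easier to read. This is a sketch: `crit_lines` is a hypothetical helper name, and the pattern assumes the line format shown above (`pmxcfs[PID]: [subsystem] crit: ...`).

```shell
#!/bin/sh
# crit_lines: keep only pmxcfs log lines with severity "crit".
# Reads the files given as arguments, or standard input if there are none.
crit_lines() {
    grep -E 'pmxcfs\[[0-9]+\]: \[[a-z]+\] crit: ' "$@"
}

# Possible usage (assumption: the log comes from the pve-cluster unit):
#   journalctl -u pve-cluster | crit_lines
```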