no wories, for a second there i thought my cluster was completely hosed... here are the journalctl outputs from the hosts:Ah, sorry my bad,pmxcfs
is managed by thepve-cluster.service
, sosystemctl stop pve-cluster.service
.
PVE01
Code:
Apr 18 11:38:07 pve01 systemd[1]: Started Corosync Cluster Engine.
Apr 18 11:55:44 pve01 systemd[1]: Stopping Corosync Cluster Engine...
Apr 18 11:55:44 pve01 corosync-cfgtool[6811]: Shutting down corosync
Apr 18 11:55:44 pve01 corosync[3325]: [MAIN ] Node was shut down by a signal
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Unloading all Corosync service engines.
Apr 18 11:55:44 pve01 corosync[3325]: [QB ] withdrawing server sockets
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync vote quorum service v1.0
Apr 18 11:55:44 pve01 corosync[3325]: [CFG ] Node 1 was shut down by sysadmin
Apr 18 11:55:44 pve01 corosync[3325]: [QB ] withdrawing server sockets
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync configuration map access
Apr 18 11:55:44 pve01 corosync[3325]: [QB ] withdrawing server sockets
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync configuration service
Apr 18 11:55:44 pve01 corosync[3325]: [QB ] withdrawing server sockets
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync cluster closed process group service v1.01
Apr 18 11:55:44 pve01 corosync[3325]: [QB ] withdrawing server sockets
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync cluster quorum service v0.1
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync profile loading service
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync resource monitoring service
Apr 18 11:55:44 pve01 corosync[3325]: [SERV ] Service engine unloaded: corosync watchdog service
Apr 18 11:55:45 pve01 corosync[3325]: [KNET ] link: Resetting MTU for link 0 because host 1 joined
Apr 18 11:55:45 pve01 corosync[3325]: [MAIN ] Corosync Cluster Engine exiting normally
Apr 18 11:55:45 pve01 systemd[1]: corosync.service: Succeeded.
Apr 18 11:55:45 pve01 systemd[1]: Stopped Corosync Cluster Engine.
Apr 18 11:55:45 pve01 systemd[1]: corosync.service: Consumed 3.484s CPU time.
Apr 18 11:56:09 pve01 systemd[1]: Starting Corosync Cluster Engine...
Apr 18 11:56:09 pve01 corosync[6820]: [MAIN ] Corosync Cluster Engine 3.1.7 starting up
Apr 18 11:56:09 pve01 corosync[6820]: [MAIN ] Corosync built-in features: dbus monitoring watchdog systemd xmlconf vqsim nozzle snmp pie relro bindnow
Apr 18 11:56:09 pve01 corosync[6820]: [TOTEM ] Initializing transport (Kronosnet).
Apr 18 11:56:09 pve01 corosync[6820]: [TOTEM ] totemknet initialized
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] pmtud: MTU manually set to: 0
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] common: crypto_nss.so has been loaded from /usr/lib/x86_64-linux-gnu/kronosnet/crypto_nss.so
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync configuration map access [0]
Apr 18 11:56:09 pve01 corosync[6820]: [QB ] server name: cmap
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync configuration service [1]
Apr 18 11:56:09 pve01 corosync[6820]: [QB ] server name: cfg
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync cluster closed process group service v1.01 [2]
Apr 18 11:56:09 pve01 corosync[6820]: [QB ] server name: cpg
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync profile loading service [4]
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync resource monitoring service [6]
Apr 18 11:56:09 pve01 corosync[6820]: [WD ] Watchdog not enabled by configuration
Apr 18 11:56:09 pve01 corosync[6820]: [WD ] resource load_15min missing a recovery key.
Apr 18 11:56:09 pve01 corosync[6820]: [WD ] resource memory_used missing a recovery key.
Apr 18 11:56:09 pve01 corosync[6820]: [WD ] no resources configured.
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync watchdog service [7]
Apr 18 11:56:09 pve01 corosync[6820]: [QUORUM] Using quorum provider corosync_votequorum
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync vote quorum service v1.0 [5]
Apr 18 11:56:09 pve01 corosync[6820]: [QB ] server name: votequorum
Apr 18 11:56:09 pve01 corosync[6820]: [SERV ] Service engine loaded: corosync cluster quorum service v0.1 [3]
Apr 18 11:56:09 pve01 corosync[6820]: [QB ] server name: quorum
Apr 18 11:56:09 pve01 corosync[6820]: [TOTEM ] Configuring link 0
Apr 18 11:56:09 pve01 corosync[6820]: [TOTEM ] Configured link number 0: local addr: 10.10.100.100, port=5405
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] link: Resetting MTU for link 0 because host 1 joined
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 2 (passive) best link: 0 (pri: 0)
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 2 has no active links
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 2 (passive) best link: 0 (pri: 1)
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 2 has no active links
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 2 (passive) best link: 0 (pri: 1)
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 2 has no active links
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 4 (passive) best link: 0 (pri: 1)
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 4 has no active links
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 4 (passive) best link: 0 (pri: 1)
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 4 has no active links
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 4 (passive) best link: 0 (pri: 1)
Apr 18 11:56:09 pve01 corosync[6820]: [KNET ] host: host: 4 has no active links
Apr 18 11:56:09 pve01 corosync[6820]: [QUORUM] Sync members[1]: 1
Apr 18 11:56:09 pve01 systemd[1]: Started Corosync Cluster Engine.
Apr 18 11:56:09 pve01 corosync[6820]: [QUORUM] Sync joined[1]: 1
Apr 18 11:56:09 pve01 corosync[6820]: [TOTEM ] A new membership (1.a22) was formed. Members joined: 1
Apr 18 11:56:09 pve01 corosync[6820]: [QUORUM] Members[1]: 1
Apr 18 11:56:09 pve01 corosync[6820]: [MAIN ] Completed service synchronization, ready to provide service.