emergency mode - possible cluster issue?

bill209

Member
May 11, 2019
[edit: removed the cluster, rechecking.]
[edit2: RESOLVED. the problem was the hard drives. removing their entries from /etc/fstab let proxmox boot up normally.]
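
for anyone landing here with the same symptom, below is a minimal sketch of the fix from edit2. it assumes the fstab entries reference the drives by label, as the journal messages further down suggest. `disable_fstab_entries` is a made-up helper name and the demo file contents are invented; run it against a copy before touching the real /etc/fstab. in emergency mode the root filesystem may be mounted read-only, in which case `mount -o remount,rw /` is needed first.

```shell
# Comment out every active line in an fstab-style file that mentions PATTERN.
# disable_fstab_entries is a hypothetical helper name, not a Proxmox tool.
disable_fstab_entries() {
    file=$1
    pattern=$2
    cp "$file" "$file.bak"   # keep a backup next to the original
    awk -v p="$pattern" '
        /^[ \t]*#/   { print; next }           # already a comment: keep as is
        index($0, p) { print "#" $0; next }    # matching entry: comment it out
                     { print }                 # everything else: unchanged
    ' "$file.bak" > "$file"
}

# Demo on a throwaway file; the mount points and options are made up.
demo=$(mktemp)
cat > "$demo" <<'EOF'
/dev/pve/root / ext4 errors=remount-ro 0 1
/dev/disk/by-label/wdpurple4a /srv/wdpurple4a ext4 defaults 0 2
/dev/disk/by-label/wdpurple4b /srv/wdpurple4b ext4 defaults 0 2
EOF
disable_fstab_entries "$demo" /dev/disk/by-label/wdpurple4
cat "$demo"
```

an alternative that keeps the entries is adding the `nofail` mount option, which tells systemd not to treat a missing device as a failure of the local file systems target.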

i have two proxmox servers (both on VE 6) that i joined into a cluster for testing. everything worked fine for the past month. today the servers could not see each other, and one of them now boots into emergency mode.
running journalctl -xb shows two basic issues:

1. four relatively new drives are not mounting. but i configured these in OMV, so that may be normal on bootup?
2. the cluster (quorum, cmap, cfg) cannot initialize. could this be due to the drives?

fyi, i did set the quorum expectation to ONE, so that i could still get to the servers if one failed. and i can get to the one that is running. pvecm status output below.
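
for context on why the expectation matters: corosync's votequorum needs a strict majority of the expected votes, i.e. expected / 2 + 1 with integer division. a small sketch of that arithmetic (the function name `quorum_needed` is made up for illustration):

```shell
# Votes required for quorum, given the number of expected votes.
quorum_needed() {
    echo $(( $1 / 2 + 1 ))
}

quorum_needed 2   # two nodes, one vote each, default expectation
quorum_needed 1   # after lowering the expectation to one
```

with two nodes and the default expectation, a lone node holds 1 of the 2 votes required and loses quorum. with the expectation lowered to 1, a single node stays quorate, which is consistent with the `Expected votes: 1` and `Quorum: 1` lines in the pvecm status output.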

to address the mounting issue, i tried removing the four mounts from /etc/pve/storage.cfg, but i am not allowed to edit that file.

i am suspicious of the cluster due to error messages (below), and wonder if removing the cluster would fix things.

i do need the downed server, but i do NOT need my two servers to be in a cluster, as i was going to remove that anyway.

journalctl -xb reports the following errors:

Couldn't get size: 0x8000...e (after loading UEFI: db cert VMware items)
ACPI errors (no biggie i dont think)
...
[note: the following errors happen for each of the four drives i added (a/b/c/d). these were setup/configured with OMV, so maybe this is normal?]
...
mox systemd[1] Timed out waiting for device /dev/disk/by-label/wdpurple4a/b/c/d
A start job for unit ...wdpurple4a/b/c/d has failed
the job identifier is 63 and the job result is timeout.
mox systemd[1] Dependency failed for local file systems.
...
mox pmxcfs[930]: [quorum] crit: quorum_initialize failed: 2
mox pmxcfs[930]: [quorum] crit: can't initialize service
mox pmxcfs[930]: [quorum] crit: cmap_initialize failed: 2
mox pmxcfs[930]: [quorum] crit: can't initialize service
mox pmxcfs[930]: [quorum] crit: cpg_initialize failed: 2
mox pmxcfs[930]: [quorum] crit: can't initialize service
mox pmxcfs[930]: [quorum] crit: cpg_initialize failed: 2
mox pmxcfs[930]: [quorum] crit: can't initialize service
...
mox systemd[776]: emergency.service: executable /bin/plymouth missing. skipping...
the process /bin/plymouth could not be executed and failed.
...
[note: lots more quorum, cmap and cpg initialization errors]
...
[note: followed by messages about emergency.service]
...
[note: now just looping through lots of quorum, cmap and cpg initialization errors]
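
with this much repeated cluster noise in the journal, it can help to pull out just the devices systemd gave up on. a rough sketch, assuming the message wording matches the "Timed out waiting for device" lines quoted above; `timed_out_devices` is a hypothetical helper and the sample input lines are illustrative, not copied from a real journal:

```shell
# Read journal text on stdin and print each device that timed out, once.
timed_out_devices() {
    sed -n 's/.*Timed out waiting for device \([^ ]*\).*/\1/p' \
        | sed 's/\.$//' \
        | sort -u
}

# Demo with sample lines; on a real system: journalctl -xb | timed_out_devices
printf '%s\n' \
    'mox systemd[1]: Timed out waiting for device /dev/disk/by-label/wdpurple4a.' \
    'mox pmxcfs[930]: [quorum] crit: quorum_initialize failed: 2' \
    'mox systemd[1]: Timed out waiting for device /dev/disk/by-label/wdpurple4b.' \
    | timed_out_devices
```

each device printed this way corresponds to an fstab entry that blocks the local file systems target, which is what drops the boot into emergency mode.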


here is the result of pvecm status on running server

Cluster information
-------------------
Name: pmox
Config Version: 2
Transport: knet
Secure auth: on

Quorum information
------------------
Date: Fri Jan 10 19:57:32 2020
Quorum provider: corosync_votequorum
Nodes: 1
Node ID: 0x00000001
Ring ID: 1.580
Quorate: Yes

Votequorum information
----------------------
Expected votes: 1
Highest expected: 1
Total votes: 1
Quorum: 1
Flags: Quorate

Membership information
----------------------
Nodeid Votes Name
0x00000001 1 192.168.1.2 (local)

here is the result of pvecm status on down server

cannot initialize cmap service
 
output of (on downed server):

journalctl -u corosync.service

No entries

cat /etc/pve/corosync.conf (appears same as on running server)

logging {
debug: off
to_syslog: yes
}
nodelist {
node {
name: mox
nodeid: 2
quorum_votes: 1
ring0_addr: 192.168.1.4
}
node {
name: prox
nodeid: 1
quorum_votes: 1
ring0_addr: 192.168.1.2
}
}

quorum {
provider: corosync_votequorum
}

totem {
cluster_name: pmox
config_version: 2
interface {
bindnetaddr: 192.168.1.2
ringnumber: 0
}
ip_version: ipv4
secauth: on
version: 2
}
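
one cheap thing to rule out when corosync cannot initialize is a malformed address in the nodelist. a rough sketch, assuming the ring addresses are IPv4 literals rather than hostnames; `check_ring_addrs` is a hypothetical helper, not part of corosync or proxmox:

```shell
# Print each ring0_addr value from a corosync.conf with an ok/INVALID verdict.
check_ring_addrs() {
    awk '$1 == "ring0_addr:" {
        n = split($2, octet, "[.]")
        ok = (n == 4)                       # a dotted quad has four parts
        for (i = 1; i <= n; i++)
            if (octet[i] !~ /^[0-9]+$/ || octet[i] + 0 > 255)
                ok = 0                      # non-numeric or out of range
        print $2, (ok ? "ok" : "INVALID")
    }' "$1"
}

# Demo on a throwaway file with one good and one bad address.
conf=$(mktemp)
cat > "$conf" <<'EOF'
nodelist {
node {
name: a
ring0_addr: 10.0.0.1
}
node {
name: b
ring0_addr: 10.0.0.256
}
}
EOF
check_ring_addrs "$conf"
```

on the downed node the file to check would be /etc/pve/corosync.conf, or /etc/corosync/corosync.conf if /etc/pve is not mounted.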
 
not sure of the relevance, but the hard drive space on my root drive for the running server is 99.38% full (i am trying to get this DOWN)
the hard drive space on the downed server is around 75%.
there is only 100MB allotted to root, out of a 1TB drive. would like to add a bit more. that's my next task. : |
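
on the relevance question: pmxcfs keeps its database under /var/lib/pve-cluster, which normally sits on the root filesystem, so a nearly full root is worth watching even if it is not the cause here. a minimal sketch for checking it; `fs_used_pct` is a made-up helper name and the 95 threshold is an arbitrary assumption:

```shell
# Print the used percentage of the filesystem holding PATH (default /),
# as a bare integer without the percent sign.
fs_used_pct() {
    df -P "${1:-/}" | awk 'NR == 2 { sub(/%/, "", $5); print $5 }'
}

pct=$(fs_used_pct /)
if [ "$pct" -ge 95 ]; then
    echo "root filesystem is ${pct}% full"
fi
```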
 
