No login possible after clustering nodes

j.io

Member
Mar 16, 2021
92
5
8
36
Hi all,

after having added two nodes together to a cluster, I cannot login into my node if the second node is offline.

Is this a known issue?

Screenshot 2022-06-22 at 11.52.52.png
 
Last edited:

Matthias.

Proxmox Staff Member
Staff member
Jan 17, 2022
181
45
28
Not sure if I would call it either. It's a consequence of how quorums work. When logging in, we lock a file.
However, without quorum, the cluster file system is not modifiable (as to ensure there's no inconsistent state), so the locking fails.
SSH still works though.

If you remove the node, the cluster should become quorate again, so it should be working again.

There are a few forum threads on this topic, so I would agree that it's could be communicated better to the user.
https://forum.proxmox.com/threads/p...into-web-manager-if-both-nodes-are-on.101613/
https://forum.proxmox.com/threads/i-can-not-able-to-login-gui-after-2nd-node-down.108820/
 
  • Like
Reactions: j.io

j.io

Member
Mar 16, 2021
92
5
8
36
@Matthias. OK, now I made a mistake. I ran pvecm delnode <NAME> with running VMs on the machine. Obviously, I did not shutdown the machine either. Strangely, on another node my backups do not work either now (either I am only backing up to the local drive in the same physical machine). What can I do now to somehow fix this best?

Now this is what I get on any action I do ...

Screenshot 2022-07-04 at 16.27.29.png

EDIT: OK, I could now restore my slave using this post: https://forum.proxmox.com/threads/proxmox-ve-6-removing-cluster-configuration.56259/#post-259203

The master node is still untouched, what do I have to do with it? Both nodes see each other in the GUI as offline. I read I have to remove a folder? But I am better waiting for your input from now on ... luckily all data is backed up, but still annoying. However, it was my fault.
 
Last edited:

datschlatscher

Active Member
Staff member
Mar 14, 2022
126
41
28
What can I do now to somehow fix this best?
It seems what you have already done, removing the second node from your cluster and resetting all corosync configurations there. (And perhaps adding it again after that)

I read I have to remove a folder?
It sounds like there is still a remnant configurations folder for the other node on each node. However, to make sure please first post the output of pvecm status and cat /etc/pve/corosync.conf.

OK, I could now restore my slave using this post:
Thinking in terms of slave and master is not useful when working with a Proxmox cluster. Each node should be able to do any management task.
Every node in a Proxmox cluster gets assigned a certain number of votes (default 1), if the number of available votes are bigger than 50% of the total votes in the cluster, it is considered quorate and the cluster will be able to operate normally. Functionally, there are no "master" or "slave" nodes in Proxmox.
 
  • Like
Reactions: j.io

j.io

Member
Mar 16, 2021
92
5
8
36
Thank you for your answer.

pvecm status

Code:
Cluster information
-------------------
Name:             node1
Config Version:   2
Transport:        knet
Secure auth:      on

Quorum information
------------------
Date:             Mon Jul  4 18:02:49 2022
Quorum provider:  corosync_votequorum
Nodes:            1
Node ID:          0x00000001
Ring ID:          1.6a
Quorate:          Yes

Votequorum information
----------------------
Expected votes:   1
Highest expected: 1
Total votes:      1
Quorum:           1
Flags:            Quorate

Membership information
----------------------
    Nodeid      Votes Name
0x00000001          1 10.0.3.2 (local)

cat /etc/pve/corosync.conf

Code:
logging {
  debug: off
  to_syslog: yes
}

nodelist {
  node {
    name: node1
    nodeid: 1
    quorum_votes: 1
    ring0_addr: 10.0.3.2
  }
  node {
    name: node2
    nodeid: 2
    quorum_votes: 1
    ring0_addr: 10.0.3.3
  }
}

quorum {
  provider: corosync_votequorum
}

totem {
  cluster_name: node1
  config_version: 2
  interface {
    linknumber: 0
  }
  ip_version: ipv4-6
  link_mode: passive
  secauth: on
  version: 2
}
 

datschlatscher

Active Member
Staff member
Mar 14, 2022
126
41
28
Are both outputs from node1? The output of both commands from node2 could also be useful here.

It is a bit weird though, now that I read it again.
1. In the very beginning when you issued pvecm delnode for the first time, the node should have been removed from the corosync config already.
2. As far as I am aware, the GUI uses the corosync config to determine what to display in the GUI. The commands from the post you linked above should have also removed the corosync config on node2, though. It seems like they accidentally synced again (?)

I'd advise you to again try the steps in the post above again and separate the node without reinstalling.
When the nodes no longer see each other and are no longer visible in the config, it should be safe to remove the configurations folder in /etc/pve/nodes/<nodename>. After that you should be able to add the node to the cluster again, if you want to do so.
 
  • Like
Reactions: j.io

j.io

Member
Mar 16, 2021
92
5
8
36
Are both outputs from node1? The output of both commands from node2 could also be useful here.

It is a bit weird though, now that I read it again.
1. In the very beginning when you issued pvecm delnode for the first time, the node should have been removed from the corosync config already.
2. As far as I am aware, the GUI uses the corosync config to determine what to display in the GUI. The commands from the post you linked above should have also removed the corosync config on node2, though. It seems like they accidentally synced again (?)

I'd advise you to again try the steps in the post above again and separate the node without reinstalling.
When the nodes no longer see each other and are no longer visible in the config, it should be safe to remove the configurations folder in /etc/pve/nodes/<nodename>. After that you should be able to add the node to the cluster again, if you want to do so.
That worked perfectly. All solved. Thanks a ton. Great forum!
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get your own in 60 seconds.

Buy now!