Node offline

siegfried schwenker

Hello,

in our 5-node cluster, one node is marked as offline.
node01offline.png

The output of pvecm status looks fine, however:

Code:
 # pvecm status
Quorum information
------------------
Date:             Thu Jun 17 15:22:56 2021
Quorum provider:  corosync_votequorum
Nodes:            5
Node ID:          0x00000001
Ring ID:          1/140
Quorate:          Yes

Votequorum information
----------------------
Expected votes:   5
Highest expected: 5
Total votes:      5
Quorum:           3 
Flags:            Quorate

Membership information
----------------------
    Nodeid      Votes Name
0x00000001          1 172.16.250.21 (local)
0x00000002          1 172.16.250.22
0x00000003          1 172.16.250.23
0x00000004          1 172.16.250.24
0x00000005          1 172.16.250.25

All services also appear to be running:
node01offline02.png

On node01, the pvesr service is failing:
Code:
# systemctl status pvesr
● pvesr.service - Proxmox VE replication runner
   Loaded: loaded (/lib/systemd/system/pvesr.service; static; vendor preset: enabled)
   Active: failed (Result: exit-code) since Thu 2021-06-17 15:25:09 CEST; 3s ago
  Process: 1820826 ExecStart=/usr/bin/pvesr run --mail 1 (code=exited, status=13)
 Main PID: 1820826 (code=exited, status=13)
      CPU: 487ms

Jun 17 15:25:04 node01b pvesr[1820826]: trying to acquire cfs lock 'file-replication_cfg' ...
Jun 17 15:25:05 node01b pvesr[1820826]: trying to acquire cfs lock 'file-replication_cfg' ...
Jun 17 15:25:06 node01b pvesr[1820826]: trying to acquire cfs lock 'file-replication_cfg' ...
Jun 17 15:25:07 node01b pvesr[1820826]: trying to acquire cfs lock 'file-replication_cfg' ...
Jun 17 15:25:08 node01b pvesr[1820826]: trying to acquire cfs lock 'file-replication_cfg' ...
Jun 17 15:25:09 node01b pvesr[1820826]: error with cfs lock 'file-replication_cfg': no quorum!
Jun 17 15:25:09 node01b systemd[1]: pvesr.service: Main process exited, code=exited, status=13/n/a
Jun 17 15:25:09 node01b systemd[1]: Failed to start Proxmox VE replication runner.
Jun 17 15:25:09 node01b systemd[1]: pvesr.service: Unit entered failed state.
Jun 17 15:25:09 node01b systemd[1]: pvesr.service: Failed with result 'exit-code'.

Does anyone have an idea?
Thanks in advance...
 
Hi,

are all nodes running the latest PVE version? Is the output of pvecm status the same on all nodes?
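
One way to gather both answers in a single pass is sketched below. It is only a sketch and makes a few assumptions: the node addresses are the ones from the membership list in the first post, root SSH access to every node works, and the names `NODES`, `RUN`, and `collect` are made up for this example. It prints the output of `pveversion` and the key lines of `pvecm status` for each node so they can be compared side by side.

```shell
#!/bin/sh
# Node addresses copied from the membership list in the first post.
NODES="172.16.250.21 172.16.250.22 172.16.250.23 172.16.250.24 172.16.250.25"

# collect: hypothetical helper that queries each node in turn.
# RUN defaults to ssh; set RUN=echo for a dry run that only prints
# the command that would be sent to each node.
collect() {
    for node in $NODES; do
        echo "=== $node ==="
        ${RUN:-ssh} "root@$node" 'pveversion; pvecm status | grep -E "^(Nodes|Quorate|Ring ID|Expected votes|Total votes):"'
    done
}
```

Calling `collect` after sourcing the snippet runs the query. If one node reports a different PVE version, Ring ID, or Quorate value than the others, that node is the place to look first.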