corosync

  1. S

    Half of the hosts in the cluster automatically restart due to abnormality

    I especially want to know what protection mechanism the PVE cluster has to allow the host to automatically restart. Environment: There are 13 hosts in the cluster: node1-13 Version: pve-manager/6.4-4/337d6701 (running kernel: 5.4.106-1-pve) Web environment: There are two switches A and B...
  2. D

    corosync file always go back to default

    Hi, i am using Proxmox with ZFS replication and HA, i have set the /etc/pve/corsync.conf: quorum_votes: 1 two_nodes: yes but everytime the pve1 crashes the pve2 stays on "waiting for quorum", when pve1 is online again the option is back to default, how to solve this?
  3. J

    Expected Votes CS_ERR_INVALID_PARAM

    Hallo zusammen, ich habe ein Cluster konfiguriert, wo nicht alle Nodes die gleiche Anzahl an Votes haben soll. Grund dafür ist deren Standort. Soweit so gut. Allerdings werden nach aktueller Konfiguration 17 Votes benötigt um Quorum zu erreichen. Aufgrund der Konstellation reichen allerdings...
  4. E

    Of Shared Storage, SBDs, and Clustered VMs

    Hi all, I would like to cluster some VMs within my Proxmox Homer Lab, using Pacemaker and Corosync. i world then like to create a couple of Shared Storage Devices between the nodes, all sharing the same respective "physical" storage devices underneath. The idea is to create two, one being a...
  5. helojunkie

    New Servers w/100G Trunks, Should I still use a separate Corosync network?

    So, as the title says, I am deploying all new Proxmox servers to replace our aging fleet of 2U Dells. Currently, I have a 10G trunk for all of my normal VLANs and a separate 10G connection specific to only Corosync VLAN traffic. My new servers have 4 x 10G NICs and 2 x 100G NICs each. I was...
  6. C

    Corosync won't start

    I'm having issues getting corosync to start up, causing my node to be unable to connect to the other. Diagnostics so far I've tested the basics like pinging one node from the other and it works fine.\ Results from journalctl -xeu pve-cluster.service Jan 15 18:15:49 pve847 pmxcfs[122430]...
  7. E

    QDevice w/odd # nodes - docs discourage from perfectly safe setup?

    There's a great deal of misleading piece of argument to be found in currently official PVE docs on QDevices [1] and then some conscious effort to take it even further. Under "Supported setups" [1] the following is advised (emphasis mine): It continues to provide an absurd piece of reasoning...
  8. C

    Can you add a direct network link to existing cluster?

    I have a small cluster of just 2 devices, I will be adding a 3rd in the not too distant future. One of the nodes has 10gb uplink to external, the other only has 1gb. This means that synching between the two is limited to 1GB of course. I've added a 10gb network card to the node with 1GB...
  9. E

    HA & last_man_standing + wait_for_all

    It's nowhere in the PVE official docs, but corosync does support last_man_standing and when used with HA it is suggested to also set wait_for_all. I found some previous threads, but not in relation to HA. Now I understand the official PVE endorsed way would be to just use a qdevice, but this...
  10. M

    [SOLVED] Remove node from cluster/datacenter after physically removing the device

    Hi folks! I have a small homelab and I had 4 PVE nodes: pve0, pve1, pve2 and pve3 in 1 datacenter/cluster. I was looking to turn pve3 into PBS, so I simply disconnected the hardware and installed PBS there. When I login to GUI on pve0/pve1/pve2 I still can see pve3 and I'd like to get rid of...
  11. P

    How to re-join same node to the cluster (Proxmox 8)

    Hello, I faced with a problem, where I had to rejoin same node to the cluster, but the following errors appeared: Please enter superuser (root) password for 'IP ADDR': ************** detected the following error(s): * authentication key '/etc/corosync/authkey' already exists * cluster config...
  12. S

    Hyper-converged Cluster Networking

    What is the best practice for the corosync network? I can't imagine it having much bandwidth requirements, and I'd hate to use a seperate nic just for this. seems to just need low latency. Since I have ceph public and cluster networks separated on separate 10G Nics, Can I put corosync on the...
  13. S

    Need Help Removing Node from Proxmox Cluster

    Hello Proxmox Community, I hope this post finds you well. I'm currently facing an issue with my Proxmox setup, and I'm seeking advice on how to gracefully remove a node from a Proxmox cluster. Issue Summary: I accidentally created a Proxmox cluster with only one node, and now I'd like to...
  14. T

    Corosync using 30% of memory from all nodes

    Hi guys, In the company where I'm working, we have a Cluster composed of 8 nodes that are connected through a 1Gb ethernet network directly to a Switch. On the nodes we mostly run Kubernetes, DNS Servers and some other services. The problem is that on each node, the process corosync is using...
  15. K

    Join to existing cluster after node rename

    I tried to rename two of my nodes in my lab, and as I expected bricked the cluster configuration. Then I removed the cluster configs and created a new one, unfortunately I can't join to it..... what could be the reason for that? root@sofx1010pve3302.home.lan:~# pvecm add 192.168.30.7 -use_ssh...
  16. L

    [SOLVED] HA Cluster: One Node goes down, all other Nodes goes in reboot

    Hello, we have a 3 node PVE cluster (7.4-16), with separate cluster interfaces. The cluster interfaces are in a bond (LACP). The cluster interfaces are connected to two Cisco Nexus switches. Over the weekend our complete HA cluster failed. According to the logs of the other servers, the...
  17. D

    [SOLVED] How to force a node to catch-up? config_version is behind by 1

    Hi there, I have added a new node (pve-nuc6-5) while one was offline (pve-nuc6-4). As a result, pve-nuc6-4 remains offline despite being accessible. I noticed that pve-nuc6-4 was one version (Corosync.conf) behind the rest of the cluster. How do I force it to sync to the last version...
  18. K

    [SOLVED] Moving cluster operation traffic to another network

    I made a mistake when I created a 5-node cluster. I added all nodes by IP to the cluster and `pvecm status` now shows: Cluster information ------------------- Name: cluster2 Config Version: 5 Transport: knet Secure auth: on Quorum information ------------------ Date...
  19. M

    Cluster management/Corosync network - does it need to be faster than 1GB?

    Hello, I'm running PVE8.0 at home. Multiple users working from home with Windows VMs. Many small Linux VMs doing lots of low-bandwidth stuff. I'm in the process of setting up a 3-node cluster for HA. All 3 nodes have identical hardware. All 3 nodes are on a 10GB fiber network. All 3...
  20. X

    [SOLVED] PVE nodes were never quorum

    After temporary network issue, the nodes doesn't comeback quorum state. I only use 2 node in this cluster. corosync.conf of each node is same. Ping is OK. MTU is OK too. After "pvecm expected 1" exected, HA page "status" show "old timestamp - dead?, ..." in master, 2 lrms. What is the log file...

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!