corosync

  1. Adding node to cluster crash all nodes from it

    Hi, tonight is the second time I face a huge problem when trying to add a node to my existing cluster. My cluster contains approximately 15 nodes, I use CEPH as storage and everything is working pretty good. All our nodes and "future" nodes FQDN are contained in /etc/hosts like : -...
  2. Qdevice is not voting

    Hello guys, I have a problem with my Qdevice. If I type "pvemc status", my first node give the following result: root@pve1:~# pvecm status Cluster information ------------------- Name: server Config Version: 7 Transport: knet Secure auth: on Quorum information...
  3. Whole cluster randomly rebooted twice (maybe corosync?)

    Dear all, I am currently dealing with a problem in a hyperconverged (ceph) where the whole cluster reboots seemingly at random. Every single one (of the total of seven) node resets at the same time. I am suspecting corosync to not be able to communicate properly. This problem has only popped up...
  4. 1x NIC to 2x NICs (Keep VMs/WAN on 1st & Move PVE/Corosync/etc to 2nd)

    Hi there, I'm looking to update the networking setup for Proxmox VE now that the switches have been swapped out for more capable models. At the moment, each host has 1x NIC connected, which is serving VMs (WAN) and PVE (Web GUI, SSH, Corosync, etc). Static addressing for PVE is set against...
  5. Multiple cluster in corosync

    Hello all, May be someone can help, how to add second PVE cluster (as observer) in already exist PVE Cluster. How i can add here second separate CLUSTER-2? # cat /etc/pve/corosync.conf nodelist { node { name: n1 nodeid: 1 quorum_votes: 1 ring0_addr: n1 mcastaddr...
  6. corosync constant retransmit

    hello, i have up to 16 nodes in a proxmox cluster and corosync is constantly showing retransmit in logs: Jan 05 13:38:15 node2 corosync[18227]: [TOTEM ] Retransmit List: 66a9 Jan 05 13:38:23 node2 corosync[18227]: [TOTEM ] Retransmit List: 6708 Jan 05 13:38:30 node2 corosync[18227]...
  7. [SOLVED] corosync crash when adding a 15th node

    Before I had a cluster of 13 nodes. I added 3 other nodes and within 5 minutes I lost the whole cluster. After restarting corosync 1 by 1 but when I start a 15th node I have this message: corosync[29232]: [TOTEM ] Token has not been received in 380 ms then after a few minutes the cluster...
  8. Corosync - Mysterious reboot after network flapping

    Hello, I have four servers in a cluster. The last night, we faced to a big network flapping on 'srva' (private network and public network) with an impact to the private network '10.50.255.0/24'. The expected behavior was to get the three nodes (srvb, srvc, srvd) working together and the node...
  9. se4n_1

    [SOLVED] corosync-qdevice.service fails to start with 'received server error 18. Disconnecting from server'

    I have a problem getting a QDevice to work on proxmox 6.2-12 First I install the QDevice package on the 3rd witness (Raspberry Pi OS 20-08-2020) box: # apt install corosync-qnetd Reading package lists... Done Building dependency tree Reading state information... Done The following NEW...
  10. Made mistake in corosync.conf; now cannot edit

    I have (had) a 3 node Proxmox VE 6.2-11 and Ceph cluster. I'm modifying my config after install and some light use. Ceph is now on its own 10Gx2 LAN. I decided to dedicate a 1Gb interface and create a VLAN for corosync and attempted to modify corosync.conf before understanding exactly what...
  11. HA Design

    I currently have a 4 node HCI cluster that's working quite well. It will be expanding to 8 nodes total and be used for critical services. All of the testing was satisfactory and management was duly impressed. I am reinstalling the cluster from scratch in order to ensure none of the testing bits...
  12. Corosync Cluster Engine is dead. Is this normal?

    Hello. I recently installed Proxmox in 1 one physical server (node). I was browsing around the node's settings when I noticed that under System, it is saying that the status of the Corosync Cluster Engine is dead. I did some Googling and learned that the Corosync Cluster Engine is how physical...
  13. IP Range Correction

    Hey Guys, I'm sitting with an issue, we replaced some of our servers in our cluster... And somewhere we made a mistake..Can someone please help me. We would like the public IP range to be the 129.232.156.xx range, and the ceph data sync ip range 10.161.0.xx The reason obviously being that...
  14. 2 Node Cluster- Corosync Netzwerktrennung bedenken

    Ich möchte nun mein Corosync-Netzwerk auf eine andere Netzwerkschnittstelle legen mit folgendem Wikiartikel. Zurzeit befinden sich nur 2 Nodes im Cluster. Meine bedenken sind nun das wenn ich die Datei gemäß der Anleitung ändere zur neuen Netzwerkkarte, as ich dies nur auf dem einem Node über...
  15. Cluster broken after nodes update/upgrade

    Hi to All, I'm writing here since i can't find enough information about the issue I'm facing. I have decided to test 4 nodes cluster with proxmox . Everything was running just fine for the last 4 weeks.Ceph running good.Vm's running well no issues. Yesterday I have decided to update /upgrade...
  16. Corosync through vpn

    Hello everyone, I have two proxmox ve 6.2 servers, one in France and one in Germany. I have setup a layer 2 vpn between the two sites (germany site beeing the server by itself [dedicated server hosting solution]). The two servers can contact each other through the vpn lan on all ports. I...
  17. [SOLVED] Prevent node fencing while updating corosync config

    Hi, I am about to manually change the corosync config on my PVE cluster to introduce a 2ng Ring-Interface. I have read up on how to do that and although I am pretty sure, I got the config right, I was wondering if I could somehow prevent my nodes to be fenced, should I have messed up the new...
  18. Why did PVE reboot all nodes im my cluster, when only 2 needed to be fenced?

    A couple of daya ago, we experienced an issue with a switch, which carried the corosync traffic for two of the 6 PVE hosts in our cluster. I can understand that PVE fenced those two hosts, but why did the other 4 ones rebooted as well? How can I fin out, what caused all my nodes to reboot...
  19. Faulty cluster, randomly reset all nodes, can't add a new node

    Last night we tried to add a new node to the cluster, it stuck on joining showing the below messages: can't create shared ssh key database '/etc/pve/priv/authorized_keys' (re)generate node files generate new node certificate unable to create directory '/etc/pve/priv' - Permission denied We...
  20. Proxmox cluster new node error after restart

    Hi all ,joined a new node(4th) to my existing proxmox cluster(3- node cluster) and it joined after 15 min showing waiting for quorum, after that it joined but after restart it is not joining the cluster. after 30 in or so it is joining again . any suggestion on what is wrong or what the...

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE and Proxmox Mail Gateway. We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get your own in 60 seconds.

Buy now!