high availability

  1. N

    [SOLVED] Broken Pipe Lessons Learned (node naming and ssh_known_hosts fumble)

    Just sharing a possible issue some people may run into in the future: a broken pipe when migrating a VM, during a replicate operation, etc. Background: I recently upgraded the CPU and motherboard for my alternate node. There were some issues during the upgrade and considering I had...
  2. K

    Diagnosing mount: /var/lib/lxc/.pve-staged-mounts/rootfs: can't read superblock on /dev/loop3

    Hello Proxmox forums! I've recently been getting this error (and the same error minus line 1 when starting without using HA) when attempting to start this container. I'm a bit stressed, because the backup for this container is a bit old, and I'd really like to restore this container to functionality. I...
  3. se4n_1

    [SOLVED] Optimum Method to Test Proxmox HA

    Good morning all, I have set up my Proxmox 6.2-12 nodes and enabled high availability for my VMs and containers and now I would like to test the HA functionality. I tried rebooting a node but quickly found this does not work - the VMs remain in a frozen state as the shutdown is graceful. Short...
  4. R

    HA and public ip

    Hi! I hope you are okay and that COVID-19 is not too restrictive for you! I want to create a new HA cluster with Proxmox. My goal is to use it in production. I am wondering about public IPs. For example: if I create a virtualized web server available on the internet with a public IP...
  5. michael.schaefers

    HA migration on node shutdown: Selection of failover nodes?

    I am evaluating a 5-node cluster (nodeA to nodeE) where every VM is bound to run on a specific node by assigning it one of the per-node HA groups I have defined. Example: group: prefer-nodeA nodes nodeA:1 nofailback 0 restricted 0. My datacenter.cfg contains ha: shutdown_policy=migrate...
  6. S

    [SOLVED] Remove HA entries while HA services are disabled

    Hi all, I'm in the middle of creating my cluster and adding new nodes. I kept the pve-ha-lrm and pve-ha-crm services disabled since I'm removing cables, rebooting nodes, etc., and I need all nodes not to fence. In the middle of deployment, one of my team added VMs to HA resources. I need to delete...
  7. L

    [SOLVED] New Cluster Reboot Problems: NTP, Adding a new Node

    Hello all, [Background] We recently set up our first Proxmox cluster with four new HP ProLiant DL360 Gen9 servers and an HP-2530-48G (J9775A) switch. The servers' NICs are configured to form LACP bonds to the switch, and we have set up Ceph and HA in Proxmox. [Issue 1] We were getting random...
  8. B

    [SOLVED] Prevent node fencing while updating corosync config

    Hi, I am about to manually change the corosync config on my PVE cluster to introduce a 2nd ring interface. I have read up on how to do that, and although I am pretty sure I got the config right, I was wondering if I could somehow prevent my nodes from being fenced, should I have messed up the new...
  9. G

    Migration of LXC fails for shared storage but KVM works fine.

    Hello everyone, I'm running into an issue with container migration that I just wanted to get some clarification on. I currently have a two-node cluster that I will be adding a third node to for HA. Both nodes have the following drive config: OS: 120GB RAID 1; VM/LXC Image Storage: 500GB RAID 1...
  10. C

    How to delay the HA procedure with 2 nodes

    Hello, I have 2 Proxmox PVE 6 nodes in HA (softdog) at two different sites, connected over the internet through a VPN. I use a third computer (Synology) to hold the VMs (high availability) over NFS. The problem is that I don't want the HA process to start just for the loss of connectivity for 30...
  11. S

    CEPH and HA switching without restarting VM

    I have Ceph and HA configured and working properly. However, in testing, when I shut down a physical node, the VM is migrated and rebooted. What I want to understand is whether this is normal behavior, or whether there is a way to avoid rebooting the VM when the physical node fails.
  12. D

    [SOLVED] lrm (unable to read lrm status)

    I had a server die this past week and I removed it from the infrastructure. I can confirm that the server is out of commission and has been pulled from the rack. I removed the node and all is well, or so I thought. The cluster is giving me an issue in that it seems unable to find the...
  13. D

    Scaling beyond single server. Suggestion wanted.

    Hello, I have been running a 1U Xeon E5-2620 v4 server with 4 SATA SSDs (Adata 1TB SSDs, very slow for ZFS :( ) configured as a ZFS mirrored stripe. It ran well for me but it doesn't have enough IO for my VM needs. So we recently purchased an AMD EPYC 7351P with 8 NVMe SSDs (Intel P4510 1TB) to solve...
  14. C

    [SOLVED] Howto setup watchdog?

    Hi, I'm running a PVE cluster on 6 nodes. In total 2 different server models are used, but all are from Lenovo. In the server configuration I can define 3 types of server timeouts: OS Watchdog, Loader Watchdog, Enable Power Off Delay. I read here that by default all hardware watchdog modules are...
  15. A

    Firewall cluster Active/Active

    I'm not sure if this is the correct place to explain this case. I want to build a firewall cluster with load balancing and high availability. I thought I would use Proxmox for the cluster, with 2 nodes for load balancing and 2 more for HA. The problem is, how can I set up the load balancing? I...
  16. S

    HA with Ceph and separate Ceph cluster

    Hello, I'm curious to get feedback on whether this setup makes sense or whether I'm looking at things from a bad angle. I currently have a Ceph cluster set up with 4 servers, 11x4TB HDDs, 10g for pub/priv. I'm thinking of setting up 3* proxmox servers with 1 HDD for the OS and 2 SSDs to act as OSDs for a proxmox...
  17. H

    Best Proxmox Scalable High Availability Setup and Configuration

    I am trying to set up the best minimum scalable HA setup and configuration. I read the documentation on High Availability; however, I have some questions that are not well covered in the documentation. I was thinking of starting with 4 servers: 2 x front end server that will use a central...
  18. Y

    High availability VM cluster with ceph

    Hi all, I have a 3-node cluster with Ceph storage (3 MONs, 3 OSDs and a CephFS installed). I enabled the High Availability feature for my virtual machine, and when I shut down one of the nodes, my VM migrates to another and restarts. My VM is a Windows 2012 R2 server. My question is: is there a...
  19. K

    Bonding LAG with multiple switches

    Hi, I have a question about bonding and HA. I want to create an HA PVE cluster, but I am confused about bonding and its modes. See this simplified picture. I have two switches (Mikrotik CRS317, not stackable) and multiple PVE nodes (just one signed). What should I configure to create a HA...
  20. K

    Cannot query HA resource details with pvesh

    Proxmox version: 5.2-8/fdf39912. I'm trying to find out which node a VM is currently routed to, so I thought this command might be useful: pvesh get /cluster/ha/resources/vm:101 However, it only sends back the following error message: Use of uninitialized value in string eq at...