high availability

  1. B

    Node reboot when a disk is removed

    Hi everyone, I have setup a 3 nodes Proxmox 7.0-11 cluster, each node has 3 HDD drives, the first drive is used for Proxmox and the others are OSDs. I wanted to test HA by removing the last disk of the first node (which doesn't contain Proxmox), but when I remove it the node reboot. Is this...
  2. K

    All resources in HA

    Hi We have been doing some testing on HA with PVE and Ceph, which has given very possitive results in terms of our goals with exploring the setup. From our testing, we have found two issues that we had not really expected 1. When a host has failed and comes back online, VMs are not migrated...
  3. K

    [SOLVED] HA with ZFS

    Hi We are looking into hardening our infrastructure to better handle outages due to network or hardware failtures. We use OVH dedicated physical servers for our infrastructure and currently have 3 node PVE cluster. This cluster is currently configured with 2x4TB nVME drives per node, with one of...
  4. A

    New to Ceph; looking for suggestions for a 3-node cluster

    Hey all; I've made use of Proxmox for a while and I'm looking to take my first few steps into ceph -- I have 3x QCT QuantaGrid SD1Q-1ULH that I will be making use of. Each of these nodes have a Xeon D-1541 with an OCP Mezz 10Gb SFP+ (QCT Intel® 82599ES dual-port 10G SFP+ OCP mezzanine) and...
  5. R

    HA NFS service for KVM VMs on a Proxmox Cluster with Ceph

    Hi, we are migrating from a VMware ESXi setup with a NetApp NFS based shared storage. We also did use NFS filesystems for mounts like /home or /root and application filesystems like a shared /var/www within our virtual machines and host-specific filesystems like /var/log. Most of our...
  6. A

    Best available HA solution with 3 servers

    Hello, I have some doubts on how High Availability can work in ProxMox. I'd like to create an HA enviroment to host my personal websites and also to lear more about HA. I imagine I have 3 identical server and all of them are part of the same cluster. I read about HA with local storage, shared...
  7. aasami

    [SOLVED] What's the point of a Separated cluster network?

    Hello. I've set up a cluster with physically separated cluster network but it is not doing what I expect it to do. Here is my lab: [node1 ~]# pvecm status Cluster information ------------------- Name: clustest Config Version: 3 Transport: knet Secure auth: on Quorum...
  8. 1

    [SOLVED] iSCSI multipath

    Hi, We are trying to deploy an HA storage environment on Proxmox using multipath, a Synology NAS (RS1619xs+) and an iSCSI SAN, and we cannot get the multipath driver to manage the iSCSI LUNs. We need a little help to go further or to interpret the errors we are commiting. This is what we are...
  9. H

    dockerd: sync duration of 1.x, expected less than 1s

    I've had this problem for a while here now, and I've turned the internet upside down trying to figure it out, but there is very, very little information out there about this specific warning. It is extremely consistent and reproducible in my PVE setup though, so I figured I'd ask here! May 28...
  10. L

    [SOLVED] Correct way to restart a node without stopping containers or vm's

    Is there a correct way to restart a node (mostly for applying kernel updates)? I have about 20 virtual machines and containers that should not go down ever unless planned. So I have set the all as HA. When I shutdown a node, the VM's / LXC's shut down instead of migrating to the node next in...
  11. P

    HA for replicated VM

    Hello I have read documentation and topics on forum about this subject but cannot find a clear answer (and the thread How does Replication work in a HA environment? is quite old , 2017) If I have a replicated VM betwen the 2 nodes I'll put this VM in HA configuration, Will HA process use the...
  12. D

    [SOLVED] lrm (unable to read lrm status)

    I had a server die this past week and I removed it from the infrastructure. I can confirm that the server is out of commission and is pulled from the rack. I removed the node and all is well, or so I thought. The cluster is giving me an issue with regards to it seeming not being able to find the...
  13. givan

    VM migration sometimes fails on host reboot/shutdown

    Hello, We have a 8 servers Proxmox 6.1-5 cluster (last updated 2 days ago) that is configured for HA with "shutdown_policy=migrate". However we noticed that this sometimes works and sometimes doesn't, having a VM with HA enabled running on a host that is rebooted or powered off can result in...
  14. L

    [SOLVED] pfSense in a VM with HA

    I'm trying to figure out how I could install a pfSense VM and make it HA. I have a ceph cluster of 4 nodes (but there will be nodes added regularly), so I can give each node a public IP address on one of it's ethernet ports. Creating a bridge and adding that to the pfSense VM will give me a...
  15. J

    Live Migration PVE 6.0

    Hello Proxmox Community, I am interested in live migration. Recently I read posted accorss the fron the XCP-ng's website the phrase "Live migrate your VMs all around the world, without interruption" However I did not find any detailed sources which stipulate that live migration was actually...
  16. P

    [SOLVED] Migration With HA Enabled Dont Respect Destination Node

    I have an LXContainer that I want to migrate to node3. If I have HA enabled for that Container, it will always migrate to node1, regardless of the selected destination. The container is on shared ceph storage.
  17. ?

    Always On VMs / Containers

    I'm new to Proxmox and I'm a little lost. I have the system setup, Client side resources (VMs / Containers) fail back n fourth and i can manually migrate. The problem I have is when a node is down, a resources sessions are terminated and that resource is shutdown, migrated to another node and...
  18. K

    multiple NFS storage setup for HA cluster & simulate a failed node

    hi all! I have 3 nodes in HA cluster using Proxmox 5.4 . The nodes use the public IP for cluster communication and the private network for the storage network. Each node has an NFS storage shared with the other nodes. The disk in all 3 nodes are in RAID 1 (mirroring) (the reason i cant use ceph...
  19. A

    Cannot delete HA resources since PVE 5 to 6 update

    Hello, I come accross a difficulty with the HA management since we have updated our 3-nodes PVE cluster from 5th to 6th version. We did of course follow the Upgrade from 5.x to 6.0 wiki page, so deactivated both pve-ha-lrm and pve-ha-crm services on all nodes before the update. The pve5to6...
  20. K

    [SOLVED] cluster network and storage network

    hi everyone! We are migrating our server to a different cloud provider. While reading a documentation, i read this: "Storage communication should never be on the same network as corosync!". Our server must have HA and data redundancy/ data high availability (using ceph). The problem is, our...