cluster

  1. K

    Cluster setup and HA configuration suggestions

    Hi all, I’m putting together a Proxmox cluster with Ceph for HA and wanted to get some feedback before I go ahead and deploy everything. What I’m aiming for is fairly simple: I want proper HA with no data loss and automatic failover, but at the same time I’d still like one node (an R640) to...
  2. S

    Trouble getting OPNSense's DHCP to work across a Proxmox cluster

    Hello ! Setup : 3 nodes cluster (important, nodes are on DIFFERENT physical sites, and I'm assuming no private network between them, to be extra cautious), added all nodes in a SDN vxlan zone + created a VNet (subnet 10.6.6.0/24 with gateway = 10.6.6.1) installed OPNSense in a VM on node 1...
  3. F

    Storage types and replication , NFS and local ZFS in a cluster

    Hi Proxmox community, I am currently running a 3 nodes cluster and all my VMs disks are stored on NFS shares provided by another physical server. As that NFS server is a SPOF, I would like to replicate the VM and their disks on my third node : pve1 and pve2 have access to the nfs shares named...
  4. D

    Corosync link flapping with 3 nodes

    Hello, I am experiencing an issue with my cluster and would appreciate your advice. Problem description: I am constantly seeing Corosync-related messages (flapping/instability), even though there are no visible link issues. Linux does not report any port or link failures. Switch monitoring...
  5. J

    PVE: facility for a cluster-wide mutex?

    I have a program that executes on proxmox PVE nodes and has to ensure that one of its critical sections will only run on *one* of all involved cluster node at the same time. It's a low-volume thing (the mutex will have to be acquired rarely), but it must work reliably. Will something to the...
  6. C

    Long-time offline of a node

    I have a node with hardware failure, its resurrection has been deferred repeatedly now (soon a year). HA votes are set to 0, quorum and capacity is OK with the remaining odd number of nodes. The rest of the nodes have been receiving updates as usual. Are there any gotchas for keeping a node...
  7. K

    Replication of a VM between two independent Proxmox clusters (LVM storage)

    Hello, I am facing a design question regarding VM replication between two independent Proxmox clusters. Environment Two separate Proxmox clusters (Cluster A and Cluster B) No shared cluster configuration Both clusters use LVM storage (no ZFS) Inter-site VPN link Proxmox Backup Server...
  8. R

    Storage for small clusters, any good solutions?

    Hi there, the Title may be a bit deceptive as I know there are good solutions that work for many but for me/my workplace we face a bit of a dilemma. I know I'm opening this can of worms again and this is also partly me venting a bit of my frustration and I'm sorry about that. We wanna use...
  9. P

    [SOLVED] VM replication to unclustered node or separate cluster

    Hi everyone, we're currently running a small cluster (4 nodes + qdevice) in a single server room and would now like to physically move two nodes to a separate location for some georedundancy. Those two servers are mostly hot spares, replicating VMs and other data from the primary ones. Because...
  10. D
  11. T

    Rolling Cluster Update Script

    I thought I would share a script that I created and have been using to do rolling updates for Proxmox clusters: https://github.com/thanegill/proxmox-upgrade-cluster It doesn't cover all edge cases but does a decent job of waiting for the correct things to happen in the correct order. Issues and...
  12. J

    storage status unknown in cluster

    Hi all. I just got my new server and I wanted to migrate some of my cts and vms from my old server (node1) to the new server (node2). So what I did, was 0. upgrade node1 from Proxmox 8.1.* (latest) to 9.1 1. to create a cluster on node1 2. join the cluster from node2 3. this is where it got...
  13. S

    [SOLVED] Clustering issues

    (This is a non-production environment) After a recent power cut, I ended up with a split cluster which consisted of three nodes (may switch came up last... After repeated attempts to reboot them all, one at a time, two...I gave up. Grantted, I should have asked for help at this point... One...
  14. E

    Proxmox Cluster

    Hello, I am doing some test about clustering two physical nodes along with a qDevice VM which role is to complete the quorum for a healthy cluster. The storage used for the VMs is an iSCSI share based on a 2022 Windows Server. The storage is accessible for both nodes. The first test I...
  15. T

    [TUTORIAL] How to Configure Fibre Channel SAN Storage with Multipath and High Availability on Proxmox VE 9

    Hi, guys i just finished my post about the "How to Configure Fibre Channel SAN Storage with Multipath and High Availability on Proxmox VE 9" , if you have some advices /tips are welcomed...
  16. J

    Storage for production cluster

    Hello everyone, I am reaching out to you because we are trying to migrate from VMware VSAN to Proxmox. First, let me give you a quick overview of our current situation. We have a cluster of three nodes (vxRail) with 10 HDDs and two SSDs (vSan in cache tiering) and a 10G network. I have done...
  17. M

    Shared Storage across 10GB Fiber for a ProxMox Cluster?

    Hey all. I have a configuration question for my home lab setup. Currently I have a 12-bay Dell PowerEdge r540 with 12 x 3.5" drive bays. I would like to maximize my storage on that server and use it as a shared ZFS pool across the network. The servers will have their own network away from...
  18. yboujraf

    Ceph storage hosted on proxmox nodes shared with K3S cluster ?

    Dear, I am facing to a choice to share the existing ceph storage from proxmox cluster to K3S cluster. Is it best practice to do that or need to SoC and each cluster has his own storage ? If proxmox manage the ceph storage is it a good governance ? Some clarifications are welcome. Best Regards,
  19. A

    vm migration error on node reboot

    I'm getting vm migration error during node reboot due to kernel update on a 3 node hyperconverged cluster with ceph installed. Error is here: Cleanup after stopping VM failed - org.freedesktop.DBus.Error.NoReply: Did not receive a reply. Possible causes include: the remote application did not...
  20. K

    NTP Not synced correctly

    NTP doesn't sync correctly with only one of my computers in my cluster, and I was wondering how I could change the NTP time or maybe sync it correctly? Here is the status of it, its about 5 hours off. Yes, my system is called butt. ● chrony.service - chrony, an NTP client/server Loaded...