Proxmox High Availability (HA) cluster - is it working at all for anyone?

I've not yet seen any responses of anyone successfully using a Proxmox cluster in a HA production environment. Can anyone let me know their experiences? I'm eager to use this in our production environment and would really like to hear from others who have deployed it.

We implemented multiple working HA-Scenarios, and we do it in every of our proxmox advanced trainings. One productive used cluster now works for over an year without problems concerning HA at a telcommunication service provider's data center.
 
We implemented multiple working HA-Scenarios, and we do it in every of our proxmox advanced trainings. One productive used cluster now works for over an year without problems concerning HA at a telcommunication service provider's data center.

Thank you for the feedback. How many nodes in the cluster and how many HA VMs?
 
I discovered the problem with our cluster setup. Actually, there were two problems. First, the DRAC cards had the MTU set to 1500. Problem is that I have the internal network configured to use an MTU of 9000 to improve throughput with the storage SAN (NFS). This explains the hit-or-miss with fencing of the nodes. Second problem, one of the physical nodes was still using an MTU of 1500. This is what caused the cluster instabilities after one of the nodes was shutdown.

I've configured everything for an MTU of 1500 for the moment and have been pleasantly surprised that the node is fenced (turned off) when I yank the network on it. Also, the HA VMs are migrated to one of the remaining active nodes. I haven't figured out what the criteria is for a node to receive a particular HA VM, but it does work. I should mention that the node is restarted on another node, not live migrated. This makes some sense as this would require all nodes to have live copies of the active state (memory) of the HA VM, which I presume isn't (yet) possible with QEMU.
 
"I haven't figured out what the criteria is for a node to receive a particular HA VM" There is no criteria, except that the destination must be running;-)
 
I've been testing Proxmox 3.4 with pve-no-subscription repository. Does anyone know what happens with the installed modules when we switch to the enterprise (subscription) repository? Are these components immediately replaced when an "aptitude full-upgrade" is done the first time?
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!