Good morning,
We are testing PVE as a possible path forward for our current infrastructure, with nodes in different geographic sites. The appeal of PVE for us is the ability to view all the systems from one panel and perform offline migrations.
As I understand it, "pvecm expected" is temporary and resets once all is good.
I also believe I understand the perils of a split brain on the /etc/pve filesystem. That being said, since we are not running shared storage or HA, it seems the risk of differences on the filesystem is minimal between the various nodes.
We had an instance that a site lost connectivity for a few hours. We took the opportunity to bring the node down and perform offline maintenance. Once connectivity was regained, the node had issues reconnecting to the cluster, and refused to automatically bring the vms back online. At that point we rebooted the entire cluster, which resolved the issue. These being test machines, this was an option.
If these had been our production servers, rebooting the entire cluster for one node would not have been an option. We do need the nodes to immediately start the vms on them regardless of quorum.
Which leads back to the original question, is there a way to permanently disable the cluster quorum requirement?
Or am I not understanding something?
As always, I'm open to suggestions.
Thanks!