I have an idea that I'd like feedback on from the Proxmox community and devs if possible.
I have a 3-node Proxmox cluster. Two nodes are identical hardware. The 3rd node is similar to the other two, just with fewer resources (NICs and memory). I have ZFS replication running between the two "worker" nodes, which are also configured in an HA group. This works very well when everything is operating normally, and even when one node is down for whatever reason. I have had a few issues unrelated to Proxmox that initially caused two nodes to go offline, taking down all VMs on the cluster (as expected); those have been mostly resolved. I also have two servers external to Proxmox, one running Proxmox Backup Server and one running Plex.
My idea is this: if I added a qdevice on each of the two external servers, would that allow me to keep one of the main "worker" nodes online if the other two "real" nodes in the cluster go offline for whatever reason? And would this allow HA to function with a single "real worker" node and two qdevices? If this works, is there any downside to it?
Thanks for any response,
Al
I explained this pretty badly I think.
I currently have a 3-node cluster with two equal nodes, where any VMs that need HA run in an HA group. The 3rd node is a device with fewer resources (less memory and fewer network ports) that is not in the group and hosts a few VMs that do not need HA. ZFS replication is running between the two HA nodes.
I'm planning on splitting the 2 HA nodes into separate racks. So far, no matter how I arrange the nodes, and regardless of how many there are, if the rack with more nodes goes offline, then so do the node(s) in the rack with fewer nodes. I'm trying to find a way to keep the nodes in one rack online if the other rack fails, regardless of the node counts in either rack.
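To make the problem concrete, here is a rough sketch of the vote arithmetic as I understand it. It assumes a plain strict-majority rule with one vote per node or tie-breaker, and it is only a simplified model, not how corosync is actually implemented. The function names and rack labels are made up for illustration.

```python
def has_quorum(votes_online: int, total_votes: int) -> bool:
    """Strict majority: more than half of all expected votes must be present."""
    return votes_online * 2 > total_votes


def survives_failure_of(rack_votes: dict[str, int]) -> dict[str, bool]:
    """For each location, report whether the rest keeps quorum if that location fails."""
    total = sum(rack_votes.values())
    return {
        failed: has_quorum(total - votes, total)
        for failed, votes in rack_votes.items()
    }


# Hypothetical layout 1: all three votes live in two racks.
two_racks = {"rack_a": 2, "rack_b": 1}

# Hypothetical layout 2: one vote per rack plus one vote somewhere else.
three_locations = {"rack_a": 1, "rack_b": 1, "elsewhere": 1}

print(survives_failure_of(two_racks))
print(survives_failure_of(three_locations))
```

In this model, whichever rack holds the majority of votes takes the other rack down with it, which matches what I am seeing. The only layout that survives losing either rack is one where neither rack holds a majority on its own, meaning the deciding vote has to live outside both racks.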
Would it make more sense to remove the 3rd node and add 2 qdevices, if adding 2 qdevices is even an option? The 3rd node is a leftover from running a 3-node Ceph cluster, and I really don't need it to stay if I can get resiliency with 2 nodes.
Again any feedback and/or advice is appreciated.
Al