Hi
@slalaure, welcome to the forum!
- With a properly configured 3-node Proxmox VE (PVE) cluster, you will achieve PVE HA (High Availability) protection for your virtual machines (VMs). In this context, a "node" refers to a member of the cluster, not necessarily a hypervisor node. Your Quorum VM should fulfill the requirement for cluster quorum.
However, it’s crucial to thoroughly test and fine-tune your setup to ensure orderly recovery during failover scenarios. For example, consider a situation where a client VM attempts to start before the iSCSI layer becomes fully available. Such scenarios can lead to unexpected behavior.
I recommend extensive testing to evaluate edge cases. For instance, if a software-managed active VM becomes unresponsive (hung), PVE HA will not detect the issue automatically. In such cases, the software layer would need to initiate a failover.
In summary, while PVE HA provides robust failover mechanisms, there are several possible scenarios to consider for achieving truly bulletproof high availability.
- You mentioned using iSCSI to serve your client VMs, so it seems unclear where NFS VM snapshots fit into the workflow. Perhaps I missed this in your description?
To my knowledge, PVE does not natively support storing snapshots in a different storage pool/type from the primary data storage. If you were referring to Fleecing volumes, you’ll want to ensure that the fleecing storage performs as well as, if not better than, your primary storage. Generally, NFS is not ideal as a fleecing target for primary iSCSI storage, but this ultimately depends on your specific performance requirements for the software layer.
I strongly encourage testing your desired configuration under conditions as close to production loads as possible to validate its reliability and performance.
Good luck!
Blockbridge : Ultra low latency all-NVME shared storage for Proxmox - https://www.blockbridge.com/proxmox