We are in the process of migrating from VMware to ProxMox. We have a Primary site and a Secondary site in case the 1st one fail.
To ensure proper design, we are questioning how we should setup the storage of the SANs (iSCSI) for proper failover redundancy. We did a lot of reading, but no definitive answer so far regarding ZFS and CEPH. For reference: https://pve.proxmox.com/pve-docs/pve-admin-guide.html?pubDate=20250308#chapter_storage
Here's a quick rundown of our infrastructure:
Primary site
Promox host 1
Storage SAN1 (connected via ISCSI)
About 60 virtual machines running on Proxmox 1.
Quorum server3 (for votes)
Secondary site (standby)
Proxmox host 2
Storage SAN2 (connected via ISCSI)
The sites are linked via Layer-2 network (10Gbps).
We have a lab in place to simulate the whole new Proxmox concept. HA is configured and working, but with some hiccups to what we want to achieve. Everything is going pretty well, but when we disconnect the Server1 to simulate an outage, the VM failover can't find its boot disk on the secondary server.
So the question I have is: how should we setup the 2 storage SANs to ensure redundancy when 1 fails?
Some suggestions we found so far:
1- Setup a single ZFS pool with the same name on both servers. Should I assume that the data will be safely distributed between the 2 SANs, and if 1 SAN fails, everything on the 2nd one will still be intact? Is it that simple?
2- Use CEPH instead of ZFS. But that seems to add a layer of complexity (or not?).
3- Configure 2 storage pool on each servers, targetting each SAN.
4- Opposite of point 3 > Configure each SAN to provide 1 storage pool per server.
In VMware, we have a replication every # of hours. If the Primary site fails, we simply boot the latest VM replication on the secondary host and we are good to go. Our lack of knowledge in ProxMox seems to block us from doing that so far. We don't mind losing a few hours of data in case of a crash (Primary site is pretty well protected, but we never know). Otherwise, we love the platform!
Thanks in advance for your help.
To ensure proper design, we are questioning how we should setup the storage of the SANs (iSCSI) for proper failover redundancy. We did a lot of reading, but no definitive answer so far regarding ZFS and CEPH. For reference: https://pve.proxmox.com/pve-docs/pve-admin-guide.html?pubDate=20250308#chapter_storage
Here's a quick rundown of our infrastructure:
Primary site
Promox host 1
Storage SAN1 (connected via ISCSI)
About 60 virtual machines running on Proxmox 1.
Quorum server3 (for votes)
Secondary site (standby)
Proxmox host 2
Storage SAN2 (connected via ISCSI)
The sites are linked via Layer-2 network (10Gbps).
We have a lab in place to simulate the whole new Proxmox concept. HA is configured and working, but with some hiccups to what we want to achieve. Everything is going pretty well, but when we disconnect the Server1 to simulate an outage, the VM failover can't find its boot disk on the secondary server.
So the question I have is: how should we setup the 2 storage SANs to ensure redundancy when 1 fails?
Some suggestions we found so far:
1- Setup a single ZFS pool with the same name on both servers. Should I assume that the data will be safely distributed between the 2 SANs, and if 1 SAN fails, everything on the 2nd one will still be intact? Is it that simple?
2- Use CEPH instead of ZFS. But that seems to add a layer of complexity (or not?).
3- Configure 2 storage pool on each servers, targetting each SAN.
4- Opposite of point 3 > Configure each SAN to provide 1 storage pool per server.
In VMware, we have a replication every # of hours. If the Primary site fails, we simply boot the latest VM replication on the secondary host and we are good to go. Our lack of knowledge in ProxMox seems to block us from doing that so far. We don't mind losing a few hours of data in case of a crash (Primary site is pretty well protected, but we never know). Otherwise, we love the platform!

Thanks in advance for your help.