I'm currently planning a new Proxmox installation for an organization with two sites in different countries. The sites are connected with a private backbone and thus for the means of disaster recovery I want to be able to fail over all services to one site and be able to bring them up with the other site remaining offline. If in such a disaster there is data loss of few minutes and the fail over will take some time (even a few days) and manual labor that is no problem. So true HA and thus HA storage is not required and zfs with pve-zsync or storage replication would be sufficient (though maybe we still will opt for ceph but that is irrelevant for this topic).
The question now is, is it better to have A) one Proxmox cluster with an equal amount of servers on each site or B) have two entirely separate Proxmox clusters on each site? And is there best practice/recommended approach?
Personally I'd like one cluster more since it is less maintenance and documentation. However I'm not sure how good a Proxmox cluster would function without quorum in the event of disaster and what pitfalls there might be. With no ture HA there also is no fencing which should make it easier but for example I remember in the earlier implementations of Proxmox 2FA you couldn't log in anymore into an Proxmox server which was part of cluster which lost quorum. This would be very relevant for this installation as well as SSH will be blocked. So is this still an issue and are there other known similar issues?
Thanks for any insights!
The question now is, is it better to have A) one Proxmox cluster with an equal amount of servers on each site or B) have two entirely separate Proxmox clusters on each site? And is there best practice/recommended approach?
Personally I'd like one cluster more since it is less maintenance and documentation. However I'm not sure how good a Proxmox cluster would function without quorum in the event of disaster and what pitfalls there might be. With no ture HA there also is no fencing which should make it easier but for example I remember in the earlier implementations of Proxmox 2FA you couldn't log in anymore into an Proxmox server which was part of cluster which lost quorum. This would be very relevant for this installation as well as SSH will be blocked. So is this still an issue and are there other known similar issues?
Thanks for any insights!