Hey everyone,
I've been thinking about multi-site architecture for Proxmox and wanted to run this idea by the community before spending time on it.
The Problem
Right now Proxmox doesn't have native multi-datacenter support. If you have clusters in multiple locations:
- You manage each one separately
- No unified view of all sites
- Manual VPN configs between sites
- No automatic failover if a whole site goes down
This makes Proxmox hard to use for:
- MSPs managing 10+ customer sites
- Companies with multiple datacenters
- Anyone wanting to build regional infrastructure
VMware has vCenter multi-site, OpenStack has regions, but both are expensive or complex.
My Idea (High-Level)
Add hierarchical levels on top of Proxmox clusters :
Global (optional)
|--- Region (like "EU-West")
|--- Site (NEW - multiple clusters in one datacenter)
|--- Cluster (Proxmox as-is, no changes)
Key points:
- Existing Proxmox clusters stay unchanged
- New orchestration layer sits on top
- Opt-in (you enable it if you need it)
Network Approach
Support 2 modes so it works for everyone :
Mode 1: Mesh VPN (smaller setups)
- Auto-configure with Netmaker or ZeroTier
- Good for MSPs without dedicated WAN
Mode 2: Enterprise WAN (bigger setups)
- Use existing MPLS/EVPN/VXLAN
- Good for enterprises with network teams
Multi-Mesh Isolation
Separate network meshes for different purposes :
- Production (isolated per customer)
- Dev/Test (separate from prod)
- Management (admin access)
With relay nodes controlling what can talk between meshes (useful for GDPR - keep EU/US data separate).
Disaster Recovery (Optional)
Make DR per-VM instead of all-or-nothing :
- Default: Local storage (1x cost) - most VMs
- Critical VMs : Enable cross-site replication (3x cost) - only what needs it
Saves money vs replicating everything.
Why Not Just Scripts?
Yes, you can do some of this with Ansible/Terraform today. But:
- Everyone reinvents the wheel
- No standardization
- Not integrated in Proxmox GUI
- Breaks when Proxmox updates
- No official support
Native integration would be way better - like how Ceph got integrated instead of everyone scripting storage.
What I'm Asking
Before I invest time developing this :
1. Is this interesting to Proxmox team ? Would you consider it for a future version ?
2. Is the approach sound ? Any major problems you see ?
3. Should it be :
- Core contribution (if you want it native)
- External add-on (if not aligned with your vision)
- Just documentation/architecture (for others to build)
Why This Matters Now
VMware prices went up 300-500% under Broadcom. Companies are looking for alternatives. Proxmox is great but multi-site limitation is a dealbreaker for many.
This could open up the MSP and multi-datacenter enterprise market for Proxmox.
Technical Sketch (If Interested)
Would use:
- Network: Netmaker/ZeroTier or MPLS/EVPN integration
- Storage: Ceph multi-site (already exists)
- Orchestration: etcd + Consul
- Estimated: ~3000 lines Perl + ~1000 lines JS
- API: New /api2/json/site/* endpoints
I have more detailed architecture docs if anyone wants to see them.
What Do You Think ?
Is this something Proxmox would want? Does the community need this?
I'm happy to work on it either way, just want to make sure I'm going in a useful direction.
Thanks for reading !
I've been thinking about multi-site architecture for Proxmox and wanted to run this idea by the community before spending time on it.
The Problem
Right now Proxmox doesn't have native multi-datacenter support. If you have clusters in multiple locations:
- You manage each one separately
- No unified view of all sites
- Manual VPN configs between sites
- No automatic failover if a whole site goes down
This makes Proxmox hard to use for:
- MSPs managing 10+ customer sites
- Companies with multiple datacenters
- Anyone wanting to build regional infrastructure
VMware has vCenter multi-site, OpenStack has regions, but both are expensive or complex.
My Idea (High-Level)
Add hierarchical levels on top of Proxmox clusters :
Global (optional)
|--- Region (like "EU-West")
|--- Site (NEW - multiple clusters in one datacenter)
|--- Cluster (Proxmox as-is, no changes)
Key points:
- Existing Proxmox clusters stay unchanged
- New orchestration layer sits on top
- Opt-in (you enable it if you need it)
Network Approach
Support 2 modes so it works for everyone :
Mode 1: Mesh VPN (smaller setups)
- Auto-configure with Netmaker or ZeroTier
- Good for MSPs without dedicated WAN
Mode 2: Enterprise WAN (bigger setups)
- Use existing MPLS/EVPN/VXLAN
- Good for enterprises with network teams
Multi-Mesh Isolation
Separate network meshes for different purposes :
- Production (isolated per customer)
- Dev/Test (separate from prod)
- Management (admin access)
With relay nodes controlling what can talk between meshes (useful for GDPR - keep EU/US data separate).
Disaster Recovery (Optional)
Make DR per-VM instead of all-or-nothing :
- Default: Local storage (1x cost) - most VMs
- Critical VMs : Enable cross-site replication (3x cost) - only what needs it
Saves money vs replicating everything.
Why Not Just Scripts?
Yes, you can do some of this with Ansible/Terraform today. But:
- Everyone reinvents the wheel
- No standardization
- Not integrated in Proxmox GUI
- Breaks when Proxmox updates
- No official support
Native integration would be way better - like how Ceph got integrated instead of everyone scripting storage.
What I'm Asking
Before I invest time developing this :
1. Is this interesting to Proxmox team ? Would you consider it for a future version ?
2. Is the approach sound ? Any major problems you see ?
3. Should it be :
- Core contribution (if you want it native)
- External add-on (if not aligned with your vision)
- Just documentation/architecture (for others to build)
Why This Matters Now
VMware prices went up 300-500% under Broadcom. Companies are looking for alternatives. Proxmox is great but multi-site limitation is a dealbreaker for many.
This could open up the MSP and multi-datacenter enterprise market for Proxmox.
Technical Sketch (If Interested)
Would use:
- Network: Netmaker/ZeroTier or MPLS/EVPN integration
- Storage: Ceph multi-site (already exists)
- Orchestration: etcd + Consul
- Estimated: ~3000 lines Perl + ~1000 lines JS
- API: New /api2/json/site/* endpoints
I have more detailed architecture docs if anyone wants to see them.
What Do You Think ?
Is this something Proxmox would want? Does the community need this?
I'm happy to work on it either way, just want to make sure I'm going in a useful direction.
Thanks for reading !