Hello everyone,
I am planning to design a production Proxmox VE cluster and would like to get guidance on appropriate hardware sizing and best practices for the following expected workload:
- vCPU requirement: ~1500 vCPUs
- Memory requirement: ~3.5 TB RAM (3500 GB)
- Storage requirement: ~180 TB usable capacity
I would appreciate recommendations on:
- Minimum and recommended number of nodes for a stable cluster
- CPU sizing per node (Intel/AMD generation, core count, overcommit guidance)
- Memory distribution strategy across nodes
- Storage design (Ceph vs ZFS vs external SAN) to support ~180 TB usable capacity
- Network requirements (10/25/40/100 GbE considerations)
- High availability and failure domain best practices
- Any known limitations or design pitfalls for this scale in Proxmox VE
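To make the sizing question concrete, here is a rough back-of-the-envelope sketch of the arithmetic involved. Every parameter below is an assumption chosen for illustration, not a recommendation: a 4:1 vCPU-to-core overcommit ratio, one spare node (N+1), no memory overcommit, and, if Ceph were chosen, 3x replication with pools kept at or below 80% full. The function name `size_cluster` and its parameters are hypothetical, and the result ignores host and storage daemon overhead.

```python
import math


def size_cluster(vcpus, ram_gb, usable_tb, nodes,
                 overcommit=4.0, spare_nodes=1, replicas=3, max_fill=0.8):
    """Estimate per-node resources for a given node count.

    All defaults are illustrative assumptions, not Proxmox VE guidance.
    """
    # Nodes that must carry the full workload when the spares are down.
    active = nodes - spare_nodes
    if active < 1:
        raise ValueError("need at least one node beyond the spares")

    # CPU cores needed per surviving node at the assumed overcommit ratio.
    cores_per_node = math.ceil(vcpus / overcommit / active)

    # RAM per surviving node, assuming memory is not overcommitted.
    ram_per_node_gb = math.ceil(ram_gb / active)

    # Raw capacity for replicated storage, leaving headroom below max_fill.
    raw_tb = round(usable_tb * replicas / max_fill, 1)

    return {
        "cores_per_node": cores_per_node,
        "ram_per_node_gb": ram_per_node_gb,
        "raw_storage_tb": raw_tb,
        "raw_storage_per_node_tb": round(raw_tb / nodes, 1),
    }


if __name__ == "__main__":
    # Compare a few candidate cluster sizes for the workload above.
    for n in (6, 8, 12, 16):
        print(n, size_cluster(1500, 3500, 180, nodes=n))
```

Changing the overcommit ratio, the number of spare nodes, or the replication factor shifts the per-node numbers considerably, which is why guidance on those values is the main thing I am looking for.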
The goal is to ensure a highly available, scalable, and production-grade design with room for future growth.
Any reference architectures or real-world examples would be greatly appreciated.
Thank you in advance for your support.