Hi,
We are currently running an aging VMware vSAN environment on Dell VxRail and are evaluating Proxmox VE as a migration path.
We have already completed a nested PVE test deployment on our existing vSAN environment. This allowed us to validate the VMware-to-PVE migration process using our existing Veeam license, as well as validate use of the PVE API for VM automation. This was important for us because our Dev/QA team currently uses VMware PowerCLI with Jenkins automation.
So far, the testing has gone well and we believe Proxmox VE could be a strong replacement path for our VxRail environment.
Our main dilemma now is selecting appropriate new server hardware for a 3-node or 4-node PVE cluster using Ceph for hyperconverged storage.
We are currently considering Dell, Lenovo, and Supermicro, including models such as the following, but we are certainly open to other suggestions that may be more cost-effective:
The tentative design is:
Specific questions:
Thanks,
Marc
We are currently running an aging VMware vSAN environment on Dell VxRail and are evaluating Proxmox VE as a migration path.
We have already completed a nested PVE test deployment on our existing vSAN environment. This allowed us to validate the VMware-to-PVE migration process using our existing Veeam license, as well as validate use of the PVE API for VM automation. This was important for us because our Dev/QA team currently uses VMware PowerCLI with Jenkins automation.
So far, the testing has gone well and we believe Proxmox VE could be a strong replacement path for our VxRail environment.
Our main dilemma now is selecting appropriate new server hardware for a 3-node or 4-node PVE cluster using Ceph for hyperconverged storage.
We are currently considering Dell, Lenovo, and Supermicro, including models such as the following, but we are certainly open to other suggestions that may be more cost-effective:
- Dell PowerEdge R770
- Lenovo ThinkSystem SR650 V4
- Supermicro SYS-222H-TN
The tentative design is:
- 3 or 4 nodes
- NVMe storage for Ceph OSDs, likely in the range of 5–8 enterprise NVMe drives per node
- Redundant 25G network for Ceph
- Redundant 10G network for VM/public traffic
- Separate redundant network for Corosync/cluster communication
- Intel CPUs preferred since VxRail's on Intel
- Intel NICs preferred unless there is strong reason to choose otherwise
Specific questions:
- Are there particular server models that have proven reliable for PVE/Ceph deployments?
- Are the newer Dell R770, Lenovo SR650 V4, or Supermicro SYS-222H-TN reasonable choices, or would you prefer the previous generation for maturity?
- Are there any specific NICs, HBAs, NVMe backplanes, BIOS settings, or firmware considerations we should watch for?
- Do any Proxmox partners or experienced users maintain informal lists of commonly deployed server models for PVE/Ceph clusters?
- Are we overdesigning this? If so, what specs would you consider more than sufficient for a reliable PVE/Ceph production cluster?
Thanks,
Marc