I’ve set up and admin a 5 node Proxmox cluster for my group at work. I’m not an enterprise IT pro, I’m a software developer that happens to be the “IT guy” for my group. We’re about to move labs, and my boss suggested we upgrade the VM farm while we’re at it. That’s perfect timing because I’ve been wanting to take what I learned from setting up that first VM farm and make a better one without being disruptive to our current workflow. This lab move affords me the perfect opportunity to do so.
One of the things I don’t like about our current farm is how I setup the cluster storage to facilitate fast live migration of virtual machines. The way I have it set up is that I have the VMs stored on an NFS server and it is connected to the cluster over a 10GigE network. This configuration made live migration work great. However, I found that the disk I/O performance of having the VMs live on an NFS fileserver left a lot to be desired. Especially for Windows VMs. Sometimes they were borderline unusable.
Then I saw a Craft Computing video on Youtube talking about Proxmox’s replication features. So I installed a single 8TB SSD in each of the Proxmox nodes as a test, set up a new cluster storage using those SSDs, created a Windows VM on that cluster storage, and then setup replication for that VM across all of the nodes of the cluster. I was actually quite pleased with the results. Live migration was still speedy and I guess the disk I/O performance of the VM was as close to bare metal as I can get. The only downside is that replication would sometimes mysteriously fail, and I haven’t quite figured out the reason yet. I am using an older version of Proxmox (version 7.1), so it might be a bug that has since been fixed.
Even though setting up VM replication is a bit more complicated than simply having the VMs live on a central fileserver, the performance increase makes it worth it. So I was dead set on having just local SSD storage for the next VM farm I get to build. But then I learned about 25GigE networking and the hardware for setting that up isn’t really all that much more expensive than 10GigE.
Now I’m wondering … would a 25GigE network give me the performance I want? Would it be similar enough to local SSD performance that it would make the complications introduced by the replication no longer worth it? What other storage options do I have?
One of the things I don’t like about our current farm is how I setup the cluster storage to facilitate fast live migration of virtual machines. The way I have it set up is that I have the VMs stored on an NFS server and it is connected to the cluster over a 10GigE network. This configuration made live migration work great. However, I found that the disk I/O performance of having the VMs live on an NFS fileserver left a lot to be desired. Especially for Windows VMs. Sometimes they were borderline unusable.
Then I saw a Craft Computing video on Youtube talking about Proxmox’s replication features. So I installed a single 8TB SSD in each of the Proxmox nodes as a test, set up a new cluster storage using those SSDs, created a Windows VM on that cluster storage, and then setup replication for that VM across all of the nodes of the cluster. I was actually quite pleased with the results. Live migration was still speedy and I guess the disk I/O performance of the VM was as close to bare metal as I can get. The only downside is that replication would sometimes mysteriously fail, and I haven’t quite figured out the reason yet. I am using an older version of Proxmox (version 7.1), so it might be a bug that has since been fixed.
Even though setting up VM replication is a bit more complicated than simply having the VMs live on a central fileserver, the performance increase makes it worth it. So I was dead set on having just local SSD storage for the next VM farm I get to build. But then I learned about 25GigE networking and the hardware for setting that up isn’t really all that much more expensive than 10GigE.
Now I’m wondering … would a 25GigE network give me the performance I want? Would it be similar enough to local SSD performance that it would make the complications introduced by the replication no longer worth it? What other storage options do I have?
Last edited: