We have 4 Proxmox host servers and two different NFS servers for VM storage. We usually we manually distribute the VM storage load between them.
The two NFS storage servers are all flash and have 10 gig connections to our ProxMox hosts.
A few times a year we need to update the firmware on our NFS storage back ends. In order to not have to shut down any of the VMs our process looks like this:
Basically we have to migrate all the VMs from one NFS storage node to the other by moving the Qcow file.
So move everything off NFS1 so all the VMs are running off NFS2.
Update the OS on NFS1.
Then migrate ALL the VM qcows to NFS1.
Then update the software on NFS2.
Then by hand, manually balance the storage load on both servers by moving some of the qcows BACK to NFS2.
Then repeat this process 4-6 months later.
The process itself is fine. I can’t imagine that we are the only end user doing this sort of thing. I just wanted to share some ideas that I think could make this process a little easier on the end users.
Right now I have to click on EVERY VM and see where it’s storage is. Hi-light the disk, then click to move button, deal with the dialogs, then move on to the next VM. It’s a lot of repetitive tasks when you’re dealing with over 50 VMs.
So here’s what I’m thinking. When I click on a storage node, then on “Content” I can see a list of all the VMs on that particular storage node. The best solution would be if I could actually DO something with that information. I’d love to have a button that says“Migrate All Storage to another storage device” Maybe even radio buttons where I could check off the VMs that I want to mass move to another storage node.
This would act like how when looking at a VM host you have a Migrate All VMs” to another host button. This would be an ENORMOUS time saver for end users like myself.
If that’s not possible, another issue we run into is moving a disks the way we do now. So we can only click the “Move Disk” button on a particular host for about 3 VMs at a time. Anything more than that, I get the following error:
“create full clone of drive scsi0 (San0-KVM-Storage:127/vm-127-disk-1.qcow2)
TASK ERROR: storage migration failed: error with cfs lock 'storage-SAN1-KVM-Storage': unable to create image: got lock timeout - aborting command”
So to deal with this, I need to move onto another host and move three of IT’S VMs. Then the next host etc etc. At some point, I can go back to the first host and move 3 more VMs disks. But hopefully I remember, or have written down which VM I left off at.
It would be nice if the system would just put them in a queue. With a 10 gig connection, and all Flash storage, I’m not sure why it can only handle 3 tasks at a time, but that’s fine. It would just be nice if it would put it in a queue so I could set this stuff to move, and then go home for the night. As it sits right now, I have to babysit this process for a few hours.
Anyways, just some thoughts on how to make ProxMox even more awesome!
The two NFS storage servers are all flash and have 10 gig connections to our ProxMox hosts.
A few times a year we need to update the firmware on our NFS storage back ends. In order to not have to shut down any of the VMs our process looks like this:
Basically we have to migrate all the VMs from one NFS storage node to the other by moving the Qcow file.
So move everything off NFS1 so all the VMs are running off NFS2.
Update the OS on NFS1.
Then migrate ALL the VM qcows to NFS1.
Then update the software on NFS2.
Then by hand, manually balance the storage load on both servers by moving some of the qcows BACK to NFS2.
Then repeat this process 4-6 months later.
The process itself is fine. I can’t imagine that we are the only end user doing this sort of thing. I just wanted to share some ideas that I think could make this process a little easier on the end users.
Right now I have to click on EVERY VM and see where it’s storage is. Hi-light the disk, then click to move button, deal with the dialogs, then move on to the next VM. It’s a lot of repetitive tasks when you’re dealing with over 50 VMs.
So here’s what I’m thinking. When I click on a storage node, then on “Content” I can see a list of all the VMs on that particular storage node. The best solution would be if I could actually DO something with that information. I’d love to have a button that says“Migrate All Storage to another storage device” Maybe even radio buttons where I could check off the VMs that I want to mass move to another storage node.
This would act like how when looking at a VM host you have a Migrate All VMs” to another host button. This would be an ENORMOUS time saver for end users like myself.
If that’s not possible, another issue we run into is moving a disks the way we do now. So we can only click the “Move Disk” button on a particular host for about 3 VMs at a time. Anything more than that, I get the following error:
“create full clone of drive scsi0 (San0-KVM-Storage:127/vm-127-disk-1.qcow2)
TASK ERROR: storage migration failed: error with cfs lock 'storage-SAN1-KVM-Storage': unable to create image: got lock timeout - aborting command”
So to deal with this, I need to move onto another host and move three of IT’S VMs. Then the next host etc etc. At some point, I can go back to the first host and move 3 more VMs disks. But hopefully I remember, or have written down which VM I left off at.
It would be nice if the system would just put them in a queue. With a 10 gig connection, and all Flash storage, I’m not sure why it can only handle 3 tasks at a time, but that’s fine. It would just be nice if it would put it in a queue so I could set this stuff to move, and then go home for the night. As it sits right now, I have to babysit this process for a few hours.
Anyways, just some thoughts on how to make ProxMox even more awesome!