I am POCing Proxmox to possibly replace VMware in our enterprise. I built a three-node cluster, and installed multipath tools to present my Pure Storage LUNs to the cluster. I followed the guide here to get it working properly. Everything was working fine, and I was testing VM migrations between the two shared volumes I created, and the three hosts.
I then decided to test scenarios like a host crashing while running a VM. While the VM was running, I simply hit "Reboot" from the Proxmox UI (which I now realize isn't a great test, since it sent a shutdown command to the VM). Either way, once the host came back up, now the shared LVM that the VM was running on has a question mark next to it in the UI. The VM cannot start because of the following error:
I can recover the VM by migrating it to another host, then powering it on there. However, the storage remains down on the original host.
If I SSH into the host, and run
I noticed there is a lock file for the VM in
I tried completely uninstalling multipath-tools and reinstalling, but the same situation exists.
I'm at a loss, and am hoping I'm overlooking something simple, otherwise I'm concluding that Proxmox may not be a good choice for Mutlipath FC SAN.
I then decided to test scenarios like a host crashing while running a VM. While the VM was running, I simply hit "Reboot" from the Proxmox UI (which I now realize isn't a great test, since it sent a shutdown command to the VM). Either way, once the host came back up, now the shared LVM that the VM was running on has a question mark next to it in the UI. The VM cannot start because of the following error:
TASK ERROR: can't activate LV '/dev/shared_s001/vm-100-disk-0'
I can recover the VM by migrating it to another host, then powering it on there. However, the storage remains down on the original host.
If I SSH into the host, and run
kpartx -a /dev/mapper/mpatha, it fails saying failed to stat() /dev/mapper/mpatha.I noticed there is a lock file for the VM in
/var/lock/qemu-server; however, I tried running qm unlock 100 and even eventually deleting the empty file lock-100.conf with no improvement.I tried completely uninstalling multipath-tools and reinstalling, but the same situation exists.
I'm at a loss, and am hoping I'm overlooking something simple, otherwise I'm concluding that Proxmox may not be a good choice for Mutlipath FC SAN.