I have several virtual machines set up in high availability. I use Ceph as shared storage for the VM disks. I have an NFS storage mounted to run live backups of all the VMs every night.
Last night the server that hosts the NFS share rebooted while a backup was running. The cluster node hosting the VM that was being backed up also rebooted, presumably because it became unstable due to the NFS communication failure.
After the reboot, the VM in HA did not start. The error shown was:
VM is locked (backup)
I first had to remove the VM from the HA menu, and then run the following command on the cluster node hosting this VM:
qm unlock ID
Any idea why the node rebooted when the NFS communication failed during a backup? How can I fix it or work around it?
Any idea why the VM did not start automatically (it is in HA), and why I had to unlock it manually? How can I automate this?
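To make the automation question concrete, here is a rough sketch of the kind of check I have in mind. It assumes that `qm config ID` prints a `lock: backup` line while the lock is set, and the function names are my own placeholders. It would only be safe to run when no backup job is really in progress for that VM.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, not part of Proxmox VE.

# Reads `qm config <vmid>` output on stdin.
# Succeeds (exit 0) if the config contains a backup lock line.
has_backup_lock() {
    grep -q '^lock: backup$'
}

# Clears a backup lock on the given VM ID and does nothing otherwise.
# Assumption: the caller has already verified that no backup is running.
unlock_if_backup_locked() {
    vmid="$1"
    if qm config "$vmid" | has_backup_lock; then
        qm unlock "$vmid"
    fi
}
```

It could be called as `unlock_if_backup_locked ID` on the node hosting the VM, for example from a script run after boot, but I do not know whether that is a sensible approach.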
Thanks in advance.
Gonzalo.