I tried to do a live migration of a VM a couple of hours ago and it failed (full output attached).
It was a live migration from a Proxmox node named vm6 to a node named vm4. The GUI shows VM 104 in the list for vm4, but displays the error "no such VM ('104') (500)" when I click on it. VM 104 no longer shows in the GUI on the original node (vm6). However, its config file was still under vm6 in /etc/pve/, so I tried moving the file manually (the `mv` command below).
The move has been running for 3 hours and still has not finished. `qm status` shows the same error (output below).
The cluster has quorum: there are 6 nodes in the cluster, 5 of them are running right now, and quorum requires 4.
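For reference, the figure of 4 comes from a strict majority of the votes. This is a minimal sketch of that arithmetic, assuming one vote per node; the `quorum` helper name is mine and is not a Proxmox command.

```shell
# Strict-majority quorum, assuming every node carries exactly one vote.
# "quorum" is a hypothetical helper for illustration only.
quorum() {
    total=$1
    # Integer division: more than half of the total votes.
    echo $(( total / 2 + 1 ))
}

quorum 6
```

With 5 of the 6 nodes up, the cluster should be above that threshold. `pvecm status` on any node reports the actual expected votes and quorum values.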
I cannot interrupt the manual `mv` I kicked off; even `kill -9` is ignored. The VM I tried to move is not running and cannot be started (the error is "no such VM").
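A process that ignores `kill -9` is usually in uninterruptible sleep (state `D` in `ps`), waiting on I/O inside the kernel. I am assuming that is what is happening here, since /etc/pve is a FUSE mount, but I have not confirmed it. A small sketch for reading the state letter; `describe_state` is a hypothetical helper, not part of any tool.

```shell
# Map the first letter of a ps(1) STAT value to a short description.
# "describe_state" is a hypothetical helper for illustration only.
describe_state() {
    case "$1" in
        D*) echo "uninterruptible sleep" ;;
        Z*) echo "zombie" ;;
        T*) echo "stopped" ;;
        R*) echo "running" ;;
        S*) echo "interruptible sleep" ;;
        *)  echo "unknown" ;;
    esac
}

# A STAT value such as "D+" means signals stay pending until the I/O returns.
describe_state "D+"
```

To check the real process, the STAT value can be read with `ps -o stat= -p <PID>`, where `<PID>` is the PID of the stuck `mv`, and passed to the helper.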
EDIT: one thing I forgot to mention is that I had a large (~1TB) VM disk image from a different VM being moved during the time I tried to live migrate the above VM.
I guess the question at this point is: how do I recover?
Thanks,
Omen
Code:
ERROR: failed to clear migrate lock: no such VM ('104')
Code:
root@vm4:~# mv /etc/pve/nodes/vm6/qemu-server/104.conf /etc/pve/nodes/vm4/qemu-server/104.conf
Code:
root@vm4:~# qm status 104
no such VM ('104')