Failed reboot - VM stuck frozen (HA)

Blaz Repas

New Member
Jan 13, 2018
2
0
1
31
Hello!

I have a 7 node cluster and a software/lrm HA. I have just done maintenance on one of the nodes and have rebooted it. Due to some hardware failure it failed to boot. The VMs that were on that node are in a frozen (HA) state and I can not migrate or start them.

Is there a way to convince the HA to consider the rebooted node as failed and to migrate the VMs to running nodes - basically to unfreeze the frozen VMs?

I am running:
Kernel Version
Linux 4.13.13-4-pve #1 SMP PVE 4.13.13-35 (Mon, 8 Jan 2018 10:26:58 +0100)

PVE Manager Version
pve-manager/5.1-42/724a6cb3

Any help is appreciated...
 
Thank you for the reply.
However, the VM is not in an error state, but rather in a frozen state. I have already tried putting it into disabled, started, stopped states but it still remains in the frozen state.
 
However, the VM is not in an error state, but rather in a frozen state. I have already tried putting it into disabled, started, stopped states but it still remains in the frozen state.

Can you try to remove and then re-add it to HA again?
 
Hello , I'm facing the same issue ... I have 2-node HA cluster and Prox-backup server as 3rd node with qnetd service installed. The cluster is quorate (3 votes) and everything seemed to work fine (prox 7.1.5). During playing with HA features cluster somehow got into frozen state ... both nodes idle and CT got in frozen state. CT state could not be changed like above and remove-attempt (like suggested) got stuck in "deleting" state forever. Restarting both nodes did not changed "deleting" state and next try removing the same CT resulted with error-mess "cannot delete service : not HA managed! BUT, the service(CT) has been actually freed/unlocked and fro that point on I could manipulate it (start/stop/migrate/...) ... BUT I cannot put it in HA anymore .. not listed ....
Thank you in advance for you support ...

best regards
Tonci
 
Screenshot_20211120_223533.png
This is the point I got stuck ... CT:101 is available out of the HA but cannot be put back.

If I want to put another VM in HA it's state becmoes "queing" forever ...
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!