Catastrophic failure of entire server environment, fixed but get HA state error

Red Squirrel

Renowned Member
May 31, 2014
46
9
73
Had a catastrophic failure of my entire server rack, I don't know why but the batteries didn't last during a short outage and I still need to investigate this but long story short the whole rack dropped hard.

I managed to recover the NAS and I can see all the LUNs and the data etc. But in PVE all my VMs show HA state: error and it won't let me do anything. The VMs that don't have HA are able to start fine, so I know all the LUN mappings are working now.

Is there a way to clear this? I suspect the error is because at first things were not mapping correctly, as I had a hell of a time getting NFS back up.

I was able to do it manually one by one on each VM and got them running but surely there is a way to automate this? As it kinda defeats the purpose of HA if you need to manually intervene.
 
Last edited: