Hi!
For example, unfortunately this week we had a disaster: everything was lost. We still don't know why, but we lost a host (OS failure) and all the VMs, yes, all of them.
Do you understand that a VM on a cluster node is just a configuration file?
This configuration file allows the node to allocate the necessary resources for the VM.
And in your case the disks for all VMs are stored on the SAN.
If you are using HA and have lost 2 of 3 nodes, the cluster will not work, because the one remaining node no longer has quorum.
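To make the 2-of-3 case concrete, here is a small sketch of the majority rule. It assumes every node has exactly one vote (the default); the function names are just for illustration and are not part of any Proxmox tool.

```python
def votes_needed(total_votes: int) -> int:
    """Smallest number of votes that is a strict majority of total_votes."""
    return total_votes // 2 + 1


def has_quorum(total_votes: int, online_votes: int) -> bool:
    """True if the nodes that are still online hold a majority of the votes."""
    return online_votes >= votes_needed(total_votes)


if __name__ == "__main__":
    # Assumption: 3 nodes, one vote each.
    for online in (3, 2, 1):
        print(online, "node(s) online, quorum:", has_quorum(3, online))
```

With 3 votes the majority is 2, so losing one node is survivable and losing two is not.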
So the question is: what do you do in this case?
You can find the configuration files for all VMs from the lost nodes on the remaining node, in the
/etc/pve/nodes/<nodename>/qemu-server/ directory.
You can use them to start those VMs on the remaining node by moving the config files into the same directory for the remaining node, since the disks of all the lost VMs are still available on the SAN.
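The move described above could be scripted roughly like this. It is only a sketch: the node names `pve1` and `pve2` and the function `adopt_vm_configs` are made up for the example, and it assumes the VM config files are named `<vmid>.conf`. Keep in mind that /etc/pve is read-only on a node without quorum, so on a single surviving node this will probably only work after quorum is restored (for example with `pvecm expected 1`).

```python
import shutil
from pathlib import Path
from typing import List


def adopt_vm_configs(lost_node: str, remaining_node: str,
                     pve_root: Path = Path("/etc/pve")) -> List[str]:
    """Move all VM config files from a lost node's directory to the remaining node's."""
    src = pve_root / "nodes" / lost_node / "qemu-server"
    dst = pve_root / "nodes" / remaining_node / "qemu-server"

    configs = sorted(src.glob("*.conf"))

    # Check everything first, so we never stop halfway through a move.
    for conf in configs:
        if (dst / conf.name).exists():
            raise FileExistsError(f"{conf.name} already exists in {dst}")

    moved = []
    for conf in configs:
        shutil.move(str(conf), str(dst / conf.name))
        moved.append(conf.name)
    return moved


if __name__ == "__main__":
    # Hypothetical names: pve2 is the lost node, pve1 is the one still running.
    print(adopt_vm_configs("pve2", "pve1"))
```

The `pve_root` parameter is there only so the function can be tried on a scratch directory before touching the real /etc/pve.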
If the lost nodes are not destroyed and can rejoin the cluster, simply bring them back.
Then migrate the VMs you need onto the returned nodes.
If the lost nodes cannot simply be brought back (for example, because the OS was lost),
then you must first remove them from the cluster (this is required! Read the documentation on removing a node)
and then re-create them. Installing Proxmox VE on the returned nodes takes 15 to 20 minutes (Debian install method).
Best regards,
Gosha