I have a cluster running proxmox 8.4.1 using ZFS replication.
I have manually migrated many times, and I have tested a few times failing over by turning off nodes.
After power loss, I noticed my docker VM wasn't up. I looked at proxmox and all VMs failed over fine except for my larger docker VM. Even after bringing pve1 back online, it wasn't failed over. I noticed it was stuck on pve2 trying to migrate, but kept failing with "zfs failure" saying the vm disk did not exist in rpool.
I recovered by shutting down Pve2, and everything came back up on pve1 and then I brought pve2 back online.
Since then, I have tried manually migrating vms without issues, including 102 which was the one that had problems failing over.
I have manually migrated many times, and I have tested a few times failing over by turning off nodes.
After power loss, I noticed my docker VM wasn't up. I looked at proxmox and all VMs failed over fine except for my larger docker VM. Even after bringing pve1 back online, it wasn't failed over. I noticed it was stuck on pve2 trying to migrate, but kept failing with "zfs failure" saying the vm disk did not exist in rpool.
I recovered by shutting down Pve2, and everything came back up on pve1 and then I brought pve2 back online.
Since then, I have tried manually migrating vms without issues, including 102 which was the one that had problems failing over.