Hi,
i have a 2+1 node (2 nodes + 1 qdevice) cluster with linstor drbd as shared storage. One node is running on version 8.4.14, the other one on version 9.0.15 (freshly updated).
When i now live migrate all vms to the newer node (to upgrade the node with version 8) the migration fails for vms with an efi disk.
The error messages i get are:
Job: VM 104 - Migrate
Job: VM 104 - Start
When I remove the efidisk from the vm live migration works flawless. The Problem is that I have clusters with more than 50 vms running. So shutting all the vms down to remove the efidisk is not an option.
i have a 2+1 node (2 nodes + 1 qdevice) cluster with linstor drbd as shared storage. One node is running on version 8.4.14, the other one on version 9.0.15 (freshly updated).
When i now live migrate all vms to the newer node (to upgrade the node with version 8) the migration fails for vms with an efi disk.
The error messages i get are:
Job: VM 104 - Migrate
Code:
task started by HA resource agent
2025-11-17 16:29:25 use dedicated network address for sending migration traffic (10.255.240.59)
2025-11-17 16:29:25 starting migration of VM 104 to node 'regis' (10.255.240.59)
2025-11-17 16:29:26 starting VM 104 on remote node 'regis'
2025-11-17 16:29:27 [regis] Plugin "PVE::Storage::Custom::LINSTORPlugin" is implementing an older storage API, an upgrade is recommended
2025-11-17 16:29:30 [regis] close (rename) atomic file '/etc/pve/nodes/regis/qemu-server/104.conf' failed: File exists
2025-11-17 16:29:30 ERROR: online migrate failure - remote command failed with exit code 255
2025-11-17 16:29:30 aborting phase 2 - cleanup resources
2025-11-17 16:29:30 migrate_cancel
2025-11-17 16:29:32 ERROR: migration finished with problems (duration 00:00:07)
TASK ERROR: migration problems
Job: VM 104 - Start
Code:
efidisk0: enrolling Microsoft UEFI CA 2023
INFO: reading raw edk2 varstore from /var/run/qemu-server/qsd-104-efidisk0-enroll.fuse
INFO: var store range: 0x64 -> 0x40000
INFO: add db cert /usr/lib/python3/dist-packages/virt/firmware/certs/MicrosoftCorporationUEFICA2011.pem
INFO: certificate already present, skipping
INFO: add db cert /usr/lib/python3/dist-packages/virt/firmware/certs/MicrosoftUEFICA2023.pem
INFO: certificate already present, skipping
INFO: writing raw edk2 varstore to /var/run/qemu-server/qsd-104-efidisk0-enroll.fuse
TASK ERROR: close (rename) atomic file '/etc/pve/nodes/regis/qemu-server/104.conf' failed: File exists
When I remove the efidisk from the vm live migration works flawless. The Problem is that I have clusters with more than 50 vms running. So shutting all the vms down to remove the efidisk is not an option.