We use the latest and greatest proxmox version 8.x in a 3 node cluster. All VMs are using zfs volumes for their disks and the volumes are replicated to every node of the cluster.
While migrating a vm from node 2 to node 3 of the cluster the following shows up in the syslog of node 2:
2024-11-05T11:32:53.132057+01:00 devmox2 pve-ha-crm[1622]: got crm command: migrate vm:119 devmox3
2024-11-05T11:32:53.132248+01:00 devmox2 pve-ha-crm[1622]: migrate service 'vm:119' to node 'devmox3'
2024-11-05T11:32:53.132402+01:00 devmox2 pve-ha-crm[1622]: service 'vm:119': state changed from 'started' to 'migrate' (node = devmox2, target = devmox3)
2024-11-05T11:33:00.785824+01:00 devmox2 pve-ha-lrm[74157]: <root@pam> starting task UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:
2024-11-05T11:33:04.982222+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
2024-11-05T11:33:05.650625+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
2024-11-05T11:33:05.791871+01:00 devmox2 pve-ha-lrm[74157]: Task 'UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:' still active, waiting
2024-11-05T11:33:08.648264+01:00 devmox2 QEMU[10140]: kvm: Bitmap 'repl_efidisk0' is currently in use by another operation and cannot be used
2024-11-05T11:33:08.648430+01:00 devmox2 QEMU[10140]: kvm: Bitmap 'repl_scsi0' is currently in use by another operation and cannot be used
2024-11-05T11:33:10.793501+01:00 devmox2 pve-ha-lrm[74157]: Task 'UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:' still active, waiting
2024-11-05T11:33:11.882165+01:00 devmox2 kernel: [ 4772.598367] zd512: p1 p14 p15
2024-11-05T11:33:15.795538+01:00 devmox2 pve-ha-lrm[74157]: Task 'UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:' still active, waiting
2024-11-05T11:33:16.872309+01:00 devmox2 kernel: [ 4777.588533] tap119i0: left allmulticast mode
2024-11-05T11:33:16.872320+01:00 devmox2 kernel: [ 4777.588553] vmbr0: port 4(tap119i0) entered disabled state
2024-11-05T11:33:16.901160+01:00 devmox2 qmeventd[1106]: read: Connection reset by peer
2024-11-05T11:33:16.942072+01:00 devmox2 systemd[1]: 119.scope: Deactivated successfully.
2024-11-05T11:33:16.942218+01:00 devmox2 systemd[1]: 119.scope: Consumed 1min 23.161s CPU time.
2024-11-05T11:33:17.628059+01:00 devmox2 pve-ha-lrm[74159]: migration problems
2024-11-05T11:33:17.637204+01:00 devmox2 pve-ha-lrm[74157]: <root@pam> end task UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam: migration problems
2024-11-05T11:33:23.159431+01:00 devmox2 pve-ha-crm[1622]: service 'vm:119': state changed from 'migrate' to 'started' (node = devmox3)
2024-11-05T11:33:26.972986+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
2024-11-05T11:33:27.610960+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
The vm is migrated to node 3 but during the migration it is stoped and restarted on node 3.
Thanks for any hint for solving the issue.
While migrating a vm from node 2 to node 3 of the cluster the following shows up in the syslog of node 2:
2024-11-05T11:32:53.132057+01:00 devmox2 pve-ha-crm[1622]: got crm command: migrate vm:119 devmox3
2024-11-05T11:32:53.132248+01:00 devmox2 pve-ha-crm[1622]: migrate service 'vm:119' to node 'devmox3'
2024-11-05T11:32:53.132402+01:00 devmox2 pve-ha-crm[1622]: service 'vm:119': state changed from 'started' to 'migrate' (node = devmox2, target = devmox3)
2024-11-05T11:33:00.785824+01:00 devmox2 pve-ha-lrm[74157]: <root@pam> starting task UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:
2024-11-05T11:33:04.982222+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
2024-11-05T11:33:05.650625+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
2024-11-05T11:33:05.791871+01:00 devmox2 pve-ha-lrm[74157]: Task 'UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:' still active, waiting
2024-11-05T11:33:08.648264+01:00 devmox2 QEMU[10140]: kvm: Bitmap 'repl_efidisk0' is currently in use by another operation and cannot be used
2024-11-05T11:33:08.648430+01:00 devmox2 QEMU[10140]: kvm: Bitmap 'repl_scsi0' is currently in use by another operation and cannot be used
2024-11-05T11:33:10.793501+01:00 devmox2 pve-ha-lrm[74157]: Task 'UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:' still active, waiting
2024-11-05T11:33:11.882165+01:00 devmox2 kernel: [ 4772.598367] zd512: p1 p14 p15
2024-11-05T11:33:15.795538+01:00 devmox2 pve-ha-lrm[74157]: Task 'UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam:' still active, waiting
2024-11-05T11:33:16.872309+01:00 devmox2 kernel: [ 4777.588533] tap119i0: left allmulticast mode
2024-11-05T11:33:16.872320+01:00 devmox2 kernel: [ 4777.588553] vmbr0: port 4(tap119i0) entered disabled state
2024-11-05T11:33:16.901160+01:00 devmox2 qmeventd[1106]: read: Connection reset by peer
2024-11-05T11:33:16.942072+01:00 devmox2 systemd[1]: 119.scope: Deactivated successfully.
2024-11-05T11:33:16.942218+01:00 devmox2 systemd[1]: 119.scope: Consumed 1min 23.161s CPU time.
2024-11-05T11:33:17.628059+01:00 devmox2 pve-ha-lrm[74159]: migration problems
2024-11-05T11:33:17.637204+01:00 devmox2 pve-ha-lrm[74157]: <root@pam> end task UPID:devmox2:000121AF:00074429:6729F45C:qmigrate:119:root@pam: migration problems
2024-11-05T11:33:23.159431+01:00 devmox2 pve-ha-crm[1622]: service 'vm:119': state changed from 'migrate' to 'started' (node = devmox3)
2024-11-05T11:33:26.972986+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
2024-11-05T11:33:27.610960+01:00 devmox2 pmxcfs[1483]: [status] notice: received log
The vm is migrated to node 3 but during the migration it is stoped and restarted on node 3.
Thanks for any hint for solving the issue.