(I know too broad a question ... I'll try to narrow it down)
I'm often migrating vm's but sometimes it fails.
E.g. I migrate a vm from node 1 to node 2, and it failed after memory was transferred. (log below)
A second attempt between the same hosts worked OK.
node1: proxmox-ve: 6.2-1 (running kernel: 5.3.18-3-pve)
node 2: proxmox-ve: 6.2-1 (running kernel: 5.4.44-1-pve)
Both nodes use local storage on LVM. I found no relevant messages in syslog.
The worst thing is that after such a failure the vm is not running any more. The disks are cleaned up on the target node, and I can just start it again on node 1. But it would be nice if the VM was started again (or restored) on the source node automatically.
I'm often migrating vm's but sometimes it fails.
E.g. I migrate a vm from node 1 to node 2, and it failed after memory was transferred. (log below)
A second attempt between the same hosts worked OK.
node1: proxmox-ve: 6.2-1 (running kernel: 5.3.18-3-pve)
node 2: proxmox-ve: 6.2-1 (running kernel: 5.4.44-1-pve)
Both nodes use local storage on LVM. I found no relevant messages in syslog.
The worst thing is that after such a failure the vm is not running any more. The disks are cleaned up on the target node, and I can just start it again on node 1. But it would be nice if the VM was started again (or restored) on the source node automatically.
Code:
...
2020-07-13 15:26:10 migration xbzrle cachesize: 536870912 transferred 0 pages 0 cachemiss 113245 overflow 0
2020-07-13 15:26:10 migration status: active (transferred 4220021110, remaining 7905280), total 4312604672)
2020-07-13 15:26:10 migration xbzrle cachesize: 536870912 transferred 0 pages 0 cachemiss 116150 overflow 0
query migrate failed: VM 115 qmp command 'query-migrate' failed - client closed connection
2020-07-13 15:26:10 query migrate failed: VM 115 qmp command 'query-migrate' failed - client closed connection
query migrate failed: VM 115 not running
2020-07-13 15:26:11 query migrate failed: VM 115 not running
query migrate failed: VM 115 not running
2020-07-13 15:26:13 query migrate failed: VM 115 not running
query migrate failed: VM 115 not running
2020-07-13 15:26:14 query migrate failed: VM 115 not running
query migrate failed: VM 115 not running
2020-07-13 15:26:15 query migrate failed: VM 115 not running
query migrate failed: VM 115 not running
2020-07-13 15:26:16 query migrate failed: VM 115 not running
2020-07-13 15:26:16 ERROR: online migrate failure - too many query migrate failures - aborting
2020-07-13 15:26:16 aborting phase 2 - cleanup resources
2020-07-13 15:26:16 migrate_cancel
2020-07-13 15:26:16 migrate_cancel error: VM 115 not running
drive-scsi0: Cancelling block job
2020-07-13 15:26:16 ERROR: VM 115 not running
2020-07-13 15:26:19 ERROR: migration finished with problems (duration 00:04:24)
TASK ERROR: migration problems