Hello all.
I have a cluster with 4 nodes and shared storage. The storage is connected to the cluster by iscsi with multipathing. Every works correctly, but I found some problem after testing. After rebooting one of the clusters, I can't perform migration back the vm's.
The error that I get:
After 3 or more rebooting of the server, the error was gone. But if I reboot one of the servers from the cluster, the error occurs again.
Maybe someone is familiar with the problem, and can suggest something?
Thank you!
Aleksei
I have a cluster with 4 nodes and shared storage. The storage is connected to the cluster by iscsi with multipathing. Every works correctly, but I found some problem after testing. After rebooting one of the clusters, I can't perform migration back the vm's.
The error that I get:
Task viewer: VM 101 - Migrate
OutputStatus
Stop
task started by HA resource agent
2019-03-24 12:07:59 starting migration of VM 101 to node 'PVE003' (10.100.0.30)
2019-03-24 12:07:59 copying disk images
2019-03-24 12:07:59 starting VM 101 on remote node 'PVE003'
2019-03-24 12:08:01 can't activate LV '/dev/vg1_iscsi/vm-101-disk-0': Cannot activate LVs in VG vg1_iscsi while PVs appear on duplicate devices.
2019-03-24 12:08:01 ERROR: online migrate failure - command '/usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=PVE003' root@10.100.0.30 qm start 101 --skiplock --migratedfrom PVE004 --migration_type secure --stateuri unix --machine pc-i440fx-2.12' failed: exit code 255
2019-03-24 12:08:01 aborting phase 2 - cleanup resources
2019-03-24 12:08:01 migrate_cancel
2019-03-24 12:08:02 ERROR: migration finished with problems (duration 00:00:03)
TASK ERROR: migration problems
OutputStatus
Stop
task started by HA resource agent
2019-03-24 12:07:59 starting migration of VM 101 to node 'PVE003' (10.100.0.30)
2019-03-24 12:07:59 copying disk images
2019-03-24 12:07:59 starting VM 101 on remote node 'PVE003'
2019-03-24 12:08:01 can't activate LV '/dev/vg1_iscsi/vm-101-disk-0': Cannot activate LVs in VG vg1_iscsi while PVs appear on duplicate devices.
2019-03-24 12:08:01 ERROR: online migrate failure - command '/usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=PVE003' root@10.100.0.30 qm start 101 --skiplock --migratedfrom PVE004 --migration_type secure --stateuri unix --machine pc-i440fx-2.12' failed: exit code 255
2019-03-24 12:08:01 aborting phase 2 - cleanup resources
2019-03-24 12:08:01 migrate_cancel
2019-03-24 12:08:02 ERROR: migration finished with problems (duration 00:00:03)
TASK ERROR: migration problems
After 3 or more rebooting of the server, the error was gone. But if I reboot one of the servers from the cluster, the error occurs again.
Maybe someone is familiar with the problem, and can suggest something?
Thank you!
Aleksei