Hello,
I have a strange situation when live-migrating a VM between two nodes.
Both have AMD CPUs, but different models.
First node type:
Code:
CPU(s) 64 x AMD EPYC 7513 32-Core Processor (2 Sockets)
Kernel Version Linux 5.15.60-2-pve #1 SMP PVE 5.15.60-2 (Tue, 04 Oct 2022 16:52:28 +0200)
PVE Manager Version pve-manager/7.2-11/b76d3178
Second node type:
Code:
CPU(s) 128 x AMD EPYC 7662 64-Core Processor (2 Sockets)
Kernel Version Linux 5.15.60-2-pve #1 SMP PVE 5.15.60-2 (Tue, 04 Oct 2022 16:52:28 +0200)
PVE Manager Version pve-manager/7.2-11/b76d3178
And now the problem:
When I migrate from the second node type to the first, everything works.
When I migrate from the first node type to the second, the VM gets suspended.
I can reproduce the issue.
Here is the GUI log:
Code:
2022-10-19 07:02:14 starting migration of VM 99998 to node 'HV-COSSACK-C-01' (10.42.254.135)
2022-10-19 07:02:14 starting VM 99998 on remote node 'HV-COSSACK-C-01'
2022-10-19 07:02:17 start remote tunnel
2022-10-19 07:02:18 ssh tunnel ver 1
2022-10-19 07:02:18 starting online/live migration on unix:/run/qemu-server/99998.migrate
2022-10-19 07:02:18 set migration capabilities
2022-10-19 07:02:18 migration downtime limit: 100 ms
2022-10-19 07:02:18 migration cachesize: 512.0 MiB
2022-10-19 07:02:18 set migration parameters
2022-10-19 07:02:18 start migrate command to unix:/run/qemu-server/99998.migrate
2022-10-19 07:02:19 migration active, transferred 250.5 MiB of 4.0 GiB VM-state, 1000.1 MiB/s
2022-10-19 07:02:20 average migration speed: 2.0 GiB/s - downtime 93 ms
2022-10-19 07:02:20 migration status: completed
2022-10-19 07:02:23 migration finished successfully (duration 00:00:09)
TASK OK
journalctl doesn't show any problems:
Code:
paź 19 06:47:15 hostname-1 sshd[1410497]: Accepted publickey for root from >here is IPv4< port 47388 ssh2: RSA SHA256:68BpbZuE8rw0MCKtQhL4HDNXIgjJOtiI+QSbfvrJugw
paź 19 06:47:15 hostname-1 sshd[1410497]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0)
paź 19 06:47:15 hostname-1 systemd-logind[7765]: New session 1094 of user root.
paź 19 06:47:15 hostname-1 systemd[1]: Started Session 1094 of user root.
paź 19 06:47:16 hostname-1 sshd[1410497]: Received disconnect from >here is IPv4< port 47388:11: disconnected by user
paź 19 06:47:16 hostname-1 sshd[1410497]: Disconnected from user root >here is IPv4< port 47388
paź 19 06:47:16 hostname-1 sshd[1410497]: pam_unix(sshd:session): session closed for user root
paź 19 06:47:16 hostname-1 systemd[1]: session-1094.scope: Succeeded.
paź 19 06:47:16 hostname-1 systemd-logind[7765]: Session 1094 logged out. Waiting for processes to exit.
paź 19 06:47:16 hostname-1 systemd-logind[7765]: Removed session 1094.
paź 19 06:47:16 hostname-1 pmxcfs[2562]: [status] notice: received log
paź 19 06:47:16 hostname-1 pmxcfs[2562]: [status] notice: received log
paź 19 06:47:16 hostname-1 sshd[1410532]: Accepted publickey for root from >here is IPv4< port 37682 ssh2: RSA SHA256:gQoCOH/xCQvl/W3jDlzhTmVEdMnPQb9a4rIeEBRpHNo
paź 19 06:47:16 hostname-1 sshd[1410532]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0)
paź 19 06:47:16 hostname-1 systemd-logind[7765]: New session 1095 of user root.
paź 19 06:47:16 hostname-1 systemd[1]: Started Session 1095 of user root.
paź 19 06:47:17 hostname-1 sshd[1410541]: Accepted publickey for root from >here is IPv4< port 47394 ssh2: RSA SHA256:68BpbZuE8rw0MCKtQhL4HDNXIgjJOtiI+QSbfvrJugw
paź 19 06:47:17 hostname-1 sshd[1410541]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0)
paź 19 06:47:17 hostname-1 systemd-logind[7765]: New session 1096 of user root.
paź 19 06:47:17 hostname-1 systemd[1]: Started Session 1096 of user root.
paź 19 06:47:18 hostname-1 sshd[1410541]: Received disconnect from >here is IPv4< port 47394:11: disconnected by user
paź 19 06:47:18 hostname-1 sshd[1410541]: Disconnected from user root >here is IPv4< port 47394
paź 19 06:47:18 hostname-1 sshd[1410541]: pam_unix(sshd:session): session closed for user root
paź 19 06:47:18 hostname-1 systemd[1]: session-1096.scope: Succeeded.
paź 19 06:47:18 hostname-1 systemd[1]: session-1096.scope: Consumed 1.015s CPU time.
paź 19 06:47:18 hostname-1 systemd-logind[7765]: Session 1096 logged out. Waiting for processes to exit.
paź 19 06:47:18 hostname-1 systemd-logind[7765]: Removed session 1096.
paź 19 06:47:18 hostname-1 pmxcfs[2562]: [status] notice: received log
paź 19 06:47:35 hostname-1 pmxcfs[2562]: [status] notice: received log
paź 19 06:47:35 hostname-1 sshd[1410742]: Accepted publickey for root from >here is IPv4< port 50260 ssh2: RSA SHA256:gQoCOH/xCQvl/W3jDlzhTmVEdMnPQb9a4rIeEBRpHNo
paź 19 06:47:35 hostname-1 sshd[1410742]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0)
paź 19 06:47:35 hostname-1 systemd-logind[7765]: New session 1097 of user root.
paź 19 06:47:35 hostname-1 systemd[1]: Started Session 1097 of user root.
paź 19 06:47:42 hostname-1 sshd[1410742]: Received disconnect from >here is IPv4< port 50260:11: disconnected by user
paź 19 06:47:42 hostname-1 sshd[1410742]: Disconnected from user root >here is IPv4< port 50260
paź 19 06:47:42 hostname-1 sshd[1410742]: pam_unix(sshd:session): session closed for user root
paź 19 06:47:42 hostname-1 systemd[1]: session-1097.scope: Succeeded.
paź 19 06:47:42 hostname-1 systemd-logind[7765]: Session 1097 logged out. Waiting for processes to exit.
paź 19 06:47:42 hostname-1 systemd-logind[7765]: Removed session 1097.
paź 19 06:47:42 hostname-1 pmxcfs[2562]: [status] notice: received log
The VM is not blinking " - ".
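Since the two node types have different CPU models, one thing that could be checked is whether their CPU feature flags differ. Below is a minimal sketch, not a confirmed diagnosis: it assumes the contents of /proc/cpuinfo from each node have been saved as text, and the function names `parse_flags` and `compare_flags` are made up for illustration.

```python
from __future__ import annotations


def parse_flags(cpuinfo_text: str) -> set[str]:
    """Return the feature flags from the first 'flags' line of /proc/cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # /proc/cpuinfo lines look like "flags<tabs>: fpu vme de ..."
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()  # no 'flags' line found


def compare_flags(source_text: str, target_text: str) -> tuple[list[str], list[str]]:
    """Return (flags only on the source node, flags only on the target node), sorted."""
    src = parse_flags(source_text)
    dst = parse_flags(target_text)
    return sorted(src - dst), sorted(dst - src)
```

Calling `compare_flags` with the first node's text as source and the second node's text as target would list, in the first result, any flags that exist on the first node type but are missing on the second.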