VM freezes when migrating from node to node

systemctl

Member
Jun 3, 2022
19
0
6
Hello, we have a cluster with 2 nodes on different hardware - E5-2680 v3 and Silver 4214 .
Any VM that migrates from one to another freezes, all time. We have to reset them.
All VMs have kvm64 processor type, but honestly it doesn't matter, we tried on any processor type and even custom types.

Migration between v3 - v2 ( and vice versa ) or between Silver - Gold works fine.

Any suggestions?

screened VM console just after migration with kernel panic log.

pveversion -v:
# pveversion -v
proxmox-ve: 7.2-1 (running kernel: 5.15.74-1-pve)
pve-manager: 7.2-11 (running version: 7.2-11/b76d3178)
pve-kernel-5.15: 7.2-14
pve-kernel-helper: 7.2-14
pve-kernel-5.11: 7.0-10
pve-kernel-5.4: 6.4-5
pve-kernel-5.15.74-1-pve: 5.15.74-1
pve-kernel-5.15.35-2-pve: 5.15.35-5
pve-kernel-5.11.22-7-pve: 5.11.22-12
pve-kernel-5.11.22-4-pve: 5.11.22-9
pve-kernel-5.4.128-1-pve: 5.4.128-2
ceph-fuse: 14.2.21-1
corosync: 3.1.5-pve2
criu: 3.15-1+pve-1
glusterfs-client: 9.2-1
ifupdown: 0.8.36+pve2
libjs-extjs: 7.0.0-1
libknet1: 1.24-pve1
libproxmox-acme-perl: 1.4.2
libproxmox-backup-qemu0: 1.3.1-1
libpve-access-control: 7.2-4
libpve-apiclient-perl: 3.2-1
libpve-common-perl: 7.2-4
libpve-guest-common-perl: 4.1-4
libpve-http-server-perl: 4.1-5
libpve-storage-perl: 7.2-10
libspice-server1: 0.14.3-2.1
lvm2: 2.03.11-2.1
lxc-pve: 5.0.0-3
lxcfs: 4.0.12-pve1
novnc-pve: 1.3.0-3
proxmox-backup-client: 2.2.7-1
proxmox-backup-file-restore: 2.2.7-1
proxmox-mini-journalreader: 1.3-1
proxmox-widget-toolkit: 3.5.1
pve-cluster: 7.2-2
pve-container: 4.2-3
pve-docs: 7.2-2
pve-edk2-firmware: 3.20220526-1
pve-firewall: 4.2-6
pve-firmware: 3.5-6
pve-ha-manager: 3.4.0
pve-i18n: 2.7-2
pve-qemu-kvm: 7.0.0-4
pve-xtermjs: 4.16.0-1
qemu-server: 7.2-5
smartmontools: 7.2-pve3
spiceterm: 3.2-2
swtpm: 0.8.0~bpo11+2
vncterm: 1.7-1
zfsutils-linux: 2.1.6-pve1
 

Attachments

  • Screenshot 2022-12-12 at 15.04.46.png
    Screenshot 2022-12-12 at 15.04.46.png
    504.4 KB · Views: 10
Can you try upgrading to kernel version 5.19 and see if that fixes the issue? We had some issues with 5.15 and live migration, but that was mostly related to AMD CPUs iirc.

Migrating between different CPUs can always be a bit tricky. Would it be an option to shutdown the VM, migrate it, then boot it again? This should work in any case.
 
  • Like
Reactions: systemctl and rason
Hi,
Can you try upgrading to kernel version 5.19 and see if that fixes the issue? We had some issues with 5.15 and live migration, but that was mostly related to AMD CPUs iirc.

Migrating between different CPUs can always be a bit tricky. Would it be an option to shutdown the VM, migrate it, then boot it again? This should work in any case.
FYI, there's also the FPU issue which does affect Intel CPUs: https://forum.proxmox.com/threads/109645/ and is also solved by upgrading to kernel 5.19 :)
 
  • Like
Reactions: shanreich