I have been struggling with an issue for the past few weeks where using either "Move Disk" in the GUI or "qm move disk" in the CLI from one ZFS pool to another soft locks all storage pools on the host, VMs become entirely unresponsive. This only seems to happen with VMs >30GB, for some reason <=30GB the move disk process completes incredibly quickly. After about 31GB have been moved, the disk move progress slows down to an absolute crawl and "zpool iostat" for all pools goes to 0 despite having running workloads that end up freezing.
Canceling the task and monitoring the processes waiting for disk I/O using the following command (watch -n 1 "(ps aux | awk '$8 ~ /D/ { print $0 }')"), there's a bunch of zfs operations queued up along with the "zfs create -s -V" command that "qm disk move" likely triggered. It takes about 10 minutes for the pools to start writing data again and then everything returns to normal like nothing happened. Both pools are ashift=12 and blocksize=8k. If I stop all loads on both pools it doesn't have any different impact on this, the soft lock still occurs.
PVEVersion and package versions, latest updates are all applied and still experiencing this issue.
Canceling the task and monitoring the processes waiting for disk I/O using the following command (watch -n 1 "(ps aux | awk '$8 ~ /D/ { print $0 }')"), there's a bunch of zfs operations queued up along with the "zfs create -s -V" command that "qm disk move" likely triggered. It takes about 10 minutes for the pools to start writing data again and then everything returns to normal like nothing happened. Both pools are ashift=12 and blocksize=8k. If I stop all loads on both pools it doesn't have any different impact on this, the soft lock still occurs.
PVEVersion and package versions, latest updates are all applied and still experiencing this issue.
Code:
# pveversion -v
proxmox-ve: 8.2.0 (running kernel: 6.8.8-2-pve)
pve-manager: 8.2.4 (running version: 8.2.4/faa83925c9641325)
proxmox-kernel-helper: 8.1.0
pve-kernel-5.15: 7.4-14
proxmox-kernel-6.8: 6.8.8-2
proxmox-kernel-6.8.8-2-pve-signed: 6.8.8-2
pve-kernel-5.15.158-1-pve: 5.15.158-1
pve-kernel-5.15.102-1-pve: 5.15.102-1
ceph-fuse: 16.2.11+ds-2
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown2: 3.2.0-1+pmx8
ksm-control-daemon: 1.5-1
libjs-extjs: 7.0.0-4
libknet1: 1.28-pve1
libproxmox-acme-perl: 1.5.1
libproxmox-backup-qemu0: 1.4.1
libproxmox-rs-perl: 0.3.3
libpve-access-control: 8.1.4
libpve-apiclient-perl: 3.3.2
libpve-cluster-api-perl: 8.0.7
libpve-cluster-perl: 8.0.7
libpve-common-perl: 8.2.1
libpve-guest-common-perl: 5.1.3
libpve-http-server-perl: 5.1.0
libpve-network-perl: 0.9.8
libpve-rs-perl: 0.8.9
libpve-storage-perl: 8.2.3
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 6.0.0-1
lxcfs: 6.0.0-pve2
novnc-pve: 1.4.0-3
proxmox-backup-client: 3.2.7-1
proxmox-backup-file-restore: 3.2.7-1
proxmox-firewall: 0.4.2
proxmox-kernel-helper: 8.1.0
proxmox-mail-forward: 0.2.3
proxmox-mini-journalreader: 1.4.0
proxmox-widget-toolkit: 4.2.3
pve-cluster: 8.0.7
pve-container: 5.1.12
pve-docs: 8.2.2
pve-edk2-firmware: 4.2023.08-4
pve-esxi-import-tools: 0.7.1
pve-firewall: 5.0.7
pve-firmware: 3.12-1
pve-ha-manager: 4.0.5
pve-i18n: 3.2.2
pve-qemu-kvm: 9.0.0-5
pve-xtermjs: 5.3.0-3
qemu-server: 8.2.1
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.2.4-pve1
Last edited: