Hi.
My setup is simple.
A Proxmox VE host (ZFS on all disks) running a virtualized Proxmox Backup Server.
The Proxmox Backup Server is only virtualized temporarily.
Inside the virtualized Proxmox Backup Server, I need to rsync one datastore to another. The first disk is on one SSD and the second disk is on another, and both SSDs pass their smartctl checks.
Inside the Proxmox Backup Server, both disks use XFS.
When I rsync from the first disk to the second, the VM freezes completely within a few seconds.
When I try to shut down the VM from Proxmox VE, the shutdown times out.
A forced stop works and the VM does power off.
When I try to restart it, I get a timeout on the systemd start.
Only restarting the server gets everything back.
I read some posts here saying the ZFS bug has been fixed, but it is not.
I have other nodes (6.x and 7.4) and none of them have such problems. I don't remember a Proxmox version with so many ZFS problems.
Log from dmesg:
[ 245.101161] INFO: task txg_sync:1915 blocked for more than 120 seconds.
[ 245.101184] Tainted: P IO 6.5.11-7-pve #1
[ 245.101197] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 245.101210] task:txg_sync state: D stack:0 pid:1915 ppid:2 flags:0x00004000
[ 245.101214] Call Trace:
[ 245.101217] <TASK>
[ 245.101221] __schedule+0x3fd/0x1450
[ 245.101229] schedule+0x63/0x110
[ 245.101232] schedule_timeout+0x95/0x170
[ 245.101236] ? __pfx_process_timeout+0x10/0x10
[ 245.101241] io_schedule_timeout+0x51/0x80
[ 245.101246] __cv_timedwait_common+0x140/0x180 [spl]
[ 245.101280] ? __pfx_autoremove_wake_function+0x10/0x10
[ 245.101284] __cv_timedwait_io+0x19/0x30 [spl]
[ 245.101295] zio_wait+0x13a/0x2c0 [zfs]
[ 245.101620] dsl_pool_sync+0xce/0x4e0 [zfs]
[ 245.101800] spa_sync+0x57a/0x1030 [zfs]
[ 245.101977] ? spa_txg_history_init_io+0x120/0x130 [zfs]
[ 245.102150] txg_sync_thread+0x1fd/0x390 [zfs]
[ 245.102322] ? __pfx_txg_sync_thread+0x10/0x10 [zfs]
[ 245.102493] ? __pfx_thread_generic_wrapper+0x10/0x10 [spl]
[ 245.102505] thread_generic_wrapper+0x5c/0x70 [spl]
[ 245.102516] kthread+0xef/0x120
[ 245.102518] ? __pfx_kthread+0x10/0x10
[ 245.102521] ret_from_fork+0x44/0x70
[ 245.102524] ? __pfx_kthread+0x10/0x10
[ 245.102527] ret_from_fork_asm+0x1b/0x30
[ 245.102532] </TASK>
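In case anyone wants to check a longer dmesg capture for more of these reports, here is a minimal Python sketch that pulls out the "blocked for more than N seconds" lines. The function name `find_hung_tasks` and the regular expression are my own assumptions, based only on the message format shown in the log above. It is not a Proxmox or ZFS tool.

```python
import re

# Assumed pattern, modelled on the hung-task line in the log above, e.g.
# "[ 245.101161] INFO: task txg_sync:1915 blocked for more than 120 seconds."
HUNG_TASK = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<name>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(dmesg_text):
    """Return one dict per hung-task report found in the given dmesg text."""
    hits = []
    for line in dmesg_text.splitlines():
        match = HUNG_TASK.search(line)
        if match:
            hits.append({
                "timestamp": float(match.group("ts")),
                "task": match.group("name"),
                "pid": int(match.group("pid")),
                "blocked_secs": int(match.group("secs")),
            })
    return hits


if __name__ == "__main__":
    sample = (
        "[ 245.101161] INFO: task txg_sync:1915 blocked for more than 120 seconds.\n"
        "[ 245.101214] Call Trace:\n"
    )
    for hit in find_hung_tasks(sample):
        print(hit)
```

To use it on a real capture, save the dmesg output to a file on the host and pass the file contents to `find_hung_tasks`. It only lists which tasks were reported as blocked and when, it does not interpret the call trace.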
Kernel Version: Linux 6.5.11-7-pve (2023-12-05T09:44Z)
Manager Version: pve-manager/8.1.3/b46aac3b42da5d15
# zfs --version
zfs-2.2.2-pve1
zfs-kmod-2.2.2-pve1