Hello,
I have some errors on one of our zfs pools. I cannot perform any action that is related to ZFS operations like pool scrub, snapshots, ...
Maybe anyone is able to help ?
The node is one of our old nodes with consumer hardware (Asus H170M-PLUS, Core i7 and no ecc ram). ZFS is running on Samsung SSD EVO.
Zpool status looks good so far. Maybe it is somehow related to the hardware. But please have a look.
proxmox-ve: 6.2-1 (running kernel: 5.4.60-1-pve)
zfsutils-linux: 0.8.4-pve1
zpool status
pool: zpool1
state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
the pool may no longer be accessible by software that does not support
the features. See zpool-features(5) for details.
scan: scrub repaired 0B in 0 days 00:51:30 with 0 errors on Sat Sep 12 23:15:31 2020
config:
NAME STATE READ WRITE CKSUM
zpool1 ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
7f9ce18d-634e-4315-8504-ba65ed8f6327 ONLINE 0 0 0
b673d8e7-7995-457e-a7a3-bf30aefb89a6 ONLINE 0 0 0
errors: No known data errors
Output of dmesg:
[ 821.067625] ata5.00: Enabling discard_zeroes_data
[ 850.836877] ata6.00: Enabling discard_zeroes_data
[ 1005.452750] zd0: p1
[ 1005.455945] zd16: p1 p2 p3
[ 1005.460989] zd32: p1 p2
[ 1005.468935] zd48: p1 p2 p3 p4 < p5 p6 >
[ 1005.474918] zd64: p1 p3
[ 1005.708613] PANIC: blkptr at 00000000baa0bfe2 has invalid TYPE 108
[ 1005.708615] PANIC: blkptr at 00000000c30871fb DVA 0 has invalid VDEV 131072
[ 1005.708616] Showing stack for process 11383
[ 1005.708617] Showing stack for process 11320
[ 1005.708618] CPU: 1 PID: 11383 Comm: z_wr_int Tainted: P O 5.4.0-42-generic #46-Ubuntu
[ 1005.708618] Hardware name: System manufacturer System Product Name/H170M-PLUS, BIOS 0704 02/15/2016
[ 1005.708619] Call Trace:
[ 1005.708624] dump_stack+0x6d/0x9a
[ 1005.708631] spl_dumpstack+0x29/0x2b [spl]
[ 1005.708636] vcmn_err.cold+0x60/0x99 [spl]
[ 1005.708638] ? check_preempt_curr+0x20/0x90
[ 1005.708639] ? ttwu_do_wakeup+0x1e/0x150
[ 1005.708641] ? ttwu_do_activate+0x5b/0x70
[ 1005.708642] ? try_to_wake_up+0x224/0x6a0
[ 1005.708698] zfs_panic_recover+0x6f/0x90 [zfs]
[ 1005.708744] zfs_blkptr_verify+0x380/0x440 [zfs]
[ 1005.708786] zio_free+0x22/0xf0 [zfs]
[ 1005.708825] dsl_free+0x11/0x20 [zfs]
[ 1005.708860] dsl_dataset_block_kill+0x2ba/0x480 [zfs]
[ 1005.708891] dbuf_write_done+0x1b2/0x1e0 [zfs]
[ 1005.708921] arc_write_done+0x235/0x440 [zfs]
[ 1005.708964] zio_done+0x3aa/0xe20 [zfs]
[ 1005.709006] zio_execute+0x91/0xe0 [zfs]
[ 1005.709012] taskq_thread+0x245/0x430 [spl]
[ 1005.709014] ? __switch_to_asm+0x40/0x70
[ 1005.709016] ? wake_up_q+0x70/0x70
[ 1005.709059] ? zio_execute_stack_check.constprop.0+0x10/0x10 [zfs]
[ 1005.709061] kthread+0x104/0x140
[ 1005.709065] ? task_done+0x90/0x90 [spl]
[ 1005.709066] ? kthread_park+0x90/0x90
[ 1005.709067] ret_from_fork+0x35/0x40
[ 1005.709069] CPU: 5 PID: 11320 Comm: z_wr_int Tainted: P O 5.4.0-42-generic #46-Ubuntu
[ 1005.709070] Hardware name: System manufacturer System Product Name/H170M-PLUS, BIOS 0704 02/15/2016
[ 1005.709070] Call Trace:
[ 1005.709073] dump_stack+0x6d/0x9a
[ 1005.709079] spl_dumpstack+0x29/0x2b [spl]
[ 1005.709085] vcmn_err.cold+0x60/0x99 [spl]
[ 1005.709086] ? check_preempt_curr+0x7a/0x90
[ 1005.709088] ? ttwu_do_wakeup+0x1e/0x150
[ 1005.709089] ? ttwu_do_activate+0x5b/0x70
[ 1005.709091] ? try_to_wake_up+0x224/0x6a0
[ 1005.709093] ? dm_make_request+0x58/0xb0
[ 1005.709137] zfs_panic_recover+0x6f/0x90 [zfs]
[ 1005.709185] zfs_blkptr_verify+0x3ab/0x440 [zfs]
[ 1005.709236] zio_free+0x22/0xf0 [zfs]
[ 1005.709288] dsl_free+0x11/0x20 [zfs]
[ 1005.709336] dsl_dataset_block_kill+0x2ba/0x480 [zfs]
[ 1005.709380] dbuf_write_done+0x1b2/0x1e0 [zfs]
[ 1005.709416] arc_write_done+0x235/0x440 [zfs]
[ 1005.709463] zio_done+0x3aa/0xe20 [zfs]
[ 1005.709510] zio_execute+0x91/0xe0 [zfs]
[ 1005.709514] taskq_thread+0x245/0x430 [spl]
[ 1005.709516] ? wake_up_q+0x70/0x70
[ 1005.709567] ? zio_execute_stack_check.constprop.0+0x10/0x10 [zfs]
[ 1005.709568] kthread+0x104/0x140
[ 1005.709572] ? task_done+0x90/0x90 [spl]
[ 1005.709573] ? kthread_park+0x90/0x90
[ 1005.709574] ret_from_fork+0x35/0x40
I have some errors on one of our zfs pools. I cannot perform any action that is related to ZFS operations like pool scrub, snapshots, ...
Maybe anyone is able to help ?
The node is one of our old nodes with consumer hardware (Asus H170M-PLUS, Core i7 and no ecc ram). ZFS is running on Samsung SSD EVO.
Zpool status looks good so far. Maybe it is somehow related to the hardware. But please have a look.
proxmox-ve: 6.2-1 (running kernel: 5.4.60-1-pve)
zfsutils-linux: 0.8.4-pve1
zpool status
pool: zpool1
state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
the pool may no longer be accessible by software that does not support
the features. See zpool-features(5) for details.
scan: scrub repaired 0B in 0 days 00:51:30 with 0 errors on Sat Sep 12 23:15:31 2020
config:
NAME STATE READ WRITE CKSUM
zpool1 ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
7f9ce18d-634e-4315-8504-ba65ed8f6327 ONLINE 0 0 0
b673d8e7-7995-457e-a7a3-bf30aefb89a6 ONLINE 0 0 0
errors: No known data errors
Output of dmesg:
[ 821.067625] ata5.00: Enabling discard_zeroes_data
[ 850.836877] ata6.00: Enabling discard_zeroes_data
[ 1005.452750] zd0: p1
[ 1005.455945] zd16: p1 p2 p3
[ 1005.460989] zd32: p1 p2
[ 1005.468935] zd48: p1 p2 p3 p4 < p5 p6 >
[ 1005.474918] zd64: p1 p3
[ 1005.708613] PANIC: blkptr at 00000000baa0bfe2 has invalid TYPE 108
[ 1005.708615] PANIC: blkptr at 00000000c30871fb DVA 0 has invalid VDEV 131072
[ 1005.708616] Showing stack for process 11383
[ 1005.708617] Showing stack for process 11320
[ 1005.708618] CPU: 1 PID: 11383 Comm: z_wr_int Tainted: P O 5.4.0-42-generic #46-Ubuntu
[ 1005.708618] Hardware name: System manufacturer System Product Name/H170M-PLUS, BIOS 0704 02/15/2016
[ 1005.708619] Call Trace:
[ 1005.708624] dump_stack+0x6d/0x9a
[ 1005.708631] spl_dumpstack+0x29/0x2b [spl]
[ 1005.708636] vcmn_err.cold+0x60/0x99 [spl]
[ 1005.708638] ? check_preempt_curr+0x20/0x90
[ 1005.708639] ? ttwu_do_wakeup+0x1e/0x150
[ 1005.708641] ? ttwu_do_activate+0x5b/0x70
[ 1005.708642] ? try_to_wake_up+0x224/0x6a0
[ 1005.708698] zfs_panic_recover+0x6f/0x90 [zfs]
[ 1005.708744] zfs_blkptr_verify+0x380/0x440 [zfs]
[ 1005.708786] zio_free+0x22/0xf0 [zfs]
[ 1005.708825] dsl_free+0x11/0x20 [zfs]
[ 1005.708860] dsl_dataset_block_kill+0x2ba/0x480 [zfs]
[ 1005.708891] dbuf_write_done+0x1b2/0x1e0 [zfs]
[ 1005.708921] arc_write_done+0x235/0x440 [zfs]
[ 1005.708964] zio_done+0x3aa/0xe20 [zfs]
[ 1005.709006] zio_execute+0x91/0xe0 [zfs]
[ 1005.709012] taskq_thread+0x245/0x430 [spl]
[ 1005.709014] ? __switch_to_asm+0x40/0x70
[ 1005.709016] ? wake_up_q+0x70/0x70
[ 1005.709059] ? zio_execute_stack_check.constprop.0+0x10/0x10 [zfs]
[ 1005.709061] kthread+0x104/0x140
[ 1005.709065] ? task_done+0x90/0x90 [spl]
[ 1005.709066] ? kthread_park+0x90/0x90
[ 1005.709067] ret_from_fork+0x35/0x40
[ 1005.709069] CPU: 5 PID: 11320 Comm: z_wr_int Tainted: P O 5.4.0-42-generic #46-Ubuntu
[ 1005.709070] Hardware name: System manufacturer System Product Name/H170M-PLUS, BIOS 0704 02/15/2016
[ 1005.709070] Call Trace:
[ 1005.709073] dump_stack+0x6d/0x9a
[ 1005.709079] spl_dumpstack+0x29/0x2b [spl]
[ 1005.709085] vcmn_err.cold+0x60/0x99 [spl]
[ 1005.709086] ? check_preempt_curr+0x7a/0x90
[ 1005.709088] ? ttwu_do_wakeup+0x1e/0x150
[ 1005.709089] ? ttwu_do_activate+0x5b/0x70
[ 1005.709091] ? try_to_wake_up+0x224/0x6a0
[ 1005.709093] ? dm_make_request+0x58/0xb0
[ 1005.709137] zfs_panic_recover+0x6f/0x90 [zfs]
[ 1005.709185] zfs_blkptr_verify+0x3ab/0x440 [zfs]
[ 1005.709236] zio_free+0x22/0xf0 [zfs]
[ 1005.709288] dsl_free+0x11/0x20 [zfs]
[ 1005.709336] dsl_dataset_block_kill+0x2ba/0x480 [zfs]
[ 1005.709380] dbuf_write_done+0x1b2/0x1e0 [zfs]
[ 1005.709416] arc_write_done+0x235/0x440 [zfs]
[ 1005.709463] zio_done+0x3aa/0xe20 [zfs]
[ 1005.709510] zio_execute+0x91/0xe0 [zfs]
[ 1005.709514] taskq_thread+0x245/0x430 [spl]
[ 1005.709516] ? wake_up_q+0x70/0x70
[ 1005.709567] ? zio_execute_stack_check.constprop.0+0x10/0x10 [zfs]
[ 1005.709568] kthread+0x104/0x140
[ 1005.709572] ? task_done+0x90/0x90 [spl]
[ 1005.709573] ? kthread_park+0x90/0x90
[ 1005.709574] ret_from_fork+0x35/0x40
Last edited: