- Both nodes are in a cluster, which seems to work fine.
- I have configured a backup from my main node to my NAS node in 'Datacentre'.
- Both nodes are running 'Linux 6.2.16-6-pve #1 SMP PREEMPT_DYNAMIC PMX 6.2.16-7 (2023-08-01T11:23Z)'
Using TurnKey fileserver on my NAS node, the SMB shares (I have three) randomly go offline, causing multiple zed delay entries in the syslog for my ZFS pool involving all disks within the pool I use for my backup job. Both ZFS pools on my NAS node have no faults and look to be ok, I scrubbed both multiple times, but the same problem exists.
The backup job for all my VMs & containers completes successfully, but there is something wrong somewhere.
Code:
Aug 18 11:46:58 nas kernel: <TASK>
Aug 18 11:46:58 nas kernel: __schedule+0x402/0x1510
Aug 18 11:46:58 nas kernel: ? try_to_unlazy+0x60/0xe0
Aug 18 11:46:58 nas kernel: ? terminate_walk+0x65/0x100
Aug 18 11:46:58 nas kernel: ? path_parentat+0x49/0x90
Aug 18 11:46:58 nas kernel: schedule+0x63/0x110
Aug 18 11:46:58 nas kernel: rwsem_down_write_slowpath+0x373/0x710
Aug 18 11:46:58 nas kernel: down_write+0x65/0x90
Aug 18 11:46:58 nas kernel: do_unlinkat+0x1df/0x310
Aug 18 11:46:58 nas kernel: __x64_sys_unlink+0x42/0x70
Aug 18 11:46:58 nas kernel: do_syscall_64+0x5b/0x90
Aug 18 11:46:58 nas kernel: ? syscall_exit_to_user_mode+0x29/0x50
Aug 18 11:46:58 nas kernel: ? do_syscall_64+0x67/0x90
Aug 18 11:46:58 nas kernel: entry_SYSCALL_64_after_hwframe+0x72/0xdc
Aug 18 11:46:58 nas kernel: RIP: 0033:0x7f81c7c2a0c7
Aug 18 11:46:58 nas kernel: RSP: 002b:00007ffe4c95c9e8 EFLAGS: 00000297 ORIG_RAX: 0000000000000057
Aug 18 11:46:58 nas kernel: RAX: ffffffffffffffda RBX: 0000556b88f1e280 RCX: 00007f81c7c2a0c7
Aug 18 11:46:58 nas kernel: RDX: 0000000000000000 RSI: 00007f81c733baa5 RDI: 00007ffe4c95c9f0
Aug 18 11:46:58 nas kernel: RBP: 00007ffe4c95c9f0 R08: 0000000000000000 R09: 00007ffe4c95c870
Aug 18 11:46:58 nas kernel: R10: 00000000000027a2 R11: 0000000000000297 R12: 00007f81c733b870
Aug 18 11:46:58 nas kernel: R13: 0000556b88f17eb0 R14: 00007f81c7d63070 R15: 0000556b88f1e280
Aug 18 11:46:58 nas kernel: </TASK>
Aug 18 11:46:58 nas kernel: INFO: task smbd:172788 blocked for more than 120 seconds.
Aug 18 11:46:58 nas kernel: Tainted: P O 6.2.16-6-pve #1
Aug 18 11:46:58 nas kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 18 11:46:58 nas kernel: task:smbd state:D stack:0 pid:172788 ppid:7398 flags:0x00000000
Aug 18 11:46:58 nas kernel: Call Trace:
Aug 18 11:46:58 nas kernel: <TASK>
Aug 18 11:46:58 nas kernel: __schedule+0x402/0x1510
Aug 18 11:46:58 nas kernel: ? try_to_unlazy+0x60/0xe0
Aug 18 11:46:58 nas kernel: ? terminate_walk+0x65/0x100
Aug 18 11:46:58 nas kernel: ? path_parentat+0x49/0x90
Aug 18 11:46:58 nas kernel: schedule+0x63/0x110
Aug 18 11:46:58 nas kernel: rwsem_down_write_slowpath+0x373/0x710
Aug 18 11:46:58 nas kernel: down_write+0x65/0x90
Aug 18 11:46:58 nas kernel: do_unlinkat+0x1df/0x310
Aug 18 11:46:58 nas kernel: __x64_sys_unlink+0x42/0x70
Aug 18 11:46:58 nas kernel: do_syscall_64+0x5b/0x90
Aug 18 11:46:58 nas kernel: ? exit_to_user_mode_prepare+0x39/0x190
Aug 18 11:46:58 nas kernel: ? syscall_exit_to_user_mode+0x29/0x50
Aug 18 11:46:58 nas kernel: ? do_syscall_64+0x67/0x90
Aug 18 11:46:58 nas kernel: ? exit_to_user_mode_prepare+0x39/0x190
Aug 18 11:46:58 nas kernel: ? irqentry_exit_to_user_mode+0x9/0x20
Aug 18 11:46:58 nas kernel: ? irqentry_exit+0x43/0x50
Aug 18 11:46:58 nas kernel: ? exc_page_fault+0x91/0x1b0
Aug 18 11:46:58 nas kernel: entry_SYSCALL_64_after_hwframe+0x72/0xdc
Aug 18 11:46:58 nas kernel: RIP: 0033:0x7f81c7c2a0c7
Aug 18 11:46:58 nas kernel: RSP: 002b:00007ffe4c95c9e8 EFLAGS: 00000297 ORIG_RAX: 0000000000000057
Aug 18 11:46:58 nas kernel: RAX: ffffffffffffffda RBX: 0000556b88f1e280 RCX: 00007f81c7c2a0c7
Aug 18 11:46:58 nas kernel: RDX: 0000000000000000 RSI: 00007f81c733baa5 RDI: 00007ffe4c95c9f0
Aug 18 11:46:58 nas kernel: RBP: 00007ffe4c95c9f0 R08: 0000000000000000 R09: 00007ffe4c95c870
Aug 18 11:46:58 nas kernel: R10: 00000000000027a3 R11: 0000000000000297 R12: 00007f81c733b870
Aug 18 11:46:58 nas kernel: R13: 0000556b88f17eb0 R14: 00007f81c7d63070 R15: 0000556b88f1e280
Aug 18 11:46:58 nas kernel: </TASK>