Hi everyone,
I’m dealing with persistent I/O stalls in a setup where:
- Proxmox VE (host) runs TrueNAS SCALE as a VM
- The Proxmox host mounts NFS/SMB exports from the TrueNAS VM
- These mounts are bind-mounted into unprivileged LXC containers (Plex, Tdarr, TubeArchivist)
Under heavier read/write workloads, I consistently see I/O freezes, kernel logs with NFS “not responding”, CIFS reconnect loops, and occasional hung tasks (Tdarr_Server, ffprobe, HandBrakeCLI, dmx0:matroska,w) stuck in state D.
Here is my environment:
- Proxmox VE: kernel 6.14.11-4-pve
- TrueNAS SCALE: running as a VM (VirtIO NIC + disks via HBA passthrough)
- TrueNAS VM: 4 vCPUs (tested with 8 vCPUs too), ZFS on raw disks
- Networking: 10 GbE (LACP bond, MTU 1500, RSTP + flow control disabled, VirtIO interface)
- LXC containers (unprivileged):
- 117: Plex – mostly read, rare writes
- 120: Tdarr – transcodes (heavy RW)
- 130: TubeArchivist – RW for metadata and thumbnails
SYMPTOMS/LOGS:
NFS (on Proxmox host):
nfs: server 192.168.30.104 not responding, still trying
...
nfs: server 192.168.30.104 OK
Typically repeating in long cycles:
[31958.410247] nfs: server 192.168.30.104 not responding, still trying
[32741.776511] nfs: server 192.168.30.104 OK
[33028.495824] nfs: server 192.168.30.104 not responding, still trying
[33749.397840] nfs: server 192.168.30.104 OK
And frequent hung tasks (state D) during flush/close:
INFO: task Tdarr_Server:661925 blocked for more than 122 seconds.
...
nfs_wb_all -> nfs4_file_flush -> filp_flush -> __x64_sys_close
SMB (on Proxmox host):
CIFS: VFS: \\192.168.30.104 has not responded in 45 seconds. Reconnecting...
CIFS: trying to dequeue a deleted mid
What I’ve already tested:
NFS and SMB tuning – various option profiles, Mount separation (each container gets its own NFS/SMB mount), Network & kernel tuning
(Flow Control + RSTP disabled)).
Result: despite all above, NFS “not responding” and CIFS “Reconnecting…” persist, especially during concurrent RW (Tdarr) and RO (Plex) workloads.
Occasionally Tdarr processes enter D state due to kernel waits in nfs_wb_all or netfs_write.
To be clear, in my previous version of Proxmox, which was 8, everything worked perfectly without any problems with NFS or SMB mounts in unprivileged LXC containers.