Dell T440 here. Unifi network stack though.Maybe the Unifi UNAS Pro is the common link? Seems like a couple of us have the same NAS.
Anyone having this issue not on the Unifi UNAS Pro?
Dell T440 here. Unifi network stack though.Maybe the Unifi UNAS Pro is the common link? Seems like a couple of us have the same NAS.
Anyone having this issue not on the Unifi UNAS Pro?
What version are you on? 9.1.6 seams to have improved things, but I don't know if it is 100% resolved.Has this just "magically" gone away for people or just me?
I have had a huge media conversion job running for the past 8 hours and it has not hung once (normally it cant do more than 30 mins).
I am on 9.1.6What version are you on? 9.1.6 seams to have improved things, but I don't know if it is 100% resolved.
I have a Dell t440 with TrueNAS and Plex in VMs. Yesterday one of my friends got "Please check that this file exist and the necessary drive is mounted." which has been indicative of this issue, but overall I'm not seeing the kinds of errors or other issues I was before.
FWIW, I switched my one share that experiences heavy writes to CIFS shortly after posting this and it has been stable for ~1 week so far.+1 to this. Running PVE 9.1.5 with kernel 6.14. I have an OpenMediaVault VM which has a disk passed through and exposes a few NFS shares that are mounted in various LXCs and the PVE host itself. During heavy writes CPUs in the OMV MV become hung and occasionally deadlocks the entire host.
I've seen similar behavior running PVE 9.1.5 with kernel 6.17 and also with OMV 8.1 kernel 6.18 and OMV 7.4 with kernels 6.1 and 6.12.
nas:/backup/proxmox-VMs on /mnt/pve/backup-NAS type nfs4 (rw,relatime,vers=4.2,rsize=1048576,wsize=1048576,namlen=255,hard,fatal_neterrors=none,proto=tcp,timeo=600,retrans=2,sec=sys,clientaddr=192.168.1.40,local_lock=none,addr=192.168.1.250)
[ 1107.487583] INFO: task CPU 2/KVM:1947 blocked for more than 245 seconds.
[ 1107.487591] Tainted: P O 6.17.13-1-pve #1
...
[ 1107.487626] __schedule+0x468/0x1310
[ 1107.487660] schedule+0x27/0xf0
[ 1107.487673] folio_wait_bit_common+0x124/0x2f0
[ 1107.487708] folio_wait_writeback+0x2b/0xa0
[ 1107.487721] nfs_wb_folio+0x94/0x1e0 [nfs]
[ 1107.487893] nfs_release_folio+0x72/0x110 [nfs]
...
[ 5285.096199] INFO: task iou-wrk-1941:24182 is blocked on an rw-semaphore
[ 5285.096224] task:iou-wrk-1941 state:D ...
[ 5285.096296] rwsem_down_read_slowpath+0x24e/0x540
[ 5285.096308] down_read+0x48/0xc0
[ 5285.096328] do_exit+0x1f2/0xa20
[ 5285.096339] io_wq_worker+0x2d6/0x390
...
[ 5343.886173] systemd[1]: systemd-journald.service: start operation timed out. Terminating.
cp /mnt/pve/nvme-vms/images/202/vm-202-disk-0.qcow2 /mnt/pve/backup-NAS/test.qcow2
proxmox-boot-tool kernel pin 6.8.12-18-pve
proxmox-boot-tool refresh
reboot
Workaround:
Pinning kernel to 6.8.12-18-pve resolved the issue completely. Backups now run at full speed (~120 MB/s) without any freezes nor direct NFS copies:
Code:proxmox-boot-tool kernel pin 6.8.12-18-pve proxmox-boot-tool refresh reboot
Hi @kouellette,+1 to this. Running PVE 9.1.5 with kernel 6.14. I have an OpenMediaVault VM which has a disk passed through and exposes a few NFS shares that are mounted in various LXCs and the PVE host itself. During heavy writes CPUs in the OMV MV become hung and occasionally deadlocks the entire host.
I've seen similar behavior running PVE 9.1.5 with kernel 6.17 and also with OMV 8.1 kernel 6.18 and OMV 7.4 with kernels 6.1 and 6.12.
We use essential cookies to make this site work, and optional cookies to enhance your experience.