I'm hitting this on my production node as well. I approached it from the ZFS side and found the following issue:
https://github.com/openzfs/zfs/issues/7631
As you can see, I commented there and managed to improve things a bit, but it's still not ideal. For reference, my hardware is as follows:
CPU(s): 64 x AMD EPYC 7502P 32-Core Processor (1 Socket)
Kernel Version: Linux 5.3.13-1-pve #1 SMP PVE 5.3.13-1 (Thu, 05 Dec 2019 07:18:14 +0100)
PVE Manager Version: pve-manager/6.1-5/9bf06119
Node             SN                   Model                                    Namespace Usage                      Format           FW Rev
---------------- -------------------- ---------------------------------------- --------- -------------------------- ---------------- --------
/dev/nvme0n1     S437NY0M701561       SAMSUNG MZQLB960HAJR-00007               1         953.48 GB / 960.20 GB      512 B + 0 B      EDA5202Q
/dev/nvme1n1     S437NY0M701840       SAMSUNG MZQLB960HAJR-00007               1         953.48 GB / 960.20 GB      512 B + 0 B      EDA5202Q
/dev/nvme2n1     S437NY0M701866       SAMSUNG MZQLB960HAJR-00007               1         953.48 GB / 960.20 GB      512 B + 0 B      EDA5202Q
/dev/nvme3n1     S437NY0M701498       SAMSUNG MZQLB960HAJR-00007               1         953.48 GB / 960.20 GB      512 B + 0 B      EDA5202Q
/dev/nvme4n1     S35ENA0K621779       SAMSUNG MZVLW256HEHP-000L7               1         254.16 GB / 256.06 GB      512 B + 0 B      5L7QCXB7
/dev/nvme5n1     S35ENA0K621968       SAMSUNG MZVLW256HEHP-000L7               1         254.16 GB / 256.06 GB      512 B + 0 B      5L7QCXB7
/dev/nvme6n1     CVPF6414004G1P2NGN   INTEL SSDPE2MX012T7                      1         1.20 TB / 1.20 TB          4 KiB + 0 B      MDV1NX27
/dev/nvme7n1     CVPF641400MY1P2NGN   INTEL SSDPE2MX012T7                      1         1.20 TB / 1.20 TB          4 KiB + 0 B      MDV1NX27
/dev/nvme8n1     BTPF7516030T1P2NGN   INTEL SSDPE2MX012T7                      1         1.20 TB / 1.20 TB          4 KiB + 0 B      MDV10271
/dev/nvme9n1     BTPF751603801P2NGN   INTEL SSDPE2MX012T7                      1         1.20 TB / 1.20 TB          4 KiB + 0 B      MDV10271