I cannot get my head around this, as I`m experiencing some very weird server IO delay and server crash. It seems I have this exact same issue [1][2].
The setup has ZFS + ZIL + L2ARC, however the RAM was getting used up to 100+ GB and the L2ARC was (and is set) to 150GB/read/write on the SSD.
Current HDDs:
ZFS is currently running with 2 pools:
I know and understand that normal spinning HDD's cannot cope too much, this is why I setup ZFS with cache and ARC hoping this will improve, and to be honest it did because speed was amazing.
But I don't understand why randomly the sudden high IO and server hung... can someone advise me what to do ?
PS: HW cannot be modified.
[1] https://forum.proxmox.com/threads/proxmox-ve-new-server-high-io-delay.39162/
[2] https://forum.proxmox.com/threads/zfs-high-io-again.55331/
The setup has ZFS + ZIL + L2ARC, however the RAM was getting used up to 100+ GB and the L2ARC was (and is set) to 150GB/read/write on the SSD.
Current HDDs:
- 2× 960GB SSD NVMe (SAMSUNG MZQLB960HAJR-00007) and 2× 6TB HDD SATA Soft RAID (HGST_HUS726T6TALE6L1 )
ZFS is currently running with 2 pools:
- 1 for SSD (rpool)
- 1 for HDD
Bash:
NAME SIZE ALLOC FREE CKPOINT EXPANDSZ FRAG CAP DEDUP HEALTH ALTROOT
rpool 888G 115G 773G - - 5% 12% 1.00x ONLINE -
mirror 888G 115G 773G - - 5% 12.9% - ONLINE
nvme0n1p2 - - - - - - - - ONLINE
nvme1n1p2 - - - - - - - - ONLINE
vmpool 5.44T 72.5G 5.37T - - 0% 1% 1.00x ONLINE -
mirror 5.44T 72.5G 5.37T - - 0% 1.30% - ONLINE
sdb - - - - - - - - ONLINE
sda - - - - - - - - ONLINE
I know and understand that normal spinning HDD's cannot cope too much, this is why I setup ZFS with cache and ARC hoping this will improve, and to be honest it did because speed was amazing.
But I don't understand why randomly the sudden high IO and server hung... can someone advise me what to do ?
PS: HW cannot be modified.
[1] https://forum.proxmox.com/threads/proxmox-ve-new-server-high-io-delay.39162/
[2] https://forum.proxmox.com/threads/zfs-high-io-again.55331/