For the past 4 months I had PVE 4.4 running exclusively on a 250GB M.2 SSD via a PCIe adapter (EXT4 filesystem). The drive had less than 4 TB total written to it, I had 12 Ubuntu containers, and overall space utilization was under 50%, as was the consumed space within each container. Then I started getting logical I/O errors, PVE locked to read-only, and I had to back up what I could before throwing in the towel.
I ran diagnostics on the SSD: gparted showed 0 physical bad blocks, and the firmware is the latest available from Samsung (960 EVO). I searched Google trying to find out what could have gone wrong, and found other folks mentioning containers and VMs outgrowing their allotted storage despite PVE reporting otherwise.
Regardless, I'm hoping that a fresh install with ZFS on v5.1 will eliminate this problem, but even then I was reading about some people having arc_prune issues on Proxmox. Is that one of the issues being fixed by the addition of ZFS v0.7.2 in PVE v5.1?
This is my approach for re-installing with v5.1; let me know if you think I'm missing something to avoid data corruption in the future:
- I bought a 2nd identical Samsung 250GB M.2 SSD, and am going to install PVE v5.1 using ZFS RAID-1 instead of EXT4. I already have 64GB of ECC RAM, which should be more than enough for now and the future.
- Additionally, I mounted a spare 2TB HDD for writes, logging, and Plex transcoding.
- Finally, I'm going to schedule automatic container/VM backups to a separate FreeNAS machine on my LAN via an NFS share.
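
Since one theory I found was containers and VMs outgrowing their allotted storage despite PVE reporting otherwise, an independent check of filesystem usage on the host might be worth running on a schedule. The following is only a minimal sketch using Python's standard library: the function names, the 50% threshold, and the `/` path are my own placeholders, not anything PVE provides, and it only measures whatever filesystem holds each path you give it.

```python
import shutil


def usage_fraction(path):
    """Return the used fraction (0.0 to 1.0) of the filesystem holding `path`."""
    usage = shutil.disk_usage(path)
    return usage.used / usage.total


def over_threshold(paths, threshold=0.5):
    """Return {path: fraction} for every path whose usage exceeds `threshold`."""
    report = {}
    for path in paths:
        fraction = usage_fraction(path)
        if fraction > threshold:
            report[path] = fraction
    return report


if __name__ == "__main__":
    # Hypothetical list of mount points; replace with the real storage layout.
    for path, fraction in over_threshold(["/"]).items():
        print(f"{path}: {fraction:.0%} used")
```

Run from cron, any output would flag a filesystem that has crossed the threshold, which could then be compared against what PVE itself reports.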