ZFS data slowly getting corrupt

mayrjohannes

New Member
Jan 24, 2024
9
0
1
Hi all,
I run pve on a Lexxar SSD NM620 with zfs. I know this is not a enterprise SSD, but I only use it in my homelab on my HPE Proliant ML30 Server. The RAM is EEC protected.
A second zfs pool holds my important data (2 redundand HDD's).
A while ago I noticed that some old data on the Lexxar zfs pool is corrupt (the data in the main pool and in all snapshots). It was only 3 or 4 files. Now this weekend a few more files are added to this list. It is all very old data, being there from the time of installing pve.
Is this the first sign of the SSD slowly passing away? The SSD is only a year old.

I also noticed, that if i run to many LXC Containers (more than 10), the iodelay is getting very high. A few weeks ago, I moved some of the LXC's to the HDD Pool and the iodelay is now back to a few percent. Maybe this is also related?

Thx
 
I would check your SMART data. Go to -> proxmox VE GUI - > Host -> Disks and check if it still says passed by the S.M.A.R.T column.

It might also be bit rot but for just 1 year it seems to be really early to consider bit rot as a reason.

Also check your ZFS pool.
proxmox VE GUI - > Host -> ZFS and check the pool health.
 
If you are doing zfs and you want self-healing scrubs, you need at least a mirror... And you want everything on a UPS with NUT installed so you're getting clean power + shutdowns
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!