Evening all... so I am at a bit of a quandary... One of my pools is reporting an error, and when I get the details from it, it tells me that a particular VMs disk is in the firing line...
Now that VM is a Zabbix server, so its not really the end of the world if it dies. I only loose historic data, and the config setup. I could backup the mariadb off to another location, and that means I don't even loose anything. This is my problem... The disk is obviously getting thrown up as some sort of checksum issue where it sits on the physical disk, I have a raid ) setup, so that checksum has replicated across disks, so not able to fix that problem, but the VM is actually working fine, without it reporting errors. I tried moving the disk off to another pool, but that failed half way through, so that kind of shows that there is some issue there.
What would you do? Ignore it? Rebuild the VM? Some how override the error so it doesn;t reappear after clearing and then scrubs?? What to do?
Code:
pool: rpool
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-8A
scan: scrub repaired 0B in 03:04:19 with 2 errors on Sun Jan 14 03:28:24 2024
config:
NAME STATE READ WRITE CKSUM
rpool ONLINE 0 0 0
ata-SAMSUNG_HD161HJ_S14LJA0Q536960-part3 ONLINE 0 0 4
ata-SAMSUNG_HD161GJ_S1VCJ9FZ318818-part3 ONLINE 0 0 0
errors: Permanent errors have been detected in the following files:
rpool/data/vm-101-disk-0:<0x1>
What would you do? Ignore it? Rebuild the VM? Some how override the error so it doesn;t reappear after clearing and then scrubs?? What to do?