Does your PVE is capable of writing mails? If so, please see here.There is an easy way in proxmox to receive email notifications about the zfs raid pool errors?
If you check the smart values (also via smartctl), a quick self test is also a good idea....and in this situation what I can do to recheck the faulted disk?
root@osx1:~# zpool status -v
pool: zfs-sas
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
scan: resilvered 2.09G in 00:02:07 with 0 errors on Mon Mar 13 18:31:55 2023
config:
NAME STATE READ WRITE CKSUM
zfs-sas ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
scsi-35000c50062e2f95b ONLINE 0 0 0
scsi-35000c50062e2f3f7 ONLINE 0 0 2
mirror-1 ONLINE 0 0 0
scsi-35000c50062e2f18f ONLINE 0 0 0
scsi-35000c50062e3329f ONLINE 0 0 0
root@osx1:~# smartctl -a /dev/sdc
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.85-1-pve] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Vendor: SEAGATE
Product: ST4000NM0023
Revision: 0004
Compliance: SPC-4
User Capacity: 4,000,787,030,016 bytes [4.00 TB]
Logical block size: 512 bytes
LU is fully provisioned
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Logical Unit id: 0x5000c50062e2f3f7
Serial number: Z1Z6RY370000W5198S5U
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Mon Mar 13 20:06:06 2023 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled
=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Current Drive Temperature: 38 C
Drive Trip Temperature: 60 C
Accumulated power on time, hours:minutes 34417:58
Manufactured in week 51 of year 2014
Specified cycle count over device lifetime: 10000
Accumulated start-stop cycles: 168
Specified load-unload count over device lifetime: 300000
Accumulated load-unload cycles: 1733
Elements in grown defect list: 0
Vendor (Seagate Cache) information
Blocks sent to initiator = 234162016
Blocks received from initiator = 15886246
Blocks read from cache and sent to initiator = 4482441
Number of read and write commands whose size <= segment size = 72480
Number of read and write commands whose size > segment size = 136
Vendor (Seagate/Hitachi) factory information
number of hours powered up = 34417.97
number of minutes until next internal SMART test = 22
Error counter log:
Errors Corrected by Total Correction Gigabytes Total
ECC rereads/ errors algorithm processed uncorrected
fast | delayed rewrites corrected invocations [10^9 bytes] errors
read: 317916583 0 0 317916583 0 119.891 0
write: 0 0 0 0 0 12.579 0
Non-medium error count: 0
We use essential cookies to make this site work, and optional cookies to enhance your experience.