SAS disk faulted: how to receive notifications?

openaspace

Hi.
Is there an easy way in Proxmox to receive email notifications about ZFS RAID pool errors?

...and in this situation, what can I do to recheck the faulted disk?

Thank you.

osx1-Proxmox-Virtual-Environment(4).png
 
Is there an easy way in Proxmox to receive email notifications about ZFS RAID pool errors?
Is your PVE able to send mail? If so, please see here.
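One common way to get such mails is the ZFS Event Daemon (ZED), which reads its settings from /etc/zfs/zed.d/zed.rc. This may or may not be what the link above describes. The excerpt below is only a sketch: the recipient and the interval are assumed values, not settings taken from this thread.

```shell
# /etc/zfs/zed.d/zed.rc (excerpt; all values are illustrative assumptions)
ZED_EMAIL_ADDR="root"            # recipient address (assumed: the local root mailbox)
ZED_EMAIL_PROG="mail"            # mail program that ZED invokes
ZED_NOTIFY_INTERVAL_SECS=3600    # minimum seconds between mails for similar events
ZED_NOTIFY_VERBOSE=1             # also mail on non-error events, e.g. a finished scrub
```

After editing the file, the daemon has to be restarted; on Debian-based systems the service is usually named zfs-zed (systemctl restart zfs-zed).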


...and in this situation, what can I do to recheck the faulted disk?
Check the SMART values (for example with smartctl); a short self-test is also a good idea.
If everything looks OK to you, you can re-add the disk, let it resilver, and hope the error goes away.
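The steps above could be sketched roughly as follows. The helper names run and recheck_disk are hypothetical. The pool, vdev and /dev/sdc values are the ones that appear later in this thread and need adjusting for any other system. The script only prints the commands unless DRY_RUN=0 is set, since whether zpool online or zpool clear is the right call depends on the actual state of the device.

```shell
#!/bin/sh
# Hypothetical helper: prints (or, with DRY_RUN=0, runs) the recheck steps.
POOL="${POOL:-zfs-sas}"                    # pool name (taken from this thread)
VDEV="${VDEV:-scsi-35000c50062e2f3f7}"     # device name as shown by zpool status
DISK="${DISK:-/dev/sdc}"                   # block device used for smartctl
DRY_RUN="${DRY_RUN:-1}"                    # 1 = only print the commands

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

recheck_disk() {
    run smartctl -t short "$DISK"       # start a short SMART self-test
    run smartctl -l selftest "$DISK"    # read the self-test log once the test has finished
    run zpool online "$POOL" "$VDEV"    # bring the device back; ZFS resilvers as needed
    run zpool clear "$POOL" "$VDEV"     # reset the error counters for this device
    run zpool scrub "$POOL"             # re-read and verify all data in the pool
    run zpool status -v "$POOL"         # check the result
}
```

Calling recheck_disk with the defaults just lists the six commands, so they can be reviewed before running them for real.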
 
Thank you!
I think it is solved, without running a resilver myself. I detached two other disks that were on the same power cable; the fault was probably a slowdown caused by electrical overload.

After the reboot I no longer see any errors.
 
Code:
 root@osx1:~# zpool status -v
  pool: zfs-sas
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
  scan: resilvered 2.09G in 00:02:07 with 0 errors on Mon Mar 13 18:31:55 2023
config:

        NAME                        STATE     READ WRITE CKSUM
        zfs-sas                     ONLINE       0     0     0
          mirror-0                  ONLINE       0     0     0
            scsi-35000c50062e2f95b  ONLINE       0     0     0
            scsi-35000c50062e2f3f7  ONLINE       0     0     2
          mirror-1                  ONLINE       0     0     0
            scsi-35000c50062e2f18f  ONLINE       0     0     0
            scsi-35000c50062e3329f  ONLINE       0     0     0
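
Note that one device in the output above still shows a non-zero CKSUM counter. A minimal sketch for picking such rows out of zpool status text is shown here. The function name list_zpool_errors is hypothetical, and it assumes the usual NAME/STATE/READ/WRITE/CKSUM table layout.

```shell
# list_zpool_errors: read `zpool status` text on stdin and print every
# row of the config table whose READ, WRITE or CKSUM counter is not 0.
list_zpool_errors() {
    awk '
        # The header row marks the start of the config table.
        $1 == "NAME" && $2 == "STATE" { in_cfg = 1; next }
        # A blank line ends the table (the "errors:" line follows it).
        in_cfg && NF == 0 { in_cfg = 0; next }
        # Rows have at least 5 fields: name, state and three counters.
        in_cfg && NF >= 5 && ($3 != "0" || $4 != "0" || $5 != "0") {
            printf "%s read=%s write=%s cksum=%s\n", $1, $3, $4, $5
        }
    '
}
```

Usage would be zpool status | list_zpool_errors. After the cause is fixed, zpool clear (as the action text above suggests) resets the counters.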
 
Code:
root@osx1:~# smartctl -a /dev/sdc
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.85-1-pve] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor:               SEAGATE
Product:              ST4000NM0023
Revision:             0004
Compliance:           SPC-4
User Capacity:        4,000,787,030,016 bytes [4.00 TB]
Logical block size:   512 bytes
LU is fully provisioned
Rotation Rate:        7200 rpm
Form Factor:          3.5 inches
Logical Unit id:      0x5000c50062e2f3f7
Serial number:        Z1Z6RY370000W5198S5U
Device type:          disk
Transport protocol:   SAS (SPL-3)
Local Time is:        Mon Mar 13 20:06:06 2023 CET
SMART support is:     Available - device has SMART capability.
SMART support is:     Enabled
Temperature Warning:  Enabled

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK

Current Drive Temperature:     38 C
Drive Trip Temperature:        60 C

Accumulated power on time, hours:minutes 34417:58
Manufactured in week 51 of year 2014
Specified cycle count over device lifetime:  10000
Accumulated start-stop cycles:  168
Specified load-unload count over device lifetime:  300000
Accumulated load-unload cycles:  1733
Elements in grown defect list: 0

Vendor (Seagate Cache) information
  Blocks sent to initiator = 234162016
  Blocks received from initiator = 15886246
  Blocks read from cache and sent to initiator = 4482441
  Number of read and write commands whose size <= segment size = 72480
  Number of read and write commands whose size > segment size = 136

Vendor (Seagate/Hitachi) factory information
  number of hours powered up = 34417.97
  number of minutes until next internal SMART test = 22

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:   317916583        0         0  317916583          0        119.891           0
write:         0        0         0         0          0         12.579           0

Non-medium error count:        0
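
In the error counter log above, the last column (total uncorrected errors) is the one that matters most when judging the disk. A small sketch for extracting it from smartctl -a output on a SAS drive is shown here. The function name uncorrected_errors is hypothetical, and it assumes the row layout shown above.

```shell
# uncorrected_errors: read `smartctl -a` text (SAS/SCSI drive) on stdin and
# print the operation and its "total uncorrected errors" value per row.
uncorrected_errors() {
    awk '$1 == "read:" || $1 == "write:" || $1 == "verify:" {
        sub(":", "", $1)    # strip the trailing colon from the row label
        print $1, $NF       # the last field is the uncorrected-error count
    }'
}
```

Usage would be smartctl -a /dev/sdc | uncorrected_errors.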
 
