I am having issues where my SAS HDD disconnects every now and then causing errors. I already had similar issues with sdc and sdd, those were resolved by a host update and the addition of a proper cooling solution for the HBA (LSI-9207-8i). That was a few months ago and i didn't have any issues since then up until a few days ago. What could this be caused by?
Result of a long SMART test:
dmesg -w output (i cut out repeating stuff to keep it short):
Result of a long SMART test:
Code:
root@trinity:~# sudo smartctl -a /dev/sdd
smartctl 7.3 2022-02-28 r5338 [x86_64-linux-6.8.12-9-pve] (local build)
Copyright (C) 2002-22, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Vendor: HGST
Product: H7280A520SUN8.0T
Revision: PD51
Compliance: SPC-4
User Capacity: 7,865,536,647,168 bytes [7.86 TB]
Logical block size: 512 bytes
Physical block size: 4096 bytes
Formatted with type 1 protection
8 bytes of protection information per logical block
LU is fully provisioned
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Logical Unit id: 0x5000cca262181f30
Serial number: 001721PE87TV R5GE87TV
Device type: disk
Transport protocol: SAS (SPL-4)
Local Time is: Thu Apr 17 14:10:34 2025 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled
=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Current Drive Temperature: 31 C
Drive Trip Temperature: 85 C
Accumulated power on time, hours:minutes 3747:17
Manufactured in week 21 of year 2017
Specified cycle count over device lifetime: 50000
Accumulated start-stop cycles: 569
Specified load-unload count over device lifetime: 600000
Accumulated load-unload cycles: 3177
Elements in grown defect list: 0
Vendor (Seagate Cache) information
Blocks sent to initiator = 44102477045301248
Error counter log:
Errors Corrected by Total Correction Gigabytes Total
ECC rereads/ errors algorithm processed uncorrected
fast | delayed rewrites corrected invocations [10^9 bytes] errors
read: 0 24 0 24 19137688 416286.190 0
write: 0 0 0 0 2609689 344981.632 0
verify: 0 0 0 0 65628 0.000 0
Non-medium error count: 0
SMART Self-test log
Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background long Self test in progress ... - NOW - [- - -]
Long (extended) Self-test duration: 69720 seconds [19.4 hours]
dmesg -w output (i cut out repeating stuff to keep it short):
Code:
[ 148.600865] sd 0:0:3:0: device_block, handle(0x000c)
[ 149.600404] sd 0:0:3:0: device_unblock and setting to running, handle(0x000c)
[ 164.642244] sd 0:0:3:0: Power-on or device reset occurred
[ 466.617990] sd 0:0:3:0: device_block, handle(0x000c)
[ 467.617726] sd 0:0:3:0: device_unblock and setting to running, handle(0x000c)
[ 471.854613] sd 0:0:3:0: Power-on or device reset occurred
[ 474.337589] sd 0:0:3:0: [sdd] tag#2746 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[ 474.337593] sd 0:0:3:0: [sdd] tag#2746 Sense Key : Illegal Request [current] [descriptor]
[ 474.337595] sd 0:0:3:0: [sdd] tag#2746 Add. Sense: Invalid field in cdb
[ 474.337597] sd 0:0:3:0: [sdd] tag#2746 CDB: Write(16) 8a 20 00 00 00 00 08 80 32 28 00 00 00 08 00 00
[ 474.337598] critical target error, dev sdd, sector 142619176 op 0x1:(WRITE) flags 0x0 phys_seg 1 prio class 0
[ 474.337602] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=2 offset=73019969536 size=4096 flags=1572992
[ 474.337806] sd 0:0:3:0: [sdd] tag#2747 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[ 474.337809] sd 0:0:3:0: [sdd] tag#2747 Sense Key : Illegal Request [current] [descriptor]
[ 474.337811] sd 0:0:3:0: [sdd] tag#2747 Add. Sense: Invalid field in cdb
[ 474.337812] sd 0:0:3:0: [sdd] tag#2747 CDB: Write(16) 8a 20 00 00 00 00 28 80 2d 58 00 00 00 08 00 00
[ 474.337813] critical target error, dev sdd, sector 679488856 op 0x1:(WRITE) flags 0x0 phys_seg 1 prio class 0
[ 474.337816] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=2 offset=347897245696 size=4096 flags=1572992
[ 474.338164] sd 0:0:3:0: [sdd] tag#2748 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 474.338168] sd 0:0:3:0: [sdd] tag#2748 Sense Key : Illegal Request [current] [descriptor]
[ 474.338170] sd 0:0:3:0: [sdd] tag#2748 Add. Sense: Invalid field in cdb
[ 474.338195] sd 0:0:3:0: [sdd] tag#2748 CDB: Write(16) 8a 20 00 00 00 00 18 80 2c d0 00 00 00 08 00 00
[ 474.338197] critical target error, dev sdd, sector 411053264 op 0x1:(WRITE) flags 0x0 phys_seg 1 prio class 0
[ 474.338200] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=2 offset=210458222592 size=4096 flags=1572992
[ 474.338354] sd 0:0:3:0: [sdd] tag#2749 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 474.338356] sd 0:0:3:0: [sdd] tag#2749 Sense Key : Illegal Request [current] [descriptor]
[ 474.338358] sd 0:0:3:0: [sdd] tag#2749 Add. Sense: Invalid field in cdb
[ 474.338360] sd 0:0:3:0: [sdd] tag#2749 CDB: Read(16) 88 20 00 00 00 00 00 00 0a 10 00 00 00 10 00 00
[ 474.338361] critical target error, dev sdd, sector 2576 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[ 474.338364] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=1 offset=270336 size=8192 flags=721089
[ 474.338594] sd 0:0:3:0: [sdd] tag#2750 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 474.338596] sd 0:0:3:0: [sdd] tag#2750 Sense Key : Illegal Request [current] [descriptor]
[ 474.338598] sd 0:0:3:0: [sdd] tag#2750 Add. Sense: Invalid field in cdb
[ 474.338599] sd 0:0:3:0: [sdd] tag#2750 CDB: Read(16) 88 20 00 00 00 03 93 aa fc 10 00 00 00 10 00 00
[ 474.338600] critical target error, dev sdd, sector 15362358288 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[ 474.338603] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=1 offset=7865526394880 size=8192 flags=721089
[ 474.338824] sd 0:0:3:0: [sdd] tag#2751 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 474.338826] sd 0:0:3:0: [sdd] tag#2751 Sense Key : Illegal Request [current] [descriptor]
[ 474.338828] sd 0:0:3:0: [sdd] tag#2751 Add. Sense: Invalid field in cdb
[ 474.338830] sd 0:0:3:0: [sdd] tag#2751 CDB: Read(16) 88 20 00 00 00 03 93 aa fe 10 00 00 00 10 00 00
[ 474.338831] critical target error, dev sdd, sector 15362358800 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[ 474.338833] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=1 offset=7865526657024 size=8192 flags=721089
[ 474.339056] sd 0:0:3:0: [sdd] tag#2240 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 474.339059] sd 0:0:3:0: [sdd] tag#2240 Sense Key : Illegal Request [current] [descriptor]
[ 474.339060] sd 0:0:3:0: [sdd] tag#2240 Add. Sense: Invalid field in cdb
[ 474.339062] sd 0:0:3:0: [sdd] tag#2240 CDB: Write(16) 8a 20 00 00 00 00 28 80 2d 60 00 00 00 08 00 00
[ 474.339063] critical target error, dev sdd, sector 679488864 op 0x1:(WRITE) flags 0x0 phys_seg 1 prio class 0
[ 474.339066] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=2 offset=347897249792 size=4096 flags=1572992
[ 474.339275] sd 0:0:3:0: [sdd] tag#2241 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 474.339277] sd 0:0:3:0: [sdd] tag#2241 Sense Key : Illegal Request [current] [descriptor]
[ 474.339279] sd 0:0:3:0: [sdd] tag#2241 Add. Sense: Invalid field in cdb
[ 474.339281] sd 0:0:3:0: [sdd] tag#2241 CDB: Write(16) 8a 20 00 00 00 01 b5 00 2e 68 00 00 00 08 00 00
[ 474.339282] critical target error, dev sdd, sector 7331655272 op 0x1:(WRITE) flags 0x0 phys_seg 1 prio class 0
[ 474.339284] zio pool=zfs-pool vdev=/dev/disk/by-id/scsi-35000cca262181f30-part1 error=121 type=2 offset=3753806450688 size=4096 flags=1572992
[ 780.134869] sd 0:0:3:0: device_block, handle(0x000c)
[ 781.134475] sd 0:0:3:0: device_unblock and setting to running, handle(0x000c)
[ 3617.644571] sd 0:0:3:0: Power-on or device reset occurred
[ 3619.707360] sd 0:0:3:0: [sdd] tag#2470 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=2s
[ 3619.707364] sd 0:0:3:0: [sdd] tag#2470 Sense Key : Illegal Request [current] [descriptor]
[ 3619.707366] sd 0:0:3:0: [sdd] tag#2470 Add. Sense: Invalid field in cdb
[ 3619.707368] sd 0:0:3:0: [sdd] tag#2470 CDB: Read(16) 88 20 00 00 00 03 93 ab 41 80 00 00 00 08 00 00
[ 3619.707369] critical target error, dev sdd, sector 15362376064 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
[ 3619.707822] sd 0:0:3:0: [sdd] tag#2471 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 3619.707832] sd 0:0:3:0: [sdd] tag#2471 Sense Key : Illegal Request [current] [descriptor]
[ 3619.707848] sd 0:0:3:0: [sdd] tag#2471 Add. Sense: Invalid field in cdb
[ 3619.707850] sd 0:0:3:0: [sdd] tag#2471 CDB: Read(16) 88 20 00 00 00 03 93 ab 41 80 00 00 00 08 00 00
[ 3619.707852] critical target error, dev sdd, sector 15362376064 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[ 3619.707855] Buffer I/O error on dev sdd, logical block 1920297008, async page read
[ 3925.804224] sd 0:0:3:0: device_block, handle(0x000c)
[ 3926.803956] sd 0:0:3:0: device_unblock and setting to running, handle(0x000c)
[39619.976374] sd 0:0:3:0: Power-on or device reset occurred
[39627.306406] sd 0:0:3:0: [sdd] tag#2283 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=7s
[39627.306421] sd 0:0:3:0: [sdd] tag#2283 Sense Key : Illegal Request [current] [descriptor]
[39627.306429] sd 0:0:3:0: [sdd] tag#2283 Add. Sense: Invalid field in cdb
[39627.306438] sd 0:0:3:0: [sdd] tag#2283 CDB: Read(16) 88 20 00 00 00 03 93 ab 41 80 00 00 00 08 00 00
[39627.306455] critical target error, dev sdd, sector 15362376064 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
[39627.306767] sd 0:0:3:0: [sdd] tag#2284 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[39627.306776] sd 0:0:3:0: [sdd] tag#2284 Sense Key : Illegal Request [current] [descriptor]
[39627.306783] sd 0:0:3:0: [sdd] tag#2284 Add. Sense: Invalid field in cdb
[39627.306800] sd 0:0:3:0: [sdd] tag#2284 CDB: Read(16) 88 20 00 00 00 03 93 ab 41 80 00 00 00 08 00 00
[39627.306801] critical target error, dev sdd, sector 15362376064 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[39627.306803] Buffer I/O error on dev sdd, logical block 1920297008, async page read
[39932.993310] sd 0:0:3:0: device_block, handle(0x000c)
[39934.242832] sd 0:0:3:0: device_unblock and setting to running, handle(0x000c)
[40247.760270] sd 0:0:3:0: device_block, handle(0x000c)