SATA link down issues

TheMat556 · Nov 25, 2025

Hello everyone,

I am currently struggling a lot with my ZFS Pool (mainly SATA Issues). Every now and then i get a "SATA link down", "hard resetting link", "link is slow to respond, please be patient (ready=0)". This then leads to ZFS Pool error, which than degregate my whole pool. As I thought a HDD is the cause of this whole issue, I tried to replace this HDD. But currently during resilvering, the SATA link issues still happen. I dug into the logs but just couldnt find any cause of the issue. Eventually you guys have an idea to solve this issue. First to my setup:

Motherboard: AsRock B450 Pro4 - i already checked for Aggressive Link Power Management (didnt find this option in the BIOS) and other options that could influence the behavior. The BIOS version is 10.41. Every HDD / SSD
CPU: Ryzen 5 5600G
HDD: 4x SEAGATE 4TB IronWolf (these are different models)
SSD: 2x SANDISK 1TB
OS: Proxmox VE 9.1.1
GPU: Intel ARC A380 (mainly for transcoding
Power Supply: BeQuiet! Power 11 Platinum (1000W Platinum Plus)

I will provide an whole system overview here: https://pastebin.com/FuUcD67w

I run the whole ZFS Pool for 2 months now, here and then i got some issues. I already got the issue about a month ago, then just started from 0 and setup the pool again - which then worked like a charm. About two weeks ago - again i got a lot of SATA Link Errors, which i resolved just with a scrub and then the system worked nice until now! Currently the 4 drives are connected via 3 different SATA power lines (which i read could be an issue, but didnt resolve anything). I also have the feeling that the change of the HDD is not quite the solution to this problem - as I think the system have another issue. Also i tried to change the SATA cables, without any luck (tried 3 different pairs, I think CableMatters was one of them). For the drives in detail:

lsblk: https://pastebin.com/shJn2ryK
more detailed lsblk: https://pastebin.com/JszCL33G
dmesg -T: https://pastebin.com/DG159WLU (interestingly the drives operate for quite some time, and suddenly start loosing SATA connection, then operate again)

Code:

[Mon Nov 24 21:20:28 2025] audit: type=1400 audit(1764015628.258:513): apparmor="DENIED" operation="sendmsg" class="file" namespace="root//lxc-123_<-var-lib-lxc>" profile="rsyslogd" name="/run/systemd/journal/dev-log" pid=14739 comm="systemd-journal" requested_mask="r" denied_mask="r" fsuid=100000 ouid=100000
[Mon Nov 24 21:21:49 2025] ata9.00: exception Emask 0x10 SAct 0x20400 SErr 0x40002 action 0x6 frozen
[Mon Nov 24 21:21:49 2025] ata9.00: irq_stat 0x08000000, interface fatal error
[Mon Nov 24 21:21:49 2025] ata9: SError: { RecovComm CommWake }
[Mon Nov 24 21:21:49 2025] ata9.00: failed command: WRITE FPDMA QUEUED
[Mon Nov 24 21:21:49 2025] ata9.00: cmd 61/50:50:c0:47:82/00:00:2b:00:00/40 tag 10 ncq dma 40960 out
                                    res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x10 (ATA bus error)
[Mon Nov 24 21:21:49 2025] ata9.00: status: { DRDY }
[Mon Nov 24 21:21:49 2025] ata9.00: failed command: WRITE FPDMA QUEUED
[Mon Nov 24 21:21:49 2025] ata9.00: cmd 61/50:88:18:48:82/00:00:2b:00:00/40 tag 17 ncq dma 40960 out
                                    res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x10 (ATA bus error)
[Mon Nov 24 21:21:49 2025] ata9.00: status: { DRDY }
[Mon Nov 24 21:21:49 2025] ata9: hard resetting link
[Mon Nov 24 21:21:49 2025] ata6.00: limiting speed to UDMA/100:PIO4
[Mon Nov 24 21:21:49 2025] ata6.00: exception Emask 0x52 SAct 0x1000 SErr 0x30c02 action 0xe frozen
[Mon Nov 24 21:21:49 2025] ata6.00: irq_stat 0x00400000, PHY RDY changed
[Mon Nov 24 21:21:49 2025] ata6: SError: { RecovComm Proto HostInt PHYRdyChg PHYInt }
[Mon Nov 24 21:21:49 2025] ata6.00: failed command: READ FPDMA QUEUED
[Mon Nov 24 21:21:49 2025] ata6.00: cmd 60/e8:60:a0:4e:82/07:00:2b:00:00/40 tag 12 ncq dma 1036288 in
                                    res 40/00:01:06:4f:c2/00:00:00:00:00/00 Emask 0x52 (ATA bus error)
[Mon Nov 24 21:21:49 2025] ata6.00: status: { DRDY }
[Mon Nov 24 21:21:49 2025] ata6: hard resetting link
[Mon Nov 24 21:21:54 2025] ata9: link is slow to respond, please be patient (ready=0)
[Mon Nov 24 21:21:55 2025] ata6: link is slow to respond, please be patient (ready=0)
[Mon Nov 24 21:21:56 2025] ata9: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
[Mon Nov 24 21:21:56 2025] ata9.00: configured for UDMA/33
[Mon Nov 24 21:21:56 2025] ata9: EH complete
[Mon Nov 24 21:21:59 2025] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
[Mon Nov 24 21:21:59 2025] ata6.00: configured for UDMA/100
[Mon Nov 24 21:21:59 2025] ata6: EH complete
[Mon Nov 24 21:25:01 2025] audit: type=1400 audit(1764015901.480:514): apparmor="DENIED" operation="sendmsg" class="file" namespace="root//lxc-123_<-var-lib-lxc>" profile="rsyslogd" name="/run/systemd/journal/dev-log" pid=14739 comm="systemd-journal" requested_mask="r" denied_mask="r" fsuid=100000 ouid=100000
[Mon Nov 24 21:25:01 2025] audit: type=1400 audit(1764015901.480:515): apparmor="DENIED" operation="sendmsg" class="file" namespace="root//lxc-123_<-var-lib-lxc>" profile="rsyslogd" name="/run/systemd/journal/dev-log" pid=14739 comm="systemd-journal" requested_mask="r" denied_mask="r" fsuid=100000 ouid=100000

smartctl -a /dev/sdc: https://pastebin.com/fFK5Nwam
smartctl -a /dev/sdd: https://pastebin.com/E907QRx7
smartctl -a /dev/sde: https://pastebin.com/DvVsDxnc
smartctl -a /dev/sdf: https://pastebin.com/9vVxc2F0

I am not that of a professional in smartctl so my knockledge is no the best here - but from my view each drive should be okay.
As i tried to replace one drive, as said, the pool is currently resilvering - but i have the feeling this will not solve the issue (for a long time). Also i have a second pool (with SSDs, which dont make any problem), see:

zpool status: https://pastebin.com/ErKPvhne
zpool status -v: https://pastebin.com/WtdX81UB
hdparm -Tt dev/xxx: https://pastebin.com/1BKquwK1
zpool iostat -v 1 10: https://pastebin.com/dJNn53Z5

I already posted this issue on reddit, but as i am not quite sure whats the case of the issue I am also trying my luck here.
I know this is a lot of information / logs - but i would preciate any kind of hint that could help me to reduce this errors! If i forgot any kind of infromation, please let me know. Thanks in advance!!!

Sviat · Nov 26, 2025

Hi,

It can be not drivers related, check the Asus forum.
You can try to add kernel parameters to limit link speed to 3.0 Gbps instead of auto-negotiation

Code:

libata.force=3.0Gbps

beposec · Mar 6, 2026

@TheMat556 Were you able to resolve your issue?
I suspect this might be related to a problem with the 6.17 kernel:

https://bugzilla.kernel.org/show_bug.cgi?id=220693
https://bbs.archlinux.org/viewtopic.php?id=310008

Perhaps the Proxmox team could take a look at this issue.

matada · Mar 7, 2026

So it's not just me with this issue. Well that's good to know I guess?

I thought the single hdd in my server was dying because it'd randomly disappear and crash any containers that used the NFS mount for it when there was heavy read/write to that disk so badly I'd have to reboot the entire proxmox machine to get it to restart again. The last crash was so severe it nearly crashed almost all of my containers (most of which didn't even use the hdd) and almost proxmox itself (webUI down, API down, could SSH into proxmox to reboot at least).

Went back to 6.14 and it's been fine ever since. Wonder why despite having similar hardware in both the server (9900x and a Gigabyte B550 motherboard) and my desktop (9950x and a ASRock X870 steel legend) my desktop had zero hdd issues on 6.17.

floh8 · Mar 7, 2026

Test it with other data cable and other hba port. Replace the HDD.

TheMat556 · Mar 11, 2026

beposec said:
@TheMat556 Were you able to resolve your issue?
I suspect this might be related to a problem with the 6.17 kernel:

https://bugzilla.kernel.org/show_bug.cgi?id=220693
https://bbs.archlinux.org/viewtopic.php?id=310008

Perhaps the Proxmox team could take a look at this issue.

matada said:
So it's not just me with this issue. Well that's good to know I guess?

I thought the single hdd in my server was dying because it'd randomly disappear and crash any containers that used the NFS mount for it when there was heavy read/write to that disk so badly I'd have to reboot the entire proxmox machine to get it to restart again. The last crash was so severe it nearly crashed almost all of my containers (most of which didn't even use the hdd) and almost proxmox itself (webUI down, API down, could SSH into proxmox to reboot at least).

Went back to 6.14 and it's been fine ever since. Wonder why despite having similar hardware in both the server (9900x and a Gigabyte B550 motherboard) and my desktop (9950x and a ASRock X870 steel legend) my desktop had zero hdd issues on 6.17.

Sorry for the late reply, I completely forgot to post my solution here. I may have been a little blind. Although it was pointed out to me several times that it could be the PSU, I thought, “It's a beQuiet Platinium, that can't be it.” It turns out that the PSU was the source of the error. It was strange that everything worked fine without the HDDs. I assume that the SATA rails of the PSU were causing some kind of problem. I had tested everything (HBA card, other cables, etc.) until I tried a different PSU!

Note: Since your problems may concern the kernel, I would like to reiterate that I did not encounter any issues with it; it was purely a hardware problem.

floh8 · Mar 11, 2026

thx for info
mark this thread as. Solved?

SATA link down issues

TheMat556

New Member

Sviat

Active Member

beposec

New Member

matada

New Member

floh8

Renowned Member

TheMat556

New Member

floh8

Renowned Member

We value your privacy