Hello.
Time by time i have a problem that my SSDs have an errors an storage is unaccessible.
A few lines from log:
And then with all othed SSDs.
But NVMe (on which proxmox is installed works fine).
After host restart all works fine.
Same problem i have on few nodes (on other proxmox installed on SSD too). But lost only disks with lmv-thin storages.
Time by time i have a problem that my SSDs have an errors an storage is unaccessible.
A few lines from log:
Apr 6 12:34:00 ps13 systemd[1]: Started Proxmox VE replication runner.
Apr 6 12:35:00 ps13 systemd[1]: Starting Proxmox VE replication runner...
Apr 6 12:35:00 ps13 systemd[1]: pvesr.service: Succeeded.
Apr 6 12:35:00 ps13 systemd[1]: Started Proxmox VE replication runner.
Apr 6 12:36:00 ps13 systemd[1]: Starting Proxmox VE replication runner...
Apr 6 12:36:00 ps13 systemd[1]: pvesr.service: Succeeded.
Apr 6 12:36:00 ps13 systemd[1]: Started Proxmox VE replication runner.
Apr 6 12:36:46 ps13 kernel: [362858.768260] ahci 0000:02:00.1: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0010 address=0xfca8e000 flags=0x0000]
Apr 6 12:36:47 ps13 kernel: [362859.050225] ata2.00: exception Emask 0x10 SAct 0x700063ff SErr 0x0 action 0x6 frozen
Apr 6 12:36:47 ps13 kernel: [362859.050227] ata2.00: irq_stat 0x08000000, interface fatal error
Apr 6 12:36:47 ps13 kernel: [362859.050230] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050232] ata2.00: cmd 61/08:00:58:c3:36/00:00:0a:00:00/40 tag 0 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050232] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050235] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050236] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050238] ata2.00: cmd 61/08:08:f8:2b:3f/00:00:0a:00:00/40 tag 1 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050238] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050241] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050242] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050245] ata2.00: cmd 61/08:10:60:2c:3f/00:00:0a:00:00/40 tag 2 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050245] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050247] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050248] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050251] ata2.00: cmd 61/10:18:c8:74:de/00:00:01:00:00/40 tag 3 ncq dma 8192 out
Apr 6 12:36:47 ps13 kernel: [362859.050251] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050253] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050254] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050257] ata2.00: cmd 61/08:20:08:c6:01/00:00:02:00:00/40 tag 4 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050257] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050259] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050260] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050263] ata2.00: cmd 61/08:28:18:c6:01/00:00:02:00:00/40 tag 5 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050263] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050265] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050267] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050269] ata2.00: cmd 61/30:30:c0:02:02/00:00:02:00:00/40 tag 6 ncq dma 24576 out
Apr 6 12:36:47 ps13 kernel: [362859.050269] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050271] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050273] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050275] ata2.00: cmd 61/08:38:00:39:4e/00:00:03:00:00/40 tag 7 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050275] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050277] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050279] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050281] ata2.00: cmd 61/08:40:a8:3d:4e/00:00:03:00:00/40 tag 8 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050281] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050283] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050285] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050287] ata2.00: cmd 61/10:48:58:44:4e/00:00:03:00:00/40 tag 9 ncq dma 8192 out
Apr 6 12:36:47 ps13 kernel: [362859.050287] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050289] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050291] ata2.00: failed command: READ FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050293] ata2.00: cmd 60/08:68:40:ae:43/00:00:73:00:00/40 tag 13 ncq dma 4096 in
Apr 6 12:36:47 ps13 kernel: [362859.050293] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050296] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050297] ata2.00: failed command: READ FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050299] ata2.00: cmd 60/00:70:00:00:00/01:00:00:00:00/40 tag 14 ncq dma 131072 in
Apr 6 12:36:47 ps13 kernel: [362859.050299] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050302] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050303] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050305] ata2.00: cmd 61/08:e0:90:c2:4e/00:00:08:00:00/40 tag 28 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050305] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050308] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050309] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050311] ata2.00: cmd 61/08:e8:c0:8e:4f/00:00:08:00:00/40 tag 29 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050311] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050314] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050315] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050317] ata2.00: cmd 61/10:f0:00:08:50/00:00:08:00:00/40 tag 30 ncq dma 8192 out
Apr 6 12:36:47 ps13 kernel: [362859.050317] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050320] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050323] ata2: hard resetting link
Apr 6 12:36:57 ps13 kernel: [362869.051079] ata2: softreset failed (1st FIS failed)
Apr 6 12:36:57 ps13 kernel: [362869.051085] ata2: hard resetting link
Apr 6 12:37:00 ps13 systemd[1]: Starting Proxmox VE replication runner...
Apr 6 12:37:00 ps13 systemd[1]: pvesr.service: Succeeded.
Apr 6 12:37:00 ps13 systemd[1]: Started Proxmox VE replication runner.
Apr 6 12:37:07 ps13 kernel: [362879.050916] ata2: softreset failed (1st FIS failed)
Apr 6 12:37:07 ps13 kernel: [362879.050922] ata2: hard resetting link
Apr 6 12:37:17 ps13 kernel: [362889.002142] ata6.00: exception Emask 0x0 SAct 0x400000 SErr 0x0 action 0x6 frozen
Apr 6 12:37:17 ps13 kernel: [362889.002148] ata6.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002152] ata6.00: cmd 60/00:b0:00:00:00/01:00:00:00:00/40 tag 22 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002152] res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002155] ata6.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002158] ata6: hard resetting link
Apr 6 12:37:17 ps13 kernel: [362889.002169] ata5.00: exception Emask 0x0 SAct 0x400000 SErr 0x0 action 0x6 frozen
Apr 6 12:37:17 ps13 kernel: [362889.002173] ata5.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002176] ata5.00: cmd 60/00:b0:00:00:00/01:00:00:00:00/40 tag 22 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002176] res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002180] ata5.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002182] ata5: hard resetting link
Apr 6 12:37:17 ps13 kernel: [362889.002191] ata1.00: exception Emask 0x0 SAct 0x3000 SErr 0x0 action 0x6 frozen
Apr 6 12:37:17 ps13 kernel: [362889.002193] ata1.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002196] ata1.00: cmd 60/00:60:00:00:00/01:00:00:00:00/40 tag 12 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002196] res 40/00:01:06:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002198] ata1.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002199] ata1.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002202] ata1.00: cmd 60/00:68:00:08:00/01:00:00:00:00/40 tag 13 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002202] res 40/00:ff:ff:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002204] ata1.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002206] ata1: hard resetting link
Apr 6 12:37:27 ps13 kernel: [362899.002609] ata1: softreset failed (1st FIS failed)
Apr 6 12:37:27 ps13 kernel: [362899.002615] ata1: hard resetting link
Apr 6 12:37:27 ps13 kernel: [362899.002653] ata5: softreset failed (1st FIS failed)
Apr 6 12:37:27 ps13 kernel: [362899.002656] ata5: hard resetting link
Apr 6 12:37:27 ps13 kernel: [362899.002706] ata6: softreset failed (1st FIS failed)
Apr 6 12:37:27 ps13 kernel: [362899.002709] ata6: hard resetting link
Apr 6 12:37:37 ps13 kernel: [362909.002577] ata1: softreset failed (1st FIS failed)
Apr 6 12:37:37 ps13 kernel: [362909.002583] ata1: hard resetting link
Apr 6 12:37:37 ps13 kernel: [362909.002596] ata6: softreset failed (1st FIS failed)
Apr 6 12:37:37 ps13 kernel: [362909.002599] ata6: hard resetting link
Apr 6 12:37:37 ps13 kernel: [362909.002637] ata5: softreset failed (1st FIS failed)
Apr 6 12:37:37 ps13 kernel: [362909.002640] ata5: hard resetting link
Apr 6 12:37:42 ps13 kernel: [362914.050962] ata2: softreset failed (1st FIS failed)
Apr 6 12:37:42 ps13 kernel: [362914.050969] ata2: limiting SATA link speed to 3.0 Gbps
Apr 6 12:37:42 ps13 kernel: [362914.050970] ata2: hard resetting link
Apr 6 12:37:47 ps13 kernel: [362919.050857] ata2: softreset failed (1st FIS failed)
Apr 6 12:37:47 ps13 kernel: [362919.050864] ata2: reset failed, giving up
Apr 6 12:37:47 ps13 kernel: [362919.050866] ata2.00: disabled
Apr 6 12:37:47 ps13 kernel: [362919.050886] ata2: EH complete
Apr 6 12:37:47 ps13 kernel: [362919.050911] sd 1:0:0:0: [sdb] tag#15 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050912] sd 1:0:0:0: [sdb] tag#15 CDB: Read(10) 28 00 02 a1 f6 30 00 00 08 00
Apr 6 12:37:47 ps13 kernel: [362919.050914] blk_update_request: I/O error, dev sdb, sector 44168752 op 0x0READ) flags 0x0 phys_seg 1 prio class 0
Apr 6 12:37:47 ps13 kernel: [362919.050930] sd 1:0:0:0: [sdb] tag#18 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050931] sd 1:0:0:0: [sdb] tag#18 CDB: Read(10) 28 00 01 de 47 98 00 00 08 00
Apr 6 12:37:47 ps13 kernel: [362919.050931] blk_update_request: I/O error, dev sdb, sector 31344536 op 0x0READ) flags 0x0 phys_seg 1 prio class 0
Apr 6 12:37:47 ps13 kernel: [362919.050940] sd 1:0:0:0: [sdb] tag#19 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050941] sd 1:0:0:0: [sdb] tag#19 CDB: Write(10) 2a 00 08 50 08 00 00 00 10 00
Apr 6 12:37:47 ps13 kernel: [362919.050942] blk_update_request: I/O error, dev sdb, sector 139462656 op 0x1WRITE) flags 0x800 phys_seg 2 prio class 0
Apr 6 12:37:47 ps13 kernel: [362919.050949] Buffer I/O error on dev dm-24, logical block 358784, lost async page write
Apr 6 12:37:47 ps13 kernel: [362919.050957] Buffer I/O error on dev dm-24, logical block 358785, lost async page write
Apr 6 12:37:47 ps13 kernel: [362919.050961] sd 1:0:0:0: [sdb] tag#20 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050962] sd 1:0:0:0: [sdb] tag#20 CDB: Write(10) 2a 00 08 4f 8e c0 00 00 08 00
Apr 6 12:37:47 ps13 kernel: [362919.050963] blk_update_request: I/O error, dev sdb, sector 139431616 op 0x1WRITE) flags 0x800 phys_seg 1 prio class 0
Apr 6 12:35:00 ps13 systemd[1]: Starting Proxmox VE replication runner...
Apr 6 12:35:00 ps13 systemd[1]: pvesr.service: Succeeded.
Apr 6 12:35:00 ps13 systemd[1]: Started Proxmox VE replication runner.
Apr 6 12:36:00 ps13 systemd[1]: Starting Proxmox VE replication runner...
Apr 6 12:36:00 ps13 systemd[1]: pvesr.service: Succeeded.
Apr 6 12:36:00 ps13 systemd[1]: Started Proxmox VE replication runner.
Apr 6 12:36:46 ps13 kernel: [362858.768260] ahci 0000:02:00.1: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0010 address=0xfca8e000 flags=0x0000]
Apr 6 12:36:47 ps13 kernel: [362859.050225] ata2.00: exception Emask 0x10 SAct 0x700063ff SErr 0x0 action 0x6 frozen
Apr 6 12:36:47 ps13 kernel: [362859.050227] ata2.00: irq_stat 0x08000000, interface fatal error
Apr 6 12:36:47 ps13 kernel: [362859.050230] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050232] ata2.00: cmd 61/08:00:58:c3:36/00:00:0a:00:00/40 tag 0 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050232] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050235] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050236] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050238] ata2.00: cmd 61/08:08:f8:2b:3f/00:00:0a:00:00/40 tag 1 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050238] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050241] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050242] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050245] ata2.00: cmd 61/08:10:60:2c:3f/00:00:0a:00:00/40 tag 2 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050245] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050247] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050248] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050251] ata2.00: cmd 61/10:18:c8:74:de/00:00:01:00:00/40 tag 3 ncq dma 8192 out
Apr 6 12:36:47 ps13 kernel: [362859.050251] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050253] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050254] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050257] ata2.00: cmd 61/08:20:08:c6:01/00:00:02:00:00/40 tag 4 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050257] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050259] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050260] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050263] ata2.00: cmd 61/08:28:18:c6:01/00:00:02:00:00/40 tag 5 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050263] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050265] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050267] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050269] ata2.00: cmd 61/30:30:c0:02:02/00:00:02:00:00/40 tag 6 ncq dma 24576 out
Apr 6 12:36:47 ps13 kernel: [362859.050269] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050271] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050273] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050275] ata2.00: cmd 61/08:38:00:39:4e/00:00:03:00:00/40 tag 7 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050275] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050277] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050279] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050281] ata2.00: cmd 61/08:40:a8:3d:4e/00:00:03:00:00/40 tag 8 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050281] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050283] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050285] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050287] ata2.00: cmd 61/10:48:58:44:4e/00:00:03:00:00/40 tag 9 ncq dma 8192 out
Apr 6 12:36:47 ps13 kernel: [362859.050287] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050289] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050291] ata2.00: failed command: READ FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050293] ata2.00: cmd 60/08:68:40:ae:43/00:00:73:00:00/40 tag 13 ncq dma 4096 in
Apr 6 12:36:47 ps13 kernel: [362859.050293] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050296] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050297] ata2.00: failed command: READ FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050299] ata2.00: cmd 60/00:70:00:00:00/01:00:00:00:00/40 tag 14 ncq dma 131072 in
Apr 6 12:36:47 ps13 kernel: [362859.050299] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050302] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050303] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050305] ata2.00: cmd 61/08:e0:90:c2:4e/00:00:08:00:00/40 tag 28 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050305] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050308] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050309] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050311] ata2.00: cmd 61/08:e8:c0:8e:4f/00:00:08:00:00/40 tag 29 ncq dma 4096 out
Apr 6 12:36:47 ps13 kernel: [362859.050311] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050314] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050315] ata2.00: failed command: WRITE FPDMA QUEUED
Apr 6 12:36:47 ps13 kernel: [362859.050317] ata2.00: cmd 61/10:f0:00:08:50/00:00:08:00:00/40 tag 30 ncq dma 8192 out
Apr 6 12:36:47 ps13 kernel: [362859.050317] res 40/00:48:58:44:4e/00:00:03:00:00/40 Emask 0x10 (ATA bus error)
Apr 6 12:36:47 ps13 kernel: [362859.050320] ata2.00: status: { DRDY }
Apr 6 12:36:47 ps13 kernel: [362859.050323] ata2: hard resetting link
Apr 6 12:36:57 ps13 kernel: [362869.051079] ata2: softreset failed (1st FIS failed)
Apr 6 12:36:57 ps13 kernel: [362869.051085] ata2: hard resetting link
Apr 6 12:37:00 ps13 systemd[1]: Starting Proxmox VE replication runner...
Apr 6 12:37:00 ps13 systemd[1]: pvesr.service: Succeeded.
Apr 6 12:37:00 ps13 systemd[1]: Started Proxmox VE replication runner.
Apr 6 12:37:07 ps13 kernel: [362879.050916] ata2: softreset failed (1st FIS failed)
Apr 6 12:37:07 ps13 kernel: [362879.050922] ata2: hard resetting link
Apr 6 12:37:17 ps13 kernel: [362889.002142] ata6.00: exception Emask 0x0 SAct 0x400000 SErr 0x0 action 0x6 frozen
Apr 6 12:37:17 ps13 kernel: [362889.002148] ata6.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002152] ata6.00: cmd 60/00:b0:00:00:00/01:00:00:00:00/40 tag 22 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002152] res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002155] ata6.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002158] ata6: hard resetting link
Apr 6 12:37:17 ps13 kernel: [362889.002169] ata5.00: exception Emask 0x0 SAct 0x400000 SErr 0x0 action 0x6 frozen
Apr 6 12:37:17 ps13 kernel: [362889.002173] ata5.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002176] ata5.00: cmd 60/00:b0:00:00:00/01:00:00:00:00/40 tag 22 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002176] res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002180] ata5.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002182] ata5: hard resetting link
Apr 6 12:37:17 ps13 kernel: [362889.002191] ata1.00: exception Emask 0x0 SAct 0x3000 SErr 0x0 action 0x6 frozen
Apr 6 12:37:17 ps13 kernel: [362889.002193] ata1.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002196] ata1.00: cmd 60/00:60:00:00:00/01:00:00:00:00/40 tag 12 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002196] res 40/00:01:06:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002198] ata1.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002199] ata1.00: failed command: READ FPDMA QUEUED
Apr 6 12:37:17 ps13 kernel: [362889.002202] ata1.00: cmd 60/00:68:00:08:00/01:00:00:00:00/40 tag 13 ncq dma 131072 in
Apr 6 12:37:17 ps13 kernel: [362889.002202] res 40/00:ff:ff:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 6 12:37:17 ps13 kernel: [362889.002204] ata1.00: status: { DRDY }
Apr 6 12:37:17 ps13 kernel: [362889.002206] ata1: hard resetting link
Apr 6 12:37:27 ps13 kernel: [362899.002609] ata1: softreset failed (1st FIS failed)
Apr 6 12:37:27 ps13 kernel: [362899.002615] ata1: hard resetting link
Apr 6 12:37:27 ps13 kernel: [362899.002653] ata5: softreset failed (1st FIS failed)
Apr 6 12:37:27 ps13 kernel: [362899.002656] ata5: hard resetting link
Apr 6 12:37:27 ps13 kernel: [362899.002706] ata6: softreset failed (1st FIS failed)
Apr 6 12:37:27 ps13 kernel: [362899.002709] ata6: hard resetting link
Apr 6 12:37:37 ps13 kernel: [362909.002577] ata1: softreset failed (1st FIS failed)
Apr 6 12:37:37 ps13 kernel: [362909.002583] ata1: hard resetting link
Apr 6 12:37:37 ps13 kernel: [362909.002596] ata6: softreset failed (1st FIS failed)
Apr 6 12:37:37 ps13 kernel: [362909.002599] ata6: hard resetting link
Apr 6 12:37:37 ps13 kernel: [362909.002637] ata5: softreset failed (1st FIS failed)
Apr 6 12:37:37 ps13 kernel: [362909.002640] ata5: hard resetting link
Apr 6 12:37:42 ps13 kernel: [362914.050962] ata2: softreset failed (1st FIS failed)
Apr 6 12:37:42 ps13 kernel: [362914.050969] ata2: limiting SATA link speed to 3.0 Gbps
Apr 6 12:37:42 ps13 kernel: [362914.050970] ata2: hard resetting link
Apr 6 12:37:47 ps13 kernel: [362919.050857] ata2: softreset failed (1st FIS failed)
Apr 6 12:37:47 ps13 kernel: [362919.050864] ata2: reset failed, giving up
Apr 6 12:37:47 ps13 kernel: [362919.050866] ata2.00: disabled
Apr 6 12:37:47 ps13 kernel: [362919.050886] ata2: EH complete
Apr 6 12:37:47 ps13 kernel: [362919.050911] sd 1:0:0:0: [sdb] tag#15 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050912] sd 1:0:0:0: [sdb] tag#15 CDB: Read(10) 28 00 02 a1 f6 30 00 00 08 00
Apr 6 12:37:47 ps13 kernel: [362919.050914] blk_update_request: I/O error, dev sdb, sector 44168752 op 0x0READ) flags 0x0 phys_seg 1 prio class 0
Apr 6 12:37:47 ps13 kernel: [362919.050930] sd 1:0:0:0: [sdb] tag#18 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050931] sd 1:0:0:0: [sdb] tag#18 CDB: Read(10) 28 00 01 de 47 98 00 00 08 00
Apr 6 12:37:47 ps13 kernel: [362919.050931] blk_update_request: I/O error, dev sdb, sector 31344536 op 0x0READ) flags 0x0 phys_seg 1 prio class 0
Apr 6 12:37:47 ps13 kernel: [362919.050940] sd 1:0:0:0: [sdb] tag#19 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050941] sd 1:0:0:0: [sdb] tag#19 CDB: Write(10) 2a 00 08 50 08 00 00 00 10 00
Apr 6 12:37:47 ps13 kernel: [362919.050942] blk_update_request: I/O error, dev sdb, sector 139462656 op 0x1WRITE) flags 0x800 phys_seg 2 prio class 0
Apr 6 12:37:47 ps13 kernel: [362919.050949] Buffer I/O error on dev dm-24, logical block 358784, lost async page write
Apr 6 12:37:47 ps13 kernel: [362919.050957] Buffer I/O error on dev dm-24, logical block 358785, lost async page write
Apr 6 12:37:47 ps13 kernel: [362919.050961] sd 1:0:0:0: [sdb] tag#20 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Apr 6 12:37:47 ps13 kernel: [362919.050962] sd 1:0:0:0: [sdb] tag#20 CDB: Write(10) 2a 00 08 4f 8e c0 00 00 08 00
Apr 6 12:37:47 ps13 kernel: [362919.050963] blk_update_request: I/O error, dev sdb, sector 139431616 op 0x1WRITE) flags 0x800 phys_seg 1 prio class 0
But NVMe (on which proxmox is installed works fine).
After host restart all works fine.
Same problem i have on few nodes (on other proxmox installed on SSD too). But lost only disks with lmv-thin storages.
pveversion --verbose
proxmox-ve: 6.3-1 (running kernel: 5.4.78-2-pve)
pve-manager: 6.3-6 (running version: 6.3-6/2184247e)
pve-kernel-5.4: 6.3-7
pve-kernel-helper: 6.3-7
pve-kernel-5.4.103-1-pve: 5.4.103-1
pve-kernel-5.4.78-2-pve: 5.4.78-2
pve-kernel-5.4.73-1-pve: 5.4.73-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.1.0-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.20-pve1
libproxmox-acme-perl: 1.0.7
libproxmox-backup-qemu0: 1.0.3-1
libpve-access-control: 6.1-3
libpve-apiclient-perl: 3.1-3
libpve-common-perl: 6.3-5
libpve-guest-common-perl: 3.1-5
libpve-http-server-perl: 3.1-1
libpve-storage-perl: 6.3-7
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.6-2
lxcfs: 4.0.6-pve1
novnc-pve: 1.1.0-1
proxmox-backup-client: 1.0.10-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.4-6
pve-cluster: 6.2-1
pve-container: 3.3-4
pve-docs: 6.3-1
pve-edk2-firmware: 2.20200531-1
pve-firewall: 4.1-3
pve-firmware: 3.2-2
pve-ha-manager: 3.1-1
pve-i18n: 2.2-2
pve-qemu-kvm: 5.2.0-3
pve-xtermjs: 4.7.0-3
qemu-server: 6.3-8
smartmontools: 7.2-pve2
spiceterm: 3.1-1
vncterm: 1.6-2
zfsutils-linux: 2.0.3-pve2
proxmox-ve: 6.3-1 (running kernel: 5.4.78-2-pve)
pve-manager: 6.3-6 (running version: 6.3-6/2184247e)
pve-kernel-5.4: 6.3-7
pve-kernel-helper: 6.3-7
pve-kernel-5.4.103-1-pve: 5.4.103-1
pve-kernel-5.4.78-2-pve: 5.4.78-2
pve-kernel-5.4.73-1-pve: 5.4.73-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.1.0-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.20-pve1
libproxmox-acme-perl: 1.0.7
libproxmox-backup-qemu0: 1.0.3-1
libpve-access-control: 6.1-3
libpve-apiclient-perl: 3.1-3
libpve-common-perl: 6.3-5
libpve-guest-common-perl: 3.1-5
libpve-http-server-perl: 3.1-1
libpve-storage-perl: 6.3-7
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.6-2
lxcfs: 4.0.6-pve1
novnc-pve: 1.1.0-1
proxmox-backup-client: 1.0.10-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.4-6
pve-cluster: 6.2-1
pve-container: 3.3-4
pve-docs: 6.3-1
pve-edk2-firmware: 2.20200531-1
pve-firewall: 4.1-3
pve-firmware: 3.2-2
pve-ha-manager: 3.1-1
pve-i18n: 2.2-2
pve-qemu-kvm: 5.2.0-3
pve-xtermjs: 4.7.0-3
qemu-server: 6.3-8
smartmontools: 7.2-pve2
spiceterm: 3.1-1
vncterm: 1.6-2
zfsutils-linux: 2.0.3-pve2