L
linuxdatacenter
Guest
Hi,
I've recently upgraded the kernel on proxmox VE hypervisor to the latest ubuntu version. It was done to solve some problems with multipath. With kernel-2.6.32-2-pve multipath would hang for some time when we were doing reboots of our storage arrays.
We had this sort of messages and stack traces from qlogic driver as well as emulex (fc hba):
Oct 1 10:10:19 s10826 kernel: sd 1:0:1:0: [sdd] CDB: Write(10): 2a 00 0a db 3f d0 00 00 08 00
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): DEVICE RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): DEVICE RESET FAILED: Task management failed.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET FAILED: Task management failed.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET FAILED: Task management failed.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): BUS RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: sd 1:0:1:0: [sdd] Synchronizing SCSI cache
Oct 1 10:10:25 s10826 kernel: qla2xxx 0000:06:00.1: qla2xxx_eh_bus_reset: reset succeded
Oct 1 10:10:35 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): ADAPTER RESET ISSUED.
Oct 1 10:13:49 s10826 kernel: qla2xxx_1_dpc D 0000000000000000 0 958 2 0x00000000
Oct 1 10:13:49 s10826 kernel: ffff880214701ad0 0000000000000046 ffff880228415c30 0000000000000000
[here goes stack trace]
Upgrading our kernel to latest ubuntu version (2.6.35) cleared the problem. Machines are running fine and we have no more issues with multipath failover. However extensive testing is required before it hits production.
Is anyone running such kinda setup (proxmox ve + ubuntu kernel)?
I've recently upgraded the kernel on proxmox VE hypervisor to the latest ubuntu version. It was done to solve some problems with multipath. With kernel-2.6.32-2-pve multipath would hang for some time when we were doing reboots of our storage arrays.
We had this sort of messages and stack traces from qlogic driver as well as emulex (fc hba):
Oct 1 10:10:19 s10826 kernel: sd 1:0:1:0: [sdd] CDB: Write(10): 2a 00 0a db 3f d0 00 00 08 00
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): DEVICE RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): DEVICE RESET FAILED: Task management failed.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET FAILED: Task management failed.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): TARGET RESET FAILED: Task management failed.
Oct 1 10:10:19 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): BUS RESET ISSUED.
Oct 1 10:10:19 s10826 kernel: sd 1:0:1:0: [sdd] Synchronizing SCSI cache
Oct 1 10:10:25 s10826 kernel: qla2xxx 0000:06:00.1: qla2xxx_eh_bus_reset: reset succeded
Oct 1 10:10:35 s10826 kernel: qla2xxx 0000:06:00.1: scsi(1:1:0): ADAPTER RESET ISSUED.
Oct 1 10:13:49 s10826 kernel: qla2xxx_1_dpc D 0000000000000000 0 958 2 0x00000000
Oct 1 10:13:49 s10826 kernel: ffff880214701ad0 0000000000000046 ffff880228415c30 0000000000000000
[here goes stack trace]
Upgrading our kernel to latest ubuntu version (2.6.35) cleared the problem. Machines are running fine and we have no more issues with multipath failover. However extensive testing is required before it hits production.
Is anyone running such kinda setup (proxmox ve + ubuntu kernel)?