H330 SAS controller failed after upgrading to kernel 6.17.2-2-pve

tbomb456

After upgrading the kernel to 6.17.2-2-pve, my Dell H330 SAS controller fails to initialize and can no longer present its 4 HDDs to the kernel for a ZFS pool:

[ 7.502037] megaraid_sas 0000:41:00.0: Ignore DCMD timeout: megasas_get_ctrl_info 5389
[ 7.502188] megaraid_sas 0000:41:00.0: Could not get controller info. Fail from megasas_init_adapter_fusion 1907
[ 7.504125] megaraid_sas 0000:41:00.0: Failed from megasas_init_fw 6553

So far I have added intel_iommu=on, iommu=pt and megaraid_sas.smp_affinity_enable=0 to GRUB. I also tried rolling back to the 6.14.11-4-pve kernel, but after rolling back I ran into many other errors that were related to the Ceph MON.
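For anyone following along, here is a minimal sketch for confirming that parameters like these actually reached the running kernel, since an edit that was never regenerated into the boot config is easy to miss. The helper names `has_param` and `report_params` are made up for this example. The comments assume a stock Proxmox layout, which may not match every install.

```shell
#!/bin/sh
# Assumption: on GRUB-booted hosts the parameters live in
# GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub and need `update-grub`;
# hosts booted through proxmox-boot-tool with systemd-boot use
# /etc/kernel/cmdline and `proxmox-boot-tool refresh` instead.

# has_param CMDLINE PARAM: succeed if PARAM appears as a whole
# space-separated word in CMDLINE (so "iommu=on" does not match
# "intel_iommu=on").
has_param() {
    case " $1 " in
        *" $2 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# report_params CMDLINE PARAM...: print one status line per parameter.
report_params() {
    cmdline=$1
    shift
    for p in "$@"; do
        if has_param "$cmdline" "$p"; then
            echo "present: $p"
        else
            echo "missing: $p"
        fi
    done
}

# Example call after a reboot:
#   report_params "$(cat /proc/cmdline)" intel_iommu=on iommu=pt \
#       megaraid_sas.smp_affinity_enable=0
```

If a parameter shows up as missing after a reboot, the boot configuration was probably not regenerated, or the host boots through a different loader than the one that was edited.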

I can't find anything else about this issue that relates to this kernel.

modinfo megaraid_sas
filename: /lib/modules/6.17.2-2-pve/kernel/drivers/scsi/megaraid/megaraid_sas.ko
description: Broadcom MegaRAID SAS Driver
author: megaraidlinux.pdl@broadcom.com
version: 07.734.00.00-rc1

I also found this in dmesg:

[ 2.213768] DMAR: DRHD: handling fault status reg 2
[ 2.213985] DMAR: [DMA Read NO_PASID] Request device [41:00.0] fault addr 0xb5840000 [fault reason 0x06] PTE Read access is not set
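Note that the DMAR fault names the same PCI address (41:00.0) as the megaraid_sas errors above. A rough sketch for pulling the faulting device and address out of dmesg so the two can be compared; the function name `dmar_fault_devices` is made up for this example, and the pattern is only matched against the fault line format shown in this post.

```shell
#!/bin/sh
# dmar_fault_devices: read kernel log lines on stdin and print
# "<pci-address> <fault-address>" for every DMAR fault line.
# Lines that do not match (e.g. "DRHD: handling fault status")
# are dropped.
dmar_fault_devices() {
    sed -n 's/.*DMAR: .*Request device \[\([0-9a-f:.]*\)\] fault addr \(0x[0-9a-f]*\).*/\1 \2/p'
}

# Example call:
#   dmesg | dmar_fault_devices | sort | uniq -c
```

With the log line above as input this should print `41:00.0 0xb5840000`, which can then be checked against the `0000:41:00.0` address in the megaraid_sas messages.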
 
Read this post:

"I think it's related to the fact that this card can work in both RAID and NON-RAID (Passthrough) modes. When IOMMU is enabled it doesn't see your raid configurations and produces this error."

Try replacing the boot kernel parameter ‘intel_iommu=on’ with ‘intel_iommu=pt’ “to set up pass through (PT) mode in context mapping entry”. DMAR is then disabled in the GNU/Linux kernel.
 
These one-off situations keep coming up with the megaraid_sas driver. Just installing a Dell HBA330, which uses the much simpler mpt3sas driver, will avoid all this drama. LOL.

In addition, the Dell HBA330 is very cheap to get.
 
After editing GRUB to use intel_iommu=pt, I am still having this issue:

[ 7.458995] megaraid_sas 0000:41:00.0: Ignore DCMD timeout: megasas_get_ctrl_info 5389
[ 7.459264] megaraid_sas 0000:41:00.0: Could not get controller info. Fail from megasas_init_adapter_fusion 1907
[ 7.461268] megaraid_sas 0000:41:00.0: Failed from megasas_init_fw 6553