My server is an HPE ProLiant DL380 Gen10 with the following NIC: 5d:00.0 Ethernet controller: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller (rev 01).
On a fresh installation of Proxmox 8.1, dmesg shows the error below. I upgraded the kernel to 6.5.11-7-pve via apt, but the error still persists:
Code:
[ 5.123860] ================================================================================
[ 5.123862] UBSAN: shift-out-of-bounds in ./include/linux/log2.h:57:13
[ 5.123865] shift exponent 64 is too large for 64-bit type 'long unsigned int'
[ 5.123867] CPU: 60 PID: 935 Comm: (udev-worker) Tainted: P O 6.5.11-7-pve #1
[ 5.123871] Hardware name: HPE ProLiant DL380 Gen10/ProLiant DL380 Gen10, BIOS U30 07/20/2023
[ 5.123872] Call Trace:
[ 5.123875] <TASK>
[ 5.123878] dump_stack_lvl+0x48/0x70
[ 5.123892] dump_stack+0x10/0x20
[ 5.123895] __ubsan_handle_shift_out_of_bounds+0x1ac/0x360
[ 5.123904] bnxt_qplib_alloc_init_hwq.cold+0xa9/0x104 [bnxt_re]
[ 5.123925] bnxt_qplib_create_qp+0x1b4/0x7a0 [bnxt_re]
[ 5.123939] ? bnxt_qplib_rcfw_send_message+0x3e/0x70 [bnxt_re]
[ 5.123957] bnxt_re_create_qp+0x995/0xd80 [bnxt_re]
[ 5.123974] create_qp+0x17a/0x290 [ib_core]
[ 5.124015] ? create_qp+0x17a/0x290 [ib_core]
[ 5.124054] ib_create_qp_kernel+0x3b/0xe0 [ib_core]
[ 5.124093] create_mad_qp+0x8e/0x100 [ib_core]
[ 5.124137] ? __pfx_qp_event_handler+0x10/0x10 [ib_core]
[ 5.124181] ib_mad_init_device+0x294/0x840 [ib_core]
[ 5.124228] add_client_context+0x127/0x1c0 [ib_core]
[ 5.124270] enable_device_and_get+0xe6/0x1e0 [ib_core]
[ 5.124311] ib_register_device+0x506/0x610 [ib_core]
[ 5.124355] ? __kmalloc+0x4d/0xd0
[ 5.124361] ? ib_device_set_netdev+0x160/0x1b0 [ib_core]
[ 5.124403] bnxt_re_probe+0xd7a/0x1070 [bnxt_re]
[ 5.124418] ? __pfx_bnxt_re_probe+0x10/0x10 [bnxt_re]
[ 5.124430] auxiliary_bus_probe+0x3e/0xa0
[ 5.124438] really_probe+0x1c9/0x430
[ 5.124444] __driver_probe_device+0x8c/0x190
[ 5.124448] driver_probe_device+0x24/0xd0
[ 5.124452] __driver_attach+0x10b/0x210
[ 5.124455] ? __pfx___driver_attach+0x10/0x10
[ 5.124458] bus_for_each_dev+0x8a/0xf0
[ 5.124462] driver_attach+0x1e/0x30
[ 5.124465] bus_add_driver+0x127/0x240
[ 5.124469] driver_register+0x5e/0x130
[ 5.124472] __auxiliary_driver_register+0x73/0xf0
[ 5.124476] ? __pfx_bnxt_re_mod_init+0x10/0x10 [bnxt_re]
[ 5.124488] bnxt_re_mod_init+0x3e/0xff0 [bnxt_re]
[ 5.124499] ? __pfx_bnxt_re_mod_init+0x10/0x10 [bnxt_re]
[ 5.124509] do_one_initcall+0x5b/0x340
[ 5.124518] do_init_module+0x68/0x260
[ 5.124528] load_module+0x213a/0x22a0
[ 5.124534] ? security_kernel_post_read_file+0x75/0x90
[ 5.124542] init_module_from_file+0x96/0x100
[ 5.124546] ? init_module_from_file+0x96/0x100
[ 5.124549] ? mmap_region+0x698/0x9e0
[ 5.124558] idempotent_init_module+0x11c/0x2b0
[ 5.124564] __x64_sys_finit_module+0x64/0xd0
[ 5.124569] do_syscall_64+0x58/0x90
[ 5.124576] ? ksys_mmap_pgoff+0x120/0x240
[ 5.124578] ? __secure_computing+0x89/0xf0
[ 5.124586] ? exit_to_user_mode_prepare+0x39/0x190
[ 5.124590] ? syscall_exit_to_user_mode+0x37/0x60
[ 5.124597] ? do_syscall_64+0x67/0x90
[ 5.124599] ? exit_to_user_mode_prepare+0x39/0x190
[ 5.124602] ? syscall_exit_to_user_mode+0x37/0x60
[ 5.124605] ? do_syscall_64+0x67/0x90
[ 5.124609] entry_SYSCALL_64_after_hwframe+0x6e/0xd8
[ 5.124615] RIP: 0033:0x7ff4cca87559
[ 5.124637] Code: 08 89 e8 5b 5d c3 66 2e 0f 1f 84 00 00 00 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 77 08 0d 00 f7 d8 64 89 01 48
[ 5.124640] RSP: 002b:00007fff3b7a5bb8 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[ 5.124644] RAX: ffffffffffffffda RBX: 00005613bd360a70 RCX: 00007ff4cca87559
[ 5.124646] RDX: 0000000000000000 RSI: 00007ff4ccc1aefd RDI: 000000000000000f
[ 5.124647] RBP: 00007ff4ccc1aefd R08: 0000000000000000 R09: 00005613bd35f7e0
[ 5.124649] R10: 000000000000000f R11: 0000000000000246 R12: 0000000000020000
[ 5.124651] R13: 0000000000000000 R14: 00005613bd39f780 R15: 00005613bcd1aec1
[ 5.124654] </TASK>
[ 5.124656] ================================================================================
[ 5.125035] bnxt_en 0000:5d:00.0: QPLIB: cmdq[0xe]=0x3 status 0x3
[ 5.125042] bnxt_en 0000:5d:00.0 bnxt_re0: Failed to modify HW QP
[ 5.125045] infiniband bnxt_re0: Couldn't change QP1 state to INIT: -14
[ 5.125048] infiniband bnxt_re0: Couldn't start port
[ 5.125371] infiniband bnxt_re0: Couldn't open port 1
[ 5.125654] infiniband bnxt_re0: Device registered with IB successfully
[ 5.131241] ioatdma 0000:00:04.5: enabling device (0004 -> 0006)
[ 5.139986] cryptd: max_cpu_qlen set to 1000
[ 5.143670] ioatdma 0000:00:04.6: enabling device (0004 -> 0006)
[ 5.146297] AVX2 version of gcm_enc/dec engaged.
[ 5.146429] AES CTR mode by8 optimization enabled
[ 5.161123] Console: switching to colour frame buffer device 128x48
[ 5.176153] bnxt_en 0000:5d:00.1: QPLIB: cmdq[0xe]=0x3 status 0x3
[ 5.176159] bnxt_en 0000:5d:00.1 bnxt_re1: Failed to modify HW QP
[ 5.176161] infiniband bnxt_re1: Couldn't change QP1 state to INIT: -14
[ 5.176164] infiniband bnxt_re1: Couldn't start port
[ 5.176444] infiniband bnxt_re1: Couldn't open port 1
[ 5.176632] infiniband bnxt_re1: Device registered with IB successfully
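The NIC itself is working fine, but this strange error shows up in dmesg, and the RDMA side (bnxt_re) reports that it couldn't open port 1.
After I downgraded the kernel to 6.2.16-20-pve the error disappeared, so this looks like a regression somewhere between 6.2.16 and 6.5.11.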
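My reading of the UBSAN line is that include/linux/log2.h:57 is the power-of-two round-up helper, which shifts 1UL left by fls_long(n - 1). Below is a minimal Python sketch of that arithmetic, assuming a 64-bit unsigned long and assuming (the trace does not confirm this) that bnxt_re passes 0 as the size. The names roundup_shift_exponent and is_shift_defined are hypothetical, made up for illustration; fls_long only mirrors the name of the kernel helper.

```python
BITS_PER_LONG = 64  # assumption: x86-64, where unsigned long is 64 bits wide
ULONG_MASK = (1 << BITS_PER_LONG) - 1


def fls_long(x: int) -> int:
    """Position of the highest set bit, 1-based; 0 when no bit is set."""
    return (x & ULONG_MASK).bit_length()


def roundup_shift_exponent(n: int) -> int:
    """Exponent used by a `1UL << fls_long(n - 1)` style round-up (hypothetical helper)."""
    # n - 1 wraps around the way unsigned long arithmetic does in C.
    return fls_long((n - 1) & ULONG_MASK)


def is_shift_defined(n: int) -> bool:
    """True when the shift exponent is smaller than the width of the type."""
    return roundup_shift_exponent(n) < BITS_PER_LONG


if __name__ == "__main__":
    for n in (0, 1, 5, 4096):
        exp = roundup_shift_exponent(n)
        status = "ok" if is_shift_defined(n) else "out of bounds"
        print(f"n={n}: shift exponent {exp} ({status})")
```

If that assumption holds, a size of 0 is the only input that produces the "shift exponent 64" from the log, which would point at a zero queue depth being requested while the MAD QP (QP1) is created.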
Any help?