sfc network driver issue with kernel 3.10.0-10

udo

Hi,
I have a test system (for testing ZFS) with a 10GbE Solarflare NIC.
Under network traffic I get a lot of kernel warnings:
Code:
May 27 11:18:58 proxtest-a kernel: [ 8043.930604] ------------[ cut here ]------------
May 27 11:18:58 proxtest-a kernel: [ 8043.930640] WARNING: at drivers/net/ethernet/sfc/net_driver.h:492 efx_poll+0x149/0x1e0 [sfc]()
May 27 11:18:58 proxtest-a kernel: [ 8043.930642] Modules linked in: ip_set ip6table_filter ip6_tables iptable_filter ip_tables vhost_net tun macvtap macvlan nfnetlink_log kvm_amd nfnetlink kvm ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nfsd auth_rpcgss nfs_acl nfs lockd fscache sunrpc 8021q garp mrp openvswitch gre vxlan ip_tunnel libcrc32c fuse zfs(PO) zunicode(PO) zavl(PO) zcommon(PO) znvpair(PO) spl(O) zlib_deflate eeepc_wmi asus_wmi sparse_keymap rfkill video mxm_wmi amd64_edac_mod edac_mce_amd k10temp fam15h_power serio_raw pcspkr edac_core sp5100_tco i2c_piix4 tpm_infineon snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic radeon snd_hda_intel snd_hda_controller snd_hda_codec ttm snd_hwdep snd_pcm drm_kms_helper snd_timer drm snd soundcore wmi ext4 mbcache jbd2 sg r8169 mii ahci libahci sfc mpt2sas mtd raid_class i2c_algo_bit libata i2c_core ptp scsi_transport_sas pps_core mdio
May 27 11:18:58 proxtest-a kernel: [ 8043.930690] CPU: 0 PID: 8456 Comm: z_wr_iss/0 Tainted: P        W  O--------------   3.10.0-10-pve #1
May 27 11:18:58 proxtest-a kernel: [ 8043.930692] Hardware name: To be filled by O.E.M. To be filled by O.E.M./SABERTOOTH 990FX R2.0, BIOS 1302 11/14/2012
May 27 11:18:58 proxtest-a kernel: [ 8043.930695]  ffffffffa01527b0 ffff88083ec03da0 ffffffff8163619b ffff88083ec03de0
May 27 11:18:58 proxtest-a kernel: [ 8043.930699]  ffffffff8106cbcc ffff88083ec03dd0 ffff88081287d0a4 0000000000000040
May 27 11:18:58 proxtest-a kernel: [ 8043.930702]  ffff88081287d030 ffff88081287d030 ffff88080e6008c0 ffff88083ec03df0
May 27 11:18:58 proxtest-a kernel: [ 8043.930705] Call Trace:
May 27 11:18:58 proxtest-a kernel: [ 8043.930707]  <IRQ>  [<ffffffff8163619b>] dump_stack+0x19/0x1b
May 27 11:18:58 proxtest-a kernel: [ 8043.930719]  [<ffffffff8106cbcc>] warn_slowpath_common+0x7c/0xc0
May 27 11:18:58 proxtest-a kernel: [ 8043.930722]  [<ffffffff8106cc2a>] warn_slowpath_null+0x1a/0x20
May 27 11:18:58 proxtest-a kernel: [ 8043.930728]  [<ffffffffa01207f9>] efx_poll+0x149/0x1e0 [sfc]
May 27 11:18:58 proxtest-a kernel: [ 8043.930731]  [<ffffffff8151c979>] net_rx_action+0x149/0x240
May 27 11:18:58 proxtest-a kernel: [ 8043.930734]  [<ffffffff81075878>] __do_softirq+0xe8/0x280
May 27 11:18:58 proxtest-a kernel: [ 8043.930737]  [<ffffffff81106d43>] ? handle_irq_event+0x53/0x70
May 27 11:18:58 proxtest-a kernel: [ 8043.930740]  [<ffffffff8164789c>] call_softirq+0x1c/0x30
May 27 11:18:58 proxtest-a kernel: [ 8043.930742]  [<ffffffff81015d15>] do_softirq+0x75/0xb0
May 27 11:18:58 proxtest-a kernel: [ 8043.930744]  [<ffffffff81075c15>] irq_exit+0x115/0x120
May 27 11:18:58 proxtest-a kernel: [ 8043.930747]  [<ffffffff81648447>] do_IRQ+0x67/0x110
May 27 11:18:58 proxtest-a kernel: [ 8043.930749]  [<ffffffff8163d96d>] common_interrupt+0x6d/0x6d
May 27 11:18:58 proxtest-a kernel: [ 8043.930750]  <EOI>  [<ffffffffa053a4e0>] ? fletcher_4_native+0x30/0x50 [zcommon]
May 27 11:18:58 proxtest-a kernel: [ 8043.930806]  [<ffffffffa06447f7>] ? zio_write_bp_init+0x77/0x6d0 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930830]  [<ffffffffa0649d54>] zio_checksum_compute+0x94/0x170 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930860]  [<ffffffffa06434c8>] zio_checksum_generate+0x48/0x70 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930886]  [<ffffffffa06440f8>] zio_execute+0xc8/0x180 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930903]  [<ffffffffa04f0a3e>] taskq_thread+0x19e/0x330 [spl]
May 27 11:18:58 proxtest-a kernel: [ 8043.930911]  [<ffffffff810a6170>] ? try_to_wake_up+0x290/0x290
May 27 11:18:58 proxtest-a kernel: [ 8043.930921]  [<ffffffffa04f08a0>] ? task_expire+0xf0/0xf0 [spl]
May 27 11:18:58 proxtest-a kernel: [ 8043.930926]  [<ffffffff810948c0>] kthread+0xc0/0xd0
May 27 11:18:58 proxtest-a kernel: [ 8043.930930]  [<ffffffff81094800>] ? kthread_create_on_node+0x120/0x120
May 27 11:18:58 proxtest-a kernel: [ 8043.930936]  [<ffffffff81645ebc>] ret_from_fork+0x7c/0xb0
May 27 11:18:58 proxtest-a kernel: [ 8043.930944]  [<ffffffff81094800>] ? kthread_create_on_node+0x120/0x120
May 27 11:18:58 proxtest-a kernel: [ 8043.930948] ---[ end trace e20fda1ac06a3673 ]---
This happens quite often during an rsync of several TB inside a VM on this host:
Code:
grep -c "cut here" syslog.1
454
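The `grep -c` above only gives a total. To check whether all of these warnings come from the same place in the driver, the log can be grouped by source location and function. The following is a minimal Python sketch, not part of the original report: the helper names `count_warnings` and `count_warnings_in_file` are made up for illustration, and the regular expression assumes the `WARNING: at <file>:<line> <function>+<offset>` layout seen in the trace above (other kernel versions may format this line differently).

```python
import re
from collections import Counter

# Assumed line layout (taken from the trace in this post):
#   "... WARNING: at drivers/net/ethernet/sfc/net_driver.h:492 efx_poll+0x149/0x1e0 [sfc]()"
WARNING_RE = re.compile(r"WARNING: at (?P<location>\S+) (?P<symbol>[^+\s]+)\+")


def count_warnings(lines):
    """Count kernel WARNING lines per (source location, function name)."""
    counts = Counter()
    for line in lines:
        match = WARNING_RE.search(line)
        if match:
            counts[(match.group("location"), match.group("symbol"))] += 1
    return counts


def count_warnings_in_file(path):
    """Convenience wrapper (hypothetical helper): scan one log file."""
    # errors="replace" keeps the scan going if the log contains odd bytes.
    with open(path, errors="replace") as log:
        return count_warnings(log)
```

Calling `count_warnings_in_file("syslog.1").most_common()` would list each warning site with its count. If everything is triggered by the same check, a single entry for `drivers/net/ethernet/sfc/net_driver.h:492` in `efx_poll` should account for the whole total.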
I had the same problem with a simple single-port Solarflare NIC, and I have it now with the "better" dual-port card as well:
Code:
06:00.0 Ethernet controller: Solarflare Communications SFC9020 [Solarstorm]
06:00.1 Ethernet controller: Solarflare Communications SFC9020 [Solarstorm]
Code:
pveversion -v
proxmox-ve-2.6.32: 3.4-156 (running kernel: 3.10.0-10-pve)
pve-manager: 3.4-6 (running version: 3.4-6/102d4547)
pve-kernel-2.6.32-39-pve: 2.6.32-156
pve-kernel-3.10.0-10-pve: 3.10.0-34
pve-kernel-2.6.32-37-pve: 2.6.32-150
lvm2: 2.02.98-pve4
clvm: 2.02.98-pve4
corosync-pve: 1.4.7-1
openais-pve: 1.1.4-3
libqb0: 0.11.1-2
redhat-cluster-pve: 3.2.0-2
resource-agents-pve: 3.9.2-4
fence-agents-pve: 4.0.10-2
pve-cluster: 3.0-17
qemu-server: 3.4-6
pve-firmware: 1.1-4
libpve-common-perl: 3.0-24
libpve-access-control: 3.0-16
libpve-storage-perl: 3.0-33
pve-libspice-server1: 0.12.4-3
vncterm: 1.1-8
vzctl: 4.0-1pve6
vzprocps: 2.0.11-2
vzquota: 3.1-2
pve-qemu-kvm: 2.2-10
ksm-control-daemon: 1.1-1
glusterfs-client: 3.5.2-1
Code:
modinfo sfc
filename:       /lib/modules/3.10.0-10-pve/kernel/drivers/net/ethernet/sfc/sfc.ko
license:        GPL
description:    Solarflare network driver
author:         Solarflare Communications and Michael Brown <mbrown@fensystems.co.uk>
rhelversion:    7.1
srcversion:     20FF7CA8FC785453A3BF4F4
alias:          pci:v00001924d00000923sv*sd*bc*sc*i*
alias:          pci:v00001924d00000903sv*sd*bc*sc*i*
alias:          pci:v00001924d00000813sv*sd*bc*sc*i*
alias:          pci:v00001924d00000803sv*sd*bc*sc*i*
alias:          pci:v00001924d00000710sv*sd*bc*sc*i*
alias:          pci:v00001924d00000703sv*sd*bc*sc*i*
depends:        mtd,i2c-core,mdio,ptp,i2c-algo-bit
intree:         Y
vermagic:       3.10.0-10-pve SMP mod_unload modversions 
parm:           vf_max_tx_channels:Limit the number of TX channels VFs can use (uint)
parm:           max_vfs:Reduce the number of VFs initialized by the driver (int)
parm:           rx_refill_threshold:RX descriptor ring refill threshold (%) (uint)
parm:           separate_tx_channels:Use separate channels for TX and RX (bool)
parm:           rss_cpus:Number of CPUs to use for Receive-Side Scaling (uint)
parm:           phy_flash_cfg:Set PHYs into reflash mode initially (bool)
parm:           irq_adapt_low_thresh:Threshold score for reducing IRQ moderation (uint)
parm:           irq_adapt_high_thresh:Threshold score for increasing IRQ moderation (uint)
parm:           debug:Bitmapped debugging message enable value (uint)
parm:           interrupt_mode:Interrupt mode (0=>MSIX 1=>MSI 2=>legacy) (uint)

lsmod | grep sfc
sfc                   281249  0 
mtd                    59531  7 sfc
i2c_algo_bit           13413  2 sfc,radeon
i2c_core               40407  6 drm,sfc,i2c_piix4,drm_kms_helper,i2c_algo_bit,radeon
ptp                    18933  1 sfc
mdio                   13807  1 sfc
Udo
 
