Hi,
I have an test system (for testing zfs) with an 10GB-Solarflare NIC.
With networktraffic I got an lot of kernel errors:
This happens during an rsync of some TB inside an VM on this host quite often:
I had the same with an simple 1-Port Solarflare NIC and with the "better" dual-port card now
Udo
I have an test system (for testing zfs) with an 10GB-Solarflare NIC.
With networktraffic I got an lot of kernel errors:
Code:
May 27 11:18:58 proxtest-a kernel: [ 8043.930604] ------------[ cut here ]------------
May 27 11:18:58 proxtest-a kernel: [ 8043.930640] WARNING: at drivers/net/ethernet/sfc/net_driver.h:492 efx_poll+0x149/0x1e0 [sfc]()
May 27 11:18:58 proxtest-a kernel: [ 8043.930642] Modules linked in: ip_set ip6table_filter ip6_tables iptable_filter ip_tables vhost_net tun macvtap macvlan nfnetlink_log kvm_amd nfnetlink kv
m ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nfsd auth_rpcgss nfs_acl nfs lockd fscache sunrpc 8021q garp mrp openvswitch gr
e vxlan ip_tunnel libcrc32c fuse zfs(PO) zunicode(PO) zavl(PO) zcommon(PO) znvpair(PO) spl(O) zlib_deflate eeepc_wmi asus_wmi sparse_keymap rfkill video mxm_wmi amd64_edac_mod edac_mce_amd k1
0temp fam15h_power serio_raw pcspkr edac_core sp5100_tco i2c_piix4 tpm_infineon snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic radeon snd_hda_intel snd_hda_controller snd_hda_
codec ttm snd_hwdep snd_pcm drm_kms_helper snd_timer drm snd soundcore wmi ext4 mbcache jbd2 sg r8169 mii ahci libahci sfc mpt2sas mtd raid_class i2c_algo_bit libata i2c_core ptp scsi_transpo
rt_sas pps_core mdio
May 27 11:18:58 proxtest-a kernel: [ 8043.930690] CPU: 0 PID: 8456 Comm: z_wr_iss/0 Tainted: P W O-------------- 3.10.0-10-pve #1
May 27 11:18:58 proxtest-a kernel: [ 8043.930692] Hardware name: To be filled by O.E.M. To be filled by O.E.M./SABERTOOTH 990FX R2.0, BIOS 1302 11/14/2012
May 27 11:18:58 proxtest-a kernel: [ 8043.930695] ffffffffa01527b0 ffff88083ec03da0 ffffffff8163619b ffff88083ec03de0
May 27 11:18:58 proxtest-a kernel: [ 8043.930699] ffffffff8106cbcc ffff88083ec03dd0 ffff88081287d0a4 0000000000000040
May 27 11:18:58 proxtest-a kernel: [ 8043.930702] ffff88081287d030 ffff88081287d030 ffff88080e6008c0 ffff88083ec03df0
May 27 11:18:58 proxtest-a kernel: [ 8043.930705] Call Trace:
May 27 11:18:58 proxtest-a kernel: [ 8043.930707] <IRQ> [<ffffffff8163619b>] dump_stack+0x19/0x1b
May 27 11:18:58 proxtest-a kernel: [ 8043.930719] [<ffffffff8106cbcc>] warn_slowpath_common+0x7c/0xc0
May 27 11:18:58 proxtest-a kernel: [ 8043.930722] [<ffffffff8106cc2a>] warn_slowpath_null+0x1a/0x20
May 27 11:18:58 proxtest-a kernel: [ 8043.930728] [<ffffffffa01207f9>] efx_poll+0x149/0x1e0 [sfc]
May 27 11:18:58 proxtest-a kernel: [ 8043.930731] [<ffffffff8151c979>] net_rx_action+0x149/0x240
May 27 11:18:58 proxtest-a kernel: [ 8043.930734] [<ffffffff81075878>] __do_softirq+0xe8/0x280
May 27 11:18:58 proxtest-a kernel: [ 8043.930737] [<ffffffff81106d43>] ? handle_irq_event+0x53/0x70
May 27 11:18:58 proxtest-a kernel: [ 8043.930740] [<ffffffff8164789c>] call_softirq+0x1c/0x30
May 27 11:18:58 proxtest-a kernel: [ 8043.930742] [<ffffffff81015d15>] do_softirq+0x75/0xb0
May 27 11:18:58 proxtest-a kernel: [ 8043.930744] [<ffffffff81075c15>] irq_exit+0x115/0x120
May 27 11:18:58 proxtest-a kernel: [ 8043.930747] [<ffffffff81648447>] do_IRQ+0x67/0x110
May 27 11:18:58 proxtest-a kernel: [ 8043.930749] [<ffffffff8163d96d>] common_interrupt+0x6d/0x6d
May 27 11:18:58 proxtest-a kernel: [ 8043.930750] <EOI> [<ffffffffa053a4e0>] ? fletcher_4_native+0x30/0x50 [zcommon]
May 27 11:18:58 proxtest-a kernel: [ 8043.930806] [<ffffffffa06447f7>] ? zio_write_bp_init+0x77/0x6d0 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930830] [<ffffffffa0649d54>] zio_checksum_compute+0x94/0x170 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930860] [<ffffffffa06434c8>] zio_checksum_generate+0x48/0x70 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930886] [<ffffffffa06440f8>] zio_execute+0xc8/0x180 [zfs]
May 27 11:18:58 proxtest-a kernel: [ 8043.930903] [<ffffffffa04f0a3e>] taskq_thread+0x19e/0x330 [spl]
May 27 11:18:58 proxtest-a kernel: [ 8043.930911] [<ffffffff810a6170>] ? try_to_wake_up+0x290/0x290
May 27 11:18:58 proxtest-a kernel: [ 8043.930921] [<ffffffffa04f08a0>] ? task_expire+0xf0/0xf0 [spl]
May 27 11:18:58 proxtest-a kernel: [ 8043.930926] [<ffffffff810948c0>] kthread+0xc0/0xd0
May 27 11:18:58 proxtest-a kernel: [ 8043.930930] [<ffffffff81094800>] ? kthread_create_on_node+0x120/0x120
May 27 11:18:58 proxtest-a kernel: [ 8043.930936] [<ffffffff81645ebc>] ret_from_fork+0x7c/0xb0
May 27 11:18:58 proxtest-a kernel: [ 8043.930944] [<ffffffff81094800>] ? kthread_create_on_node+0x120/0x120
May 27 11:18:58 proxtest-a kernel: [ 8043.930948] ---[ end trace e20fda1ac06a3673 ]---
Code:
grep -c "cut here" syslog.1
454
Code:
06:00.0 Ethernet controller: Solarflare Communications SFC9020 [Solarstorm]
06:00.1 Ethernet controller: Solarflare Communications SFC9020 [Solarstorm]
Code:
pveversion -v
proxmox-ve-2.6.32: 3.4-156 (running kernel: 3.10.0-10-pve)
pve-manager: 3.4-6 (running version: 3.4-6/102d4547)
pve-kernel-2.6.32-39-pve: 2.6.32-156
pve-kernel-3.10.0-10-pve: 3.10.0-34
pve-kernel-2.6.32-37-pve: 2.6.32-150
lvm2: 2.02.98-pve4
clvm: 2.02.98-pve4
corosync-pve: 1.4.7-1
openais-pve: 1.1.4-3
libqb0: 0.11.1-2
redhat-cluster-pve: 3.2.0-2
resource-agents-pve: 3.9.2-4
fence-agents-pve: 4.0.10-2
pve-cluster: 3.0-17
qemu-server: 3.4-6
pve-firmware: 1.1-4
libpve-common-perl: 3.0-24
libpve-access-control: 3.0-16
libpve-storage-perl: 3.0-33
pve-libspice-server1: 0.12.4-3
vncterm: 1.1-8
vzctl: 4.0-1pve6
vzprocps: 2.0.11-2
vzquota: 3.1-2
pve-qemu-kvm: 2.2-10
ksm-control-daemon: 1.1-1
glusterfs-client: 3.5.2-1
Code:
modinfo sfc
filename: /lib/modules/3.10.0-10-pve/kernel/drivers/net/ethernet/sfc/sfc.ko
license: GPL
description: Solarflare network driver
author: Solarflare Communications and Michael Brown <mbrown@fensystems.co.uk>
rhelversion: 7.1
srcversion: 20FF7CA8FC785453A3BF4F4
alias: pci:v00001924d00000923sv*sd*bc*sc*i*
alias: pci:v00001924d00000903sv*sd*bc*sc*i*
alias: pci:v00001924d00000813sv*sd*bc*sc*i*
alias: pci:v00001924d00000803sv*sd*bc*sc*i*
alias: pci:v00001924d00000710sv*sd*bc*sc*i*
alias: pci:v00001924d00000703sv*sd*bc*sc*i*
depends: mtd,i2c-core,mdio,ptp,i2c-algo-bit
intree: Y
vermagic: 3.10.0-10-pve SMP mod_unload modversions
parm: vf_max_tx_channels:Limit the number of TX channels VFs can use (uint)
parm: max_vfs:Reduce the number of VFs initialized by the driver (int)
parm: rx_refill_threshold:RX descriptor ring refill threshold (%) (uint)
parm: separate_tx_channels:Use separate channels for TX and RX (bool)
parm: rss_cpus:Number of CPUs to use for Receive-Side Scaling (uint)
parm: phy_flash_cfg:Set PHYs into reflash mode initially (bool)
parm: irq_adapt_low_thresh:Threshold score for reducing IRQ moderation (uint)
parm: irq_adapt_high_thresh:Threshold score for increasing IRQ moderation (uint)
parm: debug:Bitmapped debugging message enable value (uint)
parm: interrupt_mode:Interrupt mode (0=>MSIX 1=>MSI 2=>legacy) (uint)
lsmod | grep sfc
sfc 281249 0
mtd 59531 7 sfc
i2c_algo_bit 13413 2 sfc,radeon
i2c_core 40407 6 drm,sfc,i2c_piix4,drm_kms_helper,i2c_algo_bit,radeon
ptp 18933 1 sfc
mdio 13807 1 sfc