Hi,
I saw a few post with people with a similar problem but they seem to have this problem on their VMs. Me its on my PVE host. I get this when I have multiple containers using a lot of CPU. I decided to test it with running the stress command on multiple containers. Usually it takes about 12 hours before the proxmox crashed and get this errors all over the screen.
kernel: BUG: soft lockup - CPU#6 stuck for 67s! [flush-0:27:265503]
Its always the CPU#6. Is it a hardware problem?
This the call trace in syslog :
I saw a few post with people with a similar problem but they seem to have this problem on their VMs. Me its on my PVE host. I get this when I have multiple containers using a lot of CPU. I decided to test it with running the stress command on multiple containers. Usually it takes about 12 hours before the proxmox crashed and get this errors all over the screen.
kernel: BUG: soft lockup - CPU#6 stuck for 67s! [flush-0:27:265503]
Its always the CPU#6. Is it a hardware problem?
This the call trace in syslog :
Code:
Aug 20 08:01:13 safecloud101 kernel: BUG: soft lockup - CPU#6 stuck for 67s! [flush-0:27:265503]
Aug 20 08:01:13 safecloud101 kernel: Modules linked in: vzethdev vznetdev pio_nfs pio_direct pfmt_raw pfmt_ploop1 ploop simfs vzrst nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 vzcpt nf_conntrack vzdquota vzmon vzdev ip6t_REJECT ip6table_mangle ip6table_filter ip6_tables xt_length xt_hl xt_tcpmss xt_TCPMSS iptable_mangle xt_limit xt_dscp ipt_REJECT vhost_net tun macvtap macvlan kvm_intel kvm xt_multiport iptable_filter ip_tables dlm configfs openvswitch vxlan vzevent ib_iser rdma_cm ib_addr iw_cm ib_cm ib_sa ib_mad ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nfsd nfs nfs_acl auth_rpcgss fscache lockd sunrpc bonding 8021q garp ipv6 fuse iTCO_wdt iTCO_vendor_support snd_pcsp snd_pcm snd_page_alloc snd_timer snd soundcore sb_edac edac_core lpc_ich mfd_core shpchp i2c_i801 ioatdma wmi acpi_pad ext3 mbcache jbd sg usb_storage isci mpt2sas raid_class ahci libsas ixgbe scsi_transport_sas igb i2c_algo_bit i2c_core dca [last unloaded: scsi_wait_scan]
Aug 20 08:01:13 safecloud101 kernel: CPU 6
Aug 20 08:01:13 safecloud101 kernel: Modules linked in: vzethdev vznetdev pio_nfs pio_direct pfmt_raw pfmt_ploop1 ploop simfs vzrst nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 vzcpt nf_conntrack vzdquota vzmon vzdev ip6t_REJECT ip6table_mangle ip6table_filter ip6_tables xt_length xt_hl xt_tcpmss xt_TCPMSS iptable_mangle xt_limit xt_dscp ipt_REJECT vhost_net tun macvtap macvlan kvm_intel kvm xt_multiport iptable_filter ip_tables dlm configfs openvswitch vxlan vzevent ib_iser rdma_cm ib_addr iw_cm ib_cm ib_sa ib_mad ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nfsd nfs nfs_acl auth_rpcgss fscache lockd sunrpc bonding 8021q garp ipv6 fuse iTCO_wdt iTCO_vendor_support snd_pcsp snd_pcm snd_page_alloc snd_timer snd soundcore sb_edac edac_core lpc_ich mfd_core shpchp i2c_i801 ioatdma wmi acpi_pad ext3 mbcache jbd sg usb_storage isci mpt2sas raid_class ahci libsas ixgbe scsi_transport_sas igb i2c_algo_bit i2c_core dca [last unloaded: scsi_wait_scan]
Aug 20 08:01:13 safecloud101 kernel:
Aug 20 08:01:13 safecloud101 kernel: Pid: 265503, comm: flush-0:27 veid: 0 Not tainted 2.6.32-29-pve #1 042stab088_4 Supermicro B9DRP/B9DRP
Aug 20 08:01:13 safecloud101 kernel: RIP: 0010:[<ffffffff8155edde>] [<ffffffff8155edde>] _spin_lock+0x1e/0x30
Aug 20 08:01:13 safecloud101 kernel: RSP: 0018:ffff88083f251c50 EFLAGS: 00000297
Aug 20 08:01:13 safecloud101 kernel: RAX: 000000000000c63f RBX: ffff88083f251c50 RCX: 0000000000000000
Aug 20 08:01:13 safecloud101 kernel: RDX: 000000000000c63e RSI: ffff88083f251cf0 RDI: ffffffff823ec660
Aug 20 08:01:13 safecloud101 kernel: RBP: ffffffff8100bc4e R08: ffff88107a4169c0 R09: 7fffffffffffffff
Aug 20 08:01:13 safecloud101 kernel: R10: ffff880e13bb36c0 R11: ffff880e18da0e00 R12: ffff88083f251c50
Aug 20 08:01:13 safecloud101 kernel: R13: 0000000000000000 R14: 0000000000000046 R15: ffff880866fbf198
Aug 20 08:01:13 safecloud101 kernel: FS: 0000000000000000(0000) GS:ffff8808a0700000(0000) knlGS:0000000000000000
Aug 20 08:01:13 safecloud101 kernel: CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Aug 20 08:01:13 safecloud101 kernel: CR2: 00007f13f8151010 CR3: 0000000001a85000 CR4: 00000000000407e0
Aug 20 08:01:13 safecloud101 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 20 08:01:13 safecloud101 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Aug 20 08:01:13 safecloud101 kernel: Process flush-0:27 (pid: 265503, veid: 0, threadinfo ffff88083f250000, task ffff880853f0cf30)
Aug 20 08:01:13 safecloud101 kernel: Stack:
Aug 20 08:01:13 safecloud101 kernel: ffff88083f251ca0 ffffffff811db51f ffff88083f251c90 ffffffff81192728
Aug 20 08:01:13 safecloud101 kernel: <d> ffff88087fc00380 ffff880deb0eec80 ffff880866fbf168 0000000000000000
Aug 20 08:01:13 safecloud101 kernel: <d> ffffffff823ec660 ffff88083f251cf0 ffff88083f251db0 ffffffff811db993
Aug 20 08:01:13 safecloud101 kernel: Call Trace:
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811db51f>] ? writeback_inodes_wb+0x3f/0x170
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81192728>] ? kmem_freepages+0xd8/0x120
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811db993>] ? wb_writeback+0x343/0x460
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff8119395c>] ? free_block+0x14c/0x170
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811dbb6f>] ? wb_do_writeback+0xbf/0x260
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811dbda8>] ? bdi_writeback_task+0x98/0x1e0
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161c10>] ? bdi_start_fn+0x0/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161c10>] ? bdi_start_fn+0x0/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161ca5>] ? bdi_start_fn+0x95/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161c10>] ? bdi_start_fn+0x0/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff810a2106>] ? kthread+0x96/0xa0
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff8100c34a>] ? child_rip+0xa/0x20
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff810a2070>] ? kthread+0x0/0xa0
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff8100c340>] ? child_rip+0x0/0x20
Aug 20 08:01:13 safecloud101 kernel: Code: 00 00 00 01 74 05 e8 c2 94 d3 ff c9 c3 55 48 89 e5 0f 1f 44 00 00 b8 00 00 01 00 f0 0f c1 07 0f b7 d0 c1 e8 10 39 c2 74 0e f3 90 <0f> b7 17 eb f5 83 3f 00 75 f4 eb df c9 c3 0f 1f 40 00 55 48 89
Aug 20 08:01:13 safecloud101 kernel: Call Trace:
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811db51f>] ? writeback_inodes_wb+0x3f/0x170
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81192728>] ? kmem_freepages+0xd8/0x120
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811db993>] ? wb_writeback+0x343/0x460
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff8119395c>] ? free_block+0x14c/0x170
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811dbb6f>] ? wb_do_writeback+0xbf/0x260
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff811dbda8>] ? bdi_writeback_task+0x98/0x1e0
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161c10>] ? bdi_start_fn+0x0/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161c10>] ? bdi_start_fn+0x0/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161ca5>] ? bdi_start_fn+0x95/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff81161c10>] ? bdi_start_fn+0x0/0x110
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff810a2106>] ? kthread+0x96/0xa0
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff8100c34a>] ? child_rip+0xa/0x20
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff810a2070>] ? kthread+0x0/0xa0
Aug 20 08:01:13 safecloud101 kernel: [<ffffffff8100c340>] ? child_rip+0x0/0x20