pve-kernel-3.10.0-2-pve - lockups memory issues

nicolasdiogo

Member
Mar 16, 2010
92
0
6
i have had a few lock ups on this server
i have changed the kernel to be able to use the hardware as the 2.6 kernel was not identifying working correctly on the hardware - a simple desktop motherboard with AMD x8 cores and 32GB ram

the 3.1 kernel identifying everything but gives this ramdon lockup.

any suggestions on how to correct this?

it seems to be a bug with the kernel


thanks,


Code:
# pveversion --verbose
proxmox-ve-2.6.32: not correctly installed (running kernel: 3.10.0-2-pve)
pve-manager: 3.2-4 (running version: 3.2-4/e24a91c1)
pve-kernel-3.10.0-2-pve: 3.10.0-10
lvm2: 2.02.98-pve4
clvm: 2.02.98-pve4
corosync-pve: 1.4.5-1
openais-pve: 1.1.4-3
libqb0: 0.11.1-2
redhat-cluster-pve: 3.2.0-2
resource-agents-pve: 3.9.2-4
fence-agents-pve: 4.0.5-1
pve-cluster: 3.0-12
qemu-server: 3.1-16
pve-firmware: 1.1-3
libpve-common-perl: 3.0-18
libpve-access-control: 3.0-11
libpve-storage-perl: 3.0-19
pve-libspice-server1: 0.12.4-3
vncterm: 1.1-6
vzctl: not correctly installed
vzprocps: not correctly installed
vzquota: not correctly installed
pve-qemu-kvm: 1.7-8
ksm-control-daemon: 1.1-1
glusterfs-client: 3.4.2-1


2014-07-15,17:35:19,warning,1,1,1,[57835.233010] ---[ end trace 8fb1d8f27c42f312 ]---
2014-07-15,17:35:19,warning,1,1,1,[57835.226746] RSP <ffff88075f1fd818>
2014-07-15,17:35:19,alert,1,1,1,[57835.225766] RIP [<ffffffff811820be>] policy_zonelist+0x1e/0xa0
2014-07-15,17:35:19,warning,1,1,1,[57835.223787] Code: 0f b8 c7 48 89 e5 5d c3 0f 1f 44 00 00 66 66 66 66 90 55 48 89 e5 53 48 83 ec 08 0f b7 46 04 66 83 f8 01 74 08 66 83 f8 02 74 42 <0f> 0b 89 fb 81 e3 00 00 04 00 f6 46 06 02 75 04 0f bf 56 08 31
2014-07-15,17:35:19,warning,1,1,1,[57835.222857] [<ffffffff8162c419>] system_call_fastpath+0x16/0x1b
2014-07-15,17:35:19,warning,1,1,1,[57835.221964] [<ffffffff811bc5d1>] SyS_ioctl+0x91/0xb0
2014-07-15,17:35:19,warning,1,1,1,[57835.221026] [<ffffffff810bf138>] ? SyS_futex+0x98/0x1a0
2014-07-15,17:35:19,warning,1,1,1,[57835.220072] [<ffffffff811bc0a0>] do_vfs_ioctl+0x90/0x530
2014-07-15,17:35:19,warning,1,1,1,[57835.219099] [<ffffffff810bebe1>] ? do_futex+0x111/0x5d0
2014-07-15,17:35:19,warning,1,1,1,[57835.218141] [<ffffffffa05824f4>] kvm_vcpu_ioctl+0x2b4/0x580 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.217129] [<ffffffffa0597dd0>] kvm_arch_vcpu_ioctl_run+0x210/0x420 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.216110] [<ffffffff8106f847>] ? __set_task_blocked+0x37/0x80
2014-07-15,17:35:19,warning,1,1,1,[57835.215063] [<ffffffffa05949eb>] ? x86_emulate_instruction+0x14b/0x420 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.213987] [<ffffffffa059763c>] vcpu_enter_guest+0x65c/0xbe0 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.212879] [<ffffffffa060dec5>] ? update_cr8_intercept+0x45/0xa0 [kvm_amd]
2014-07-15,17:35:19,warning,1,1,1,[57835.211750] [<ffffffffa060d3ff>] ? svm_vcpu_run+0x3ff/0x520 [kvm_amd]
2014-07-15,17:35:19,warning,1,1,1,[57835.210634] [<ffffffffa058cfab>] ? kvm_set_cr8+0x2b/0x40 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.209477] [<ffffffffa05afceb>] ? kvm_lapic_set_tpr+0x3b/0x50 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.208324] [<ffffffffa0611ccd>] handle_exit+0x12d/0x980 [kvm_amd]
2014-07-15,17:35:19,warning,1,1,1,[57835.207170] [<ffffffffa060bef8>] pf_interception+0xa8/0x150 [kvm_amd]
2014-07-15,17:35:19,warning,1,1,1,[57835.206030] [<ffffffffa059cb81>] kvm_mmu_page_fault+0x31/0x100 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.204900] [<ffffffffa05a2713>] tdp_page_fault+0x103/0x1f0 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.203785] [<ffffffffa05a037a>] try_async_pf+0x4a/0x1d0 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.202680] [<ffffffffa058215a>] gfn_to_pfn_async+0x1a/0x20 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.201565] [<ffffffffa0582070>] __gfn_to_pfn+0x60/0x70 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.200432] [<ffffffffa0581d85>] __gfn_to_pfn_memslot+0x165/0x3d0 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.199288] [<ffffffff81166786>] __get_user_pages+0x176/0x5c0
2014-07-15,17:35:19,warning,1,1,1,[57835.198130] [<ffffffff8116522a>] ? follow_page_mask+0x4ba/0x5c0
2014-07-15,17:35:19,warning,1,1,1,[57835.196977] [<ffffffff81161c89>] ? spin_unlock+0x9/0x10
2014-07-15,17:35:19,warning,1,1,1,[57835.195860] [<ffffffffa05aea0b>] ? apic_reg_write+0x2db/0x690 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.194712] [<ffffffff81165ceb>] handle_mm_fault+0x53b/0xd90
2014-07-15,17:35:19,warning,1,1,1,[57835.193556] [<ffffffffa0585e1a>] ? kvm_irq_delivery_to_apic+0xfa/0x250 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.192391] [<ffffffffa05af717>] ? kvm_apic_set_irq+0x27/0x30 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.191215] [<ffffffff811633e6>] do_wp_page+0xc6/0x7f0
2014-07-15,17:35:19,warning,1,1,1,[57835.190022] [<ffffffff81171740>] ? anon_vma_prepare+0x30/0x150
2014-07-15,17:35:19,warning,1,1,1,[57835.188846] [<ffffffffa057fc98>] ? kvm_vcpu_kick+0x88/0xa0 [kvm]
2014-07-15,17:35:19,warning,1,1,1,[57835.188802] [<ffffffff810377bd>] ? native_smp_send_reschedule+0x4d/0x70
2014-07-15,17:35:19,warning,1,1,1,[57835.188772] [<ffffffff81185923>] alloc_pages_vma+0x93/0x150
2014-07-15,17:35:19,warning,1,1,1,[57835.188759] Call Trace:
2014-07-15,17:35:19,warning,1,1,1,[57835.188723] 0000000000000000 ffffffff81171740 ffff88075f1fd8c8 ffffea001f317040
2014-07-15,17:35:19,warning,1,1,1,[57835.188687] 0000000000000000 ffffffff810377bd ffff88075f1fd868 ffffffffa057fc98
2014-07-15,17:35:19,warning,1,1,1,[57835.188650] ffff88075f1fd848 ffffea001f317040 ffff88075f1fd898 ffffffff81185923
2014-07-15,17:35:19,warning,1,1,1,[57835.188641] Stack:
2014-07-15,17:35:19,warning,1,1,1,[57835.188606] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
2014-07-15,17:35:19,warning,1,1,1,[57835.188571] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
2014-07-15,17:35:19,warning,1,1,1,[57835.188535] CR2: 00000000003a4158 CR3: 00000007befe3000 CR4: 00000000000407e0
2014-07-15,17:35:19,warning,1,1,1,[57835.188507] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2014-07-15,17:35:19,warning,1,1,1,[57835.188466] FS: 000000007ffd7000(0053) GS:ffff88081ecc0000(002b) knlGS:fffff802874fd000
2014-07-15,17:35:19,warning,1,1,1,[57835.188430] R13: 00007f175a7a3000 R14: ffffea001ce4bff0 R15: ffff8807392ffd18
2014-07-15,17:35:19,warning,1,1,1,[57835.188360] R10: 80000007cc5c1025 R11: 00007f1780000000 R12: 00000000000200da
2014-07-15,17:35:19,warning,1,1,1,[57835.188325] RBP: ffff88075f1fd828 R08: 0000000000000000 R09: ffffea001ce4bff0
2014-07-15,17:35:19,warning,1,1,1,[57835.188289] RDX: 00000000ffffffff RSI: 00007f175a7a3000 RDI: 00000000000200da
2014-07-15,17:35:19,warning,1,1,1,[57835.188253] RAX: 0000000000000000 RBX: ffffea001f317040 RCX: 00007f175a7a3000
2014-07-15,17:35:19,warning,1,1,1,[57835.188227] RSP: 0018:ffff88075f1fd818 EFLAGS: 00010293
2014-07-15,17:35:19,warning,1,1,1,[57835.188184] RIP: 0010:[<ffffffff811820be>] [<ffffffff811820be>] policy_zonelist+0x1e/0xa0
2014-07-15,17:35:19,warning,1,1,1,[57835.188146] task: ffff88075fce7260 ti: ffff88075f1fc000 task.ti: ffff88075f1fc000
2014-07-15,17:35:19,warning,1,1,1,"[57835.188108] Hardware name: MSI MS-7640/990FXA-GD65 (MS-7640), BIOS V20.3 09/26/2013"
2014-07-15,17:35:19,warning,1,1,1,[57835.188064] CPU: 3 PID: 3780 Comm: kvm Tainted: G O-------------- 3.10.0-2-pve #1
2014-07-15,17:35:19,warning,1,1,1,[57835.187828] Modules linked in: vhost_net tun macvtap macvlan kvm_amd kvm nfsd auth_rpcgss nfs_acl nfs lockd fscache sunrpc loop fuse radeon snd_pcm snd_page_alloc snd_timer i2c_algo_bit snd soundcore ttm serio_raw drm_kms_helper pcspkr fam15h_power mxm_wmi edac_mce_amd k10temp sp5100_tco drm edac_core i2c_piix4 i2c_core wmi acpi_cpufreq mperf ext4 mbcache jbd2 dm_snapshot dm_bufio raid1 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq sg ahci libahci libata r8169 mii e1000e(O)
2014-07-15,17:35:19,warning,1,1,1,[57835.187806] invalid opcode: 0000 [#1] SMP
2014-07-15,17:35:19,crit,1,1,1,[57835.187784] kernel BUG at mm/mempolicy.c:1715!
2014-07-15,17:35:19,warning,1,1,1,[57835.187754] ------------[ cut here ]------------
 
another lock up ..

Date|Time|Level|Host Name|Category|Program|Messages
2014-07-21|01:15:26|emerg|atlas|kern|kernel|[118813.013286] Kernel panic - not syncing: Fatal exception
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118813.012037] ---[ end trace 9b088e2147986d51 ]---
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118813.005547] RSP
2014-07-21|01:15:26|alert|atlas|kern|kernel|[118813.004344] RIP [] vma_interval_tree_insert_after+0x30/0x90
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118813.001936] Code: 55 48 8b 47 08 49 89 d2 48 2b 07 48 8b 97 98 00 00 00 49 89 f9 48 89 e5 48 c1 e8 0c 4c 8d 44 02 ff 48 8b 46 60 48 85 c0 74 51 90 <4c> 39 40 18 48 8d 48 a8 73 04 4c 89 40 18 48 8b 41 68 48 85 c0
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118813.000814] [] ? system_call_fastpath+0x16/0x1b
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.999646] [] stub_clone+0x69/0x90
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.998458] [] SyS_clone+0x16/0x20
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997279] [] ? get_unused_fd_flags+0x30/0x40
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997253] [] do_fork+0xa9/0x340
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997220] [] copy_process.part.25+0xa26/0x14c0
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997193] [] dup_mm+0x2d9/0x650
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997179] Call Trace:
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997143] ffff88038f5ecda8 ffff88038f5ecd90 ffff8807eff23e30 0000000001200011
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997107] ffff880036930cf8 ffff880036930c80 ffff88038f5ecd80 ffff88038f5ecda0
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997035] ffff8807eff23e30 ffffffff81059ee9 ffff8807eff23df0 ffff8807ee89f738
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.997025] Stack:
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996990] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996954] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996919] CR2: 0000000001f0c818 CR3: 00000007ee7b1000 CR4: 00000000000407e0
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996890] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996849] FS: 00007f64d4355700(0000) GS:ffff88081edc0000(0000) knlGS:0000000000000000
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996813] R13: ffff8807f12fe218 R14: ffff8807f12fe250 R15: ffff88038f5ed878
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996778] R10: ffff8807f12fe238 R11: ffff8803a7163c80 R12: ffff8807ee554e58
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996742] RBP: ffff8807eff23db0 R08: 0000000000000009 R09: ffff88038f5ed878
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996707] RDX: 0000000000000009 RSI: ffff8807ee554e58 RDI: ffff88038f5ed878
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996671] RAX: dead000000200200 RBX: ffff8807ee89f6c0 RCX: ffff8807ffd3db00
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996645] RSP: 0018:ffff8807eff23db0 EFLAGS: 00010286
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996595] RIP: 0010:[] [] vma_interval_tree_insert_after+0x30/0x90
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996557] task: ffff88003691d010 ti: ffff8807eff22000 task.ti: ffff8807eff22000
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996518] Hardware name: MSI MS-7640/990FXA-GD65 (MS-7640), BIOS V20.3 09/26/2013
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996472] CPU: 7 PID: 3150 Comm: pvestatd Tainted: G O-------------- 3.10.0-2-pve #1
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996236] Modules linked in: vhost_net tun macvtap macvlan kvm_amd kvm nfsd auth_rpcgss nfs_acl nfs lockd fscache sunrpc loop fuse radeon snd_pcm snd_page_alloc snd_timer snd i2c_algo_bit soundcore ttm drm_kms_helper fam15h_power pcspkr serio_raw edac_mce_amd sp5100_tco drm k10temp mxm_wmi edac_core i2c_piix4 i2c_core wmi acpi_cpufreq mperf ext4 mbcache jbd2 dm_snapshot dm_bufio raid1 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq sg ahci libahci libata e1000e(O) r8169 mii
2014-07-21|01:15:26|warning|atlas|kern|kernel|[118812.996204] general protection fault: 0000 [#1] SMP
2014-07-21|01:15:23|info|atlas|daemon|pveproxy|worker 23261 started
2014-07-21|01:15:23|info|atlas|daemon|pveproxy|starting 1 worker(s)
2014-07-21|01:15:23|info|atlas|daemon|pveproxy|worker 20702 finished
2014-07-21|01:13:37|info|atlas|daemon|pvedaemon|successful auth for user \'root@pam\'
2014-07-21|01:10:53|info|atlas|daemon|pvedaemon|worker 23011 started
2014-07-21|01:10:53|info|atlas|daemon|pvedaemon|starting 1 worker(s)
2014-07-21|01:10:53|info|atlas|daemon|pvedaemon|worker 20189 finished
2014-07-21|01:03:23|info|atlas|daemon|pveproxy|worker 22613 started
2014-07-21|01:03:23|info|atlas|daemon|pveproxy|starting 1 worker(s)
2014-07-21|01:03:23|info|atlas|daemon|pveproxy|worker 21359 finished
2014-07-21|00:58:36|info|atlas|daemon|pvedaemon|successful auth for user \'root@pam\'
 
why do you run 3.10.2? test latest - pve-kernel-3.10.0-3-pve: 3.10.0-11
 
I do not talk about the default kernel, I recommend to update to the latest 3.10 kernel.
 
yes, therefore I suggested to upgrade to latest 3.10 kernel - you run an old one.
 
thanks Tom,

i have upgraded.

interisting enough this upgraded version of the kernel was not acused when running:
Code:
apt-get upgrade

regards,
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!