Proxmox host node unstable

jhamon

Member
Aug 18, 2008
Hi,

For the past 3 to 4 weeks my Proxmox host node has been unstable: every few days it locks up and requires a reboot.

Tonight the following was displayed on the screen. I assume this relates to the VM with ID 110.

Any other ideas on why this is suddenly happening?



Jonathan Hamon
 

Attachments

  • proxmoxerror.jpg
For around the past 3 to 4 weeks my proxmox host node seems to be unstable and then locks up and requires a reboot every few days.

It was stable before? If so, what did you change?

Tonight the following was displayed on the screen. I assume this relates to VM with ID of 110.

Looks quite normal.
 
I can confirm this behavior. Of the 6 nodes in the cluster, only the one with an Intel CPU is affected. (All the others are AMD on Asus motherboards.) The kernel error was something about KVM and Intel. I also noticed that the host had an LVM snapshot, which I don't recall making. I think I will move the KVM VM off the node and use only OpenVZ there after a reinstall.
 
Hi,

It seems to have started shortly after upgrading from 1.2 to 1.3.



Jonathan Hamon
 
OK, I got more data.
It's definitely Intel/KVM/snapshot related.
None of the AMD hosts are having problems, and the LVM snapshot that I deleted is back again.

Sam


Aug 2 04:58:46 host4 kernel: Call Trace:
Aug 2 04:58:46 host4 kernel: [<ffffffff8848bc6c>] :kvm:mmu_shrink+0x7c/0x150
Aug 2 04:58:46 host4 kernel: [<ffffffff802a8dd2>] shrink_slab+0xa2/0x230
Aug 2 04:58:46 host4 kernel: [<ffffffff802a96d5>] kswapd+0x4c5/0x5f0
Aug 2 04:58:46 host4 kernel: [<ffffffff8025c1e0>] autoremove_wake_function+0x0/0x30
Aug 2 04:58:46 host4 kernel: [<ffffffff802a9210>] kswapd+0x0/0x5f0
Aug 2 04:58:46 host4 kernel: [<ffffffff8025be37>] kthread+0x47/0x90
Aug 2 04:58:46 host4 kernel: [<ffffffff8020d4e8>] child_rip+0xa/0x12
Aug 2 04:58:46 host4 kernel: [<ffffffff8025bdf0>] kthread+0x0/0x90
Aug 2 04:58:46 host4 kernel: [<ffffffff8020d4de>] child_rip+0x0/0x12
Aug 2 04:58:46 host4 kernel:
Aug 2 04:58:58 host4 kernel: BUG: soft lockup - CPU#1 stuck for 11s! [kswapd0:181]
Aug 2 04:58:58 host4 kernel: CPU 1:
Aug 2 04:58:58 host4 kernel: Modules linked in: dm_snapshot kvm_intel kvm vzethdev vznetdev simfs vzrst vzcpt tun vzdquota vzmon vzdev xt_tcpudp xt_length ipt_ttl xt_tcpmss xt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit ipt_tos ipt_REJECT ip_tables x_tables ipv6 bridge sg serio_raw evdev parport_pc parport snd_hda_intel pcspkr psmouse thermal snd_pcm snd_timer snd_page_alloc snd_hwdep snd soundcore button processor intel_agp r8169 scsi_wait_scan virtio_blk virtio dm_mod usbhid hid usb_storage libusual sd_mod sr_mod ide_disk ide_generic ide_cd cdrom ide_core shpchp pci_hotplug uhci_hcd ehci_hcd usbcore iTCO_wdt iTCO_vendor_support i2c_i801 i2c_core ata_piix ahci pata_jmicron pata_acpi ata_generic libata scsi_mod ohci1394 ieee1394 isofs msdos fat
Aug 2 04:58:58 host4 kernel: Pid: 181, comm: kswapd0 Tainted: G D 2.6.24-7-pve #1 ovz005
Aug 2 04:58:58 host4 kernel: RIP: 0010:[<ffffffff804c8be3>] [<ffffffff804c8be3>] _spin_lock+0x83/0xa0
Aug 2 04:58:58 host4 kernel: RSP: 0018:ffff810269d19d70 EFLAGS: 00000202
Aug 2 04:58:58 host4 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000002
Aug 2 04:58:58 host4 kernel: RDX: 0000000000000001 RSI: 0000000000000202 RDI: ffff81026e8f0020
Aug 2 04:58:58 host4 kernel: RBP: ffffffff80233e9f R08: 00000000004a6aa5 R09: ffff810269d19da8
Aug 2 04:58:58 host4 kernel: R10: 28f5c28f5c28f5c3 R11: 00000000ffffffff R12: ffff810269d19d10
Aug 2 04:58:58 host4 kernel: R13: 000000000000000a R14: ffff810269d19e70 R15: ffffffff806e6820
Aug 2 04:58:58 host4 kernel: FS: 0000000000000000(0000) GS:ffff810271c02880(0000) knlGS:0000000000000000
Aug 2 04:58:58 host4 kernel: CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Aug 2 04:58:58 host4 kernel: CR2: 000000000900bfec CR3: 0000000005978000 CR4: 00000000000026e0
Aug 2 04:58:58 host4 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 2 04:58:58 host4 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Aug 2 04:58:58 host4 kernel:
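
In case it helps anyone comparing an affected node against a healthy one, here is a minimal Python sketch that pulls the soft-lockup events and the loaded-module list out of syslog lines like the ones above. The function name `summarize` and both regular expressions are my own, written only against the line format shown in this trace, so treat it as a starting point rather than a general kernel-log parser.

```python
import re

# Matches the "soft lockup" line format seen in the trace above.
LOCKUP_RE = re.compile(
    r"BUG: soft lockup - CPU#(?P<cpu>\d+) stuck for (?P<secs>\d+)s! "
    r"\[(?P<comm>[^:\]]+):(?P<pid>\d+)\]"
)
# Matches the "Modules linked in:" line and captures the module names.
MODULES_RE = re.compile(r"Modules linked in: (?P<mods>.*)")


def summarize(lines):
    """Collect soft-lockup events and the set of loaded modules from log lines."""
    lockups = []
    modules = set()
    for line in lines:
        m = LOCKUP_RE.search(line)
        if m:
            lockups.append({
                "cpu": int(m.group("cpu")),
                "seconds": int(m.group("secs")),
                "task": m.group("comm"),
                "pid": int(m.group("pid")),
            })
            continue
        m = MODULES_RE.search(line)
        if m:
            modules.update(m.group("mods").split())
    return lockups, modules


if __name__ == "__main__":
    # Two lines copied (and shortened) from the log in this post.
    sample = [
        "Aug 2 04:58:58 host4 kernel: BUG: soft lockup - CPU#1 stuck for 11s! [kswapd0:181]",
        "Aug 2 04:58:58 host4 kernel: Modules linked in: dm_snapshot kvm_intel kvm vzethdev",
    ]
    lockups, modules = summarize(sample)
    print(lockups)
    # Show which of the modules suspected in this thread are loaded.
    print(sorted(modules & {"kvm_intel", "kvm_amd", "dm_snapshot"}))
```

Running the same function over the syslog of each node should show quickly whether the lockups only appear where `kvm_intel` and `dm_snapshot` are both loaded, which is the pattern suspected here.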
 
