[SOLVED] VM suddenly no longer running in the morning...

Hello.
Twice in fairly quick succession I have had the problem that a VM (which is fairly generously provisioned) was suddenly no longer running in the morning. The first time I put it down to bad luck, but when it happened again today I did some digging:

Code:
free
             total       used       free     shared    buffers     cached
Mem:      32923412   32260896     662516      60956     232392   17593864
-/+ buffers/cache:   14434640   18488772
Swap:      7340028     889036    6450992


dmesg --->
[...]
[1696621.337844] CPU: 6 PID: 3283 Comm: pvestatd Tainted: P           O    4.4.35-1-pve #1
[1696621.337846] Hardware name: Gigabyte Technology Co., Ltd. GA-990FXA-UD5/GA-990FXA-UD5, BIOS F12 10/03/2013
[1696621.337847]  0000000000000286 00000000a2f26b36 ffff8807f1f27b50 ffffffff813f9743
[1696621.337850]  ffff8807f1f27d40 0000000000000000 ffff8807f1f27bb8 ffffffff8120adcb
[1696621.337852]  ffff880826d9ada0 ffffea000c8a5400 0000000100000001 0000000000000000
[1696621.337854] Call Trace:
[1696621.337861]  [<ffffffff813f9743>] dump_stack+0x63/0x90
[1696621.337864]  [<ffffffff8120adcb>] dump_header+0x67/0x1d5
[1696621.337866]  [<ffffffff811925c5>] oom_kill_process+0x205/0x3c0
[1696621.337868]  [<ffffffff81192a17>] out_of_memory+0x237/0x4a0
[1696621.337871]  [<ffffffff81198d0e>] __alloc_pages_nodemask+0xcee/0xe20
[1696621.337873]  [<ffffffff81198e8b>] alloc_kmem_pages_node+0x4b/0xd0
[1696621.337876]  [<ffffffff8107f053>] copy_process+0x1c3/0x1c00
[1696621.337878]  [<ffffffff811c24fb>] ? handle_mm_fault+0xdeb/0x19c0
[1696621.337881]  [<ffffffff813494b3>] ? security_file_alloc+0x33/0x50
[1696621.337884]  [<ffffffff81080c20>] _do_fork+0x80/0x360
[1696621.337886]  [<ffffffff810917ff>] ? sigprocmask+0x6f/0xa0
[1696621.337888]  [<ffffffff81080fa9>] SyS_clone+0x19/0x20
[1696621.337891]  [<ffffffff8185c276>] entry_SYSCALL_64_fastpath+0x16/0x75
[1696621.337893] Mem-Info:
[1696621.337898] active_anon:2388788 inactive_anon:494666 isolated_anon:0
active_file:1788804 inactive_file:2935754 isolated_file:0
unevictable:4788 dirty:70795 writeback:11882 unstable:0
slab_reclaimable:389260 slab_unreclaimable:88355
mapped:22550 shmem:16785 pagetables:14206 bounce:0
free:49558 free_pcp:15 free_cma:0
[1696621.337902] Node 0 DMA free:15616kB min:8kB low:8kB high:12kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15668kB managed:15616kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
[1696621.337906] lowmem_reserve[]: 0 3261 32043 32043 32043
[1696621.337909] Node 0 DMA32 free:124764kB min:2328kB low:2908kB high:3492kB active_anon:417824kB inactive_anon:501548kB active_file:611388kB inactive_file:954200kB unevictable:1788kB isolated(anon):0kB isolated(file):0kB present:3515312kB managed:3434456kB mlocked:1788kB dirty:28860kB writeback:4404kB mapped:9220kB shmem:6880kB slab_reclaimable:739440kB slab_unreclaimable:39216kB kernel_stack:656kB pagetables:5028kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
[1696621.337913] lowmem_reserve[]: 0 0 28782 28782 28782
[1696621.337916] Node 0 Normal free:57852kB min:20572kB low:25712kB high:30856kB active_anon:9137328kB inactive_anon:1477116kB active_file:6543828kB inactive_file:10788816kB unevictable:17364kB isolated(anon):0kB isolated(file):0kB present:29999104kB managed:29473340kB mlocked:17364kB dirty:254320kB writeback:43124kB mapped:80980kB shmem:60260kB slab_reclaimable:817600kB slab_unreclaimable:314204kB kernel_stack:6240kB pagetables:51796kB unstable:0kB bounce:0kB free_pcp:60kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
[1696621.337920] lowmem_reserve[]: 0 0 0 0 0
[1696621.337923] Node 0 DMA: 2*4kB (U) 1*8kB (U) 1*16kB (U) 1*32kB (U) 1*64kB (U) 1*128kB (U) 0*256kB 0*512kB 1*1024kB (U) 1*2048kB (M) 3*4096kB (M) = 15616kB
[1696621.337932] Node 0 DMA32: 4539*4kB (UME) 13356*8kB (UME) 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 125004kB
[1696621.337938] Node 0 Normal: 6580*4kB (UMEH) 4016*8kB (UMEH) 23*16kB (H) 7*32kB (H) 3*64kB (H) 1*128kB (H) 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 59360kB
[1696621.337947] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
[1696621.337948] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
[1696621.337949] 4819397 total pagecache pages
[1696621.337951] 76544 pages in swap cache
[1696621.337953] Swap cache stats: add 1108327, delete 1031783, find 6767402/6909208
[1696621.337954] Free swap  = 6033780kB
[1696621.337954] Total swap = 7340028kB
[1696621.337955] 8382521 pages RAM
[1696621.337956] 0 pages HighMem/MovableOnly
[1696621.337957] 151668 pages reserved
[1696621.337958] 0 pages cma reserved
[1696621.337959] 0 pages hwpoisoned
[1696621.337960] [ pid ]   uid  tgid total_vm      rss nr_ptes nr_pmds swapents oom_score_adj name
[1696621.337965] [ 1372]     0  1372    16530     7756      38       4       32             0 systemd-journal
[...]
[1696621.338133] [30861]     0 30861     1452      176       7       3        0             0 sleep
[1696621.338135] Out of memory: Kill process 3765 (kvm) score 102 or sacrifice child
[1696621.338204] Killed process 3765 (kvm) total-vm:4890676kB, anon-rss:3880652kB, file-rss:4376kB
[1696622.251079] vmbr52: port 2(tap507i0) entered disabled state
[1696622.251187] vmbr52: port 2(tap507i0) entered disabled state
[1741033.262258] kvm [18758]: vcpu0 unhandled rdmsr: 0xc001100d
[1745964.936326] device tap500i0 entered promiscuous mode
[1745964.952798] vmbr11: port 3(tap500i0) entered forwarding state
[1745964.952808] vmbr11: port 3(tap500i0) entered forwarding state
[1745965.996400] kvm: zapping shadow pages for mmio generation wraparound
[1745965.999391] kvm: zapping shadow pages for mmio generation wraparound
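
As a rough cross-check of the free-block lists in the dump above, here is a minimal Python sketch that tallies how much free memory sits in blocks of 16kB or larger per zone. The two input lines are copied from the dmesg excerpt with the timestamps removed; the helper names `parse_buddy_line` and `free_kb` are made up for this illustration. The reading of the `(H)` flag as the kernel's high-atomic reserve is my understanding of this kernel generation and should be treated as an assumption.

```python
import re

# Free-list lines copied from the dmesg excerpt (timestamps removed).
BUDDY_LINES = [
    "Node 0 DMA32: 4539*4kB (UME) 13356*8kB (UME) 0*16kB 0*32kB 0*64kB "
    "0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 125004kB",
    "Node 0 Normal: 6580*4kB (UMEH) 4016*8kB (UMEH) 23*16kB (H) 7*32kB (H) "
    "3*64kB (H) 1*128kB (H) 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 59360kB",
]

# One entry looks like "23*16kB (H)"; the flag group is optional.
ENTRY = re.compile(r"(\d+)\*(\d+)kB(?: \(([A-Z]+)\))?")


def parse_buddy_line(line):
    """Return (zone name, [(count, block size in kB, flags), ...])."""
    zone, _, rest = line.partition(": ")
    entries = [(int(c), int(s), f) for c, s, f in ENTRY.findall(rest)]
    return zone, entries


def free_kb(entries, min_block_kb, skip_only_flags=None):
    """Sum free kB in blocks of at least min_block_kb.

    Entries whose flags are exactly skip_only_flags are left out
    (hypothetical option, used here to ignore H-only blocks).
    """
    total = 0
    for count, size_kb, flags in entries:
        if size_kb < min_block_kb:
            continue
        if skip_only_flags is not None and flags == skip_only_flags:
            continue
        total += count * size_kb
    return total


for line in BUDDY_LINES:
    zone, entries = parse_buddy_line(line)
    print(zone, free_kb(entries, 16), free_kb(entries, 16, skip_only_flags="H"))
# Node 0 DMA32 0 0
# Node 0 Normal 912 0
```

Under these assumptions, neither zone has a free block of 16kB or larger outside the `(H)` entries, even though DMA32 and Normal together report well over 100 MB free. Whether that matters depends on the size of the failed request, which is in the part of the log that was cut.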

So everything suggests that the VM is being killed because of memory problems. However, there are no warnings anywhere, and the dashboard shows RAM usage at about 46%.
Hence the question: what could be causing this?
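
To put the dashboard figure next to the `free` output, here is a small Python sketch of the arithmetic. The numbers are copied from the `free` output in this post (in kB); the function names are made up for illustration. How the dashboard actually computes its percentage is not something I know, and the `free` snapshot was taken at a different time than the OOM event, so this is only a rough comparison.

```python
# Values in kB, copied from the `free` output in this post.
total = 32923412
used = 32260896
buffers = 232392
cached = 17593864


def used_without_cache(used_kb, buffers_kb, cached_kb):
    """Memory in use once buffers and page cache are subtracted."""
    return used_kb - buffers_kb - cached_kb


def percent(part, whole):
    """Share of `whole` taken by `part`, in percent."""
    return 100.0 * part / whole


app_used = used_without_cache(used, buffers, cached)
print(app_used)                            # 14434640, the "-/+ buffers/cache" row
print(round(percent(app_used, total), 1))  # 43.8
print(round(percent(used, total), 1))      # 98.0
```

So about 44% is in use once buffers and cache are left out, which is in the same range as the dashboard's 46%, while counting cache the box is at about 98%.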