Hi everyone,
For the last couple of weeks we have been seeing random crashes of various Windows 2016 guests while the backup runs. A guest does not crash on every run, and it is not always the same guest.
The Windows guest log has:
The previous system shutdown at 12:10:47 PM on 1/14/2024 was unexpected.
Proxmox log has:
Code:
Jan 14 12:09:52 vsvr-7865-9 pvescheduler[120630]: INFO: Finished Backup of VM 103 (00:06:09)
Jan 14 12:09:52 vsvr-7865-9 pvescheduler[120630]: INFO: Starting Backup of VM 221 (qemu)
Jan 14 12:11:14 vsvr-7865-9 kernel: corosync invoked oom-killer: gfp_mask=0x4c2cc0(GFP_KERNEL_ACCOUNT|__GFP_NOWARN|__GFP_COMP|__GFP_NOMEMALLOC), order=2, oom_score_adj=0
Jan 14 12:11:15 vsvr-7865-9 kernel: CPU: 7 PID: 2145 Comm: corosync Tainted: P O 6.5.11-7-pve #1
Jan 14 12:11:15 vsvr-7865-9 kernel: Hardware name: Dell Inc. Precision 7865 Tower/0DMYN9, BIOS 1.1.0 06/14/2023
Jan 14 12:11:15 vsvr-7865-9 kernel: Call Trace:
Jan 14 12:11:15 vsvr-7865-9 kernel: <TASK>
Jan 14 12:11:15 vsvr-7865-9 kernel: dump_stack_lvl+0x48/0x70
Jan 14 12:11:15 vsvr-7865-9 kernel: dump_stack+0x10/0x20
Jan 14 12:11:15 vsvr-7865-9 kernel: dump_header+0x4f/0x260
Jan 14 12:11:15 vsvr-7865-9 kernel: oom_kill_process+0x10d/0x1c0
Jan 14 12:11:15 vsvr-7865-9 kernel: out_of_memory+0x270/0x560
Jan 14 12:11:15 vsvr-7865-9 kernel: __alloc_pages+0x114f/0x12e0
Jan 14 12:11:15 vsvr-7865-9 kernel: ? __alloc_skb+0x8a/0x1b0
Jan 14 12:11:15 vsvr-7865-9 kernel: __kmalloc_large_node+0x7e/0x160
Jan 14 12:11:15 vsvr-7865-9 kernel: ? memcg_slab_post_alloc_hook+0x1bf/0x280
Jan 14 12:11:15 vsvr-7865-9 kernel: __kmalloc_node_track_caller.cold+0x5/0xa3
Jan 14 12:11:15 vsvr-7865-9 kernel: kmalloc_reserve+0x67/0x100
Jan 14 12:11:15 vsvr-7865-9 kernel: __alloc_skb+0x8a/0x1b0
Jan 14 12:11:15 vsvr-7865-9 kernel: alloc_skb_with_frags+0x4d/0x240
Jan 14 12:11:15 vsvr-7865-9 kernel: sock_alloc_send_pskb+0x20e/0x260
Jan 14 12:11:15 vsvr-7865-9 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 14 12:11:15 vsvr-7865-9 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 14 12:11:15 vsvr-7865-9 kernel: ? wait_for_unix_gc+0x46/0x110
Jan 14 12:11:15 vsvr-7865-9 kernel: unix_dgram_sendmsg+0x171/0xb80
Jan 14 12:11:15 vsvr-7865-9 kernel: ? inet_recvmsg+0x121/0x140
Jan 14 12:11:15 vsvr-7865-9 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 14 12:11:15 vsvr-7865-9 kernel: ? security_socket_recvmsg+0x47/0x80
Jan 14 12:11:15 vsvr-7865-9 kernel: unix_seqpacket_sendmsg+0x34/0x80
Jan 14 12:11:15 vsvr-7865-9 kernel: sock_write_iter+0x191/0x1a0
Jan 14 12:11:15 vsvr-7865-9 kernel: do_iter_readv_writev+0xf2/0x160
Jan 14 12:11:15 vsvr-7865-9 kernel: do_iter_write+0xa5/0x220
Jan 14 12:11:15 vsvr-7865-9 kernel: vfs_writev+0xf8/0x1c0
Jan 14 12:11:15 vsvr-7865-9 kernel: do_writev+0x108/0x170
Jan 14 12:11:15 vsvr-7865-9 kernel: __x64_sys_writev+0x1c/0x30
Jan 14 12:11:15 vsvr-7865-9 kernel: do_syscall_64+0x5b/0x90
Jan 14 12:11:15 vsvr-7865-9 kernel: ? syscall_exit_to_user_mode+0x37/0x60
Jan 14 12:11:15 vsvr-7865-9 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 14 12:11:15 vsvr-7865-9 kernel: ? do_syscall_64+0x67/0x90
Jan 14 12:11:15 vsvr-7865-9 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 14 12:11:15 vsvr-7865-9 kernel: ? do_syscall_64+0x67/0x90
Jan 14 12:11:15 vsvr-7865-9 kernel: ? do_syscall_64+0x67/0x90
Jan 14 12:11:15 vsvr-7865-9 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 14 12:11:15 vsvr-7865-9 kernel: ? do_syscall_64+0x67/0x90
Jan 14 12:11:15 vsvr-7865-9 kernel: ? do_syscall_64+0x67/0x90
Jan 14 12:11:15 vsvr-7865-9 kernel: entry_SYSCALL_64_after_hwframe+0x6e/0xd8
Jan 14 12:11:15 vsvr-7865-9 kernel: RIP: 0033:0x7ff40d5e0c8d
Jan 14 12:11:15 vsvr-7865-9 kernel: Code: 28 89 54 24 1c 48 89 74 24 10 89 7c 24 08 e8 3a 7a f8 ff 8b 54 24 1c 48 8b 74 24 10 41 89 c0 8b 7c 24 08 b8 14 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 33 44 89 c7 48 89 44 24 08 e8 8e 7a f8 ff 48
Jan 14 12:11:15 vsvr-7865-9 kernel: RSP: 002b:00007ff40a595a60 EFLAGS: 00000293 ORIG_RAX: 0000000000000014
Jan 14 12:11:15 vsvr-7865-9 kernel: RAX: ffffffffffffffda RBX: 00007ff401f5c010 RCX: 00007ff40d5e0c8d
Jan 14 12:11:15 vsvr-7865-9 kernel: RDX: 0000000000000001 RSI: 00007ff40a595b50 RDI: 0000000000000016
Jan 14 12:11:15 vsvr-7865-9 kernel: RBP: 00007ff40a5cfe40 R08: 0000000000000000 R09: 0000000000000004
Jan 14 12:11:15 vsvr-7865-9 kernel: R10: 0000000000000575 R11: 0000000000000293 R12: 00007ff409c8e010
Jan 14 12:11:15 vsvr-7865-9 kernel: R13: 00005559bfce0d70 R14: 00007ff40a595b50 R15: 00007ff40a597e70
Jan 14 12:11:15 vsvr-7865-9 kernel: </TASK>
Jan 14 12:11:15 vsvr-7865-9 kernel: Mem-Info:
Jan 14 12:11:15 vsvr-7865-9 kernel: active_anon:4066383 inactive_anon:1829383 isolated_anon:0
active_file:1287 inactive_file:7621 isolated_file:0
unevictable:40861 dirty:9 writeback:23
slab_reclaimable:69256 slab_unreclaimable:942755
mapped:22837 shmem:22081 pagetables:25254
sec_pagetables:10902 bounce:0
kernel_misc_reclaimable:0
free:1416123 free_pcp:62 free_cma:0
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 active_anon:16265532kB inactive_anon:7317532kB active_file:5552kB inactive_file:30080kB unevictable:163444kB isolated(anon):0kB isolated(file):0kB mapped:91348kB dirty:36kB writeback:92kB shmem:88324kB shmem_thp: 0kB shmem_pmdmapped: 0kB anon_thp: 3778560kB writeback_tmp:0kB kernel_stack:15072kB pagetables:101016kB sec_pagetables:43608kB all_unreclaimable? no
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 DMA free:11264kB boost:0kB min:12kB low:24kB high:36kB reserved_highatomic:0KB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB writepending:0kB present:15992kB managed:15360kB mlocked:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB
Jan 14 12:11:15 vsvr-7865-9 kernel: lowmem_reserve[]: 0 2834 64066 64066 64066
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 DMA32 free:402384kB boost:0kB min:2988kB low:5888kB high:8788kB reserved_highatomic:6144KB active_anon:580kB inactive_anon:248504kB active_file:0kB inactive_file:0kB unevictable:0kB writepending:0kB present:3026784kB managed:2960920kB mlocked:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB
Jan 14 12:11:15 vsvr-7865-9 kernel: lowmem_reserve[]: 0 0 61232 61232 61232
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 Normal free:5250844kB boost:0kB min:64576kB low:127276kB high:189976kB reserved_highatomic:155648KB active_anon:16264840kB inactive_anon:7069140kB active_file:7224kB inactive_file:30224kB unevictable:163444kB writepending:128kB present:63948800kB managed:62709828kB mlocked:163444kB bounce:0kB free_pcp:248kB local_pcp:0kB free_cma:0kB
Jan 14 12:11:15 vsvr-7865-9 kernel: lowmem_reserve[]: 0 0 0 0 0
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 DMA: 0*4kB 0*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 1*1024kB (U) 1*2048kB (M) 2*4096kB (M) = 11264kB
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 DMA32: 19922*4kB (UME) 40337*8kB (UE) 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 402384kB
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 Normal: 408266*4kB (UM) 452398*8kB (UME) 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 5252248kB
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
Jan 14 12:11:15 vsvr-7865-9 kernel: Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
Jan 14 12:11:15 vsvr-7865-9 kernel: 34724 total pagecache pages
Jan 14 12:11:15 vsvr-7865-9 kernel: 0 pages in swap cache
Jan 14 12:11:15 vsvr-7865-9 kernel: Free swap = 0kB
Jan 14 12:11:15 vsvr-7865-9 kernel: Total swap = 0kB
Jan 14 12:11:15 vsvr-7865-9 kernel: 16747894 pages RAM
Jan 14 12:11:15 vsvr-7865-9 kernel: 0 pages HighMem/MovableOnly
Jan 14 12:11:15 vsvr-7865-9 kernel: 326367 pages reserved
Jan 14 12:11:15 vsvr-7865-9 kernel: 0 pages hwpoisoned
Jan 14 12:11:15 vsvr-7865-9 kernel: Tasks state (memory values in pages):
Jan 14 12:11:15 vsvr-7865-9 kernel: [ pid ] uid tgid total_vm rss pgtables_bytes swapents oom_score_adj name
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 832] 0 832 10327 624 114688 0 -250 systemd-journal
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 859] 0 859 6719 720 77824 0 -1000 systemd-udevd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1585] 0 1585 19796 576 53248 0 0 pvefw-logger
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1587] 103 1587 1969 576 53248 0 0 rpcbind
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1590] 101 1590 2292 576 57344 0 -900 dbus-daemon
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1594] 0 1594 38187 336 65536 0 -1000 lxcfs
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1597] 0 1597 69539 480 81920 0 0 pve-lxc-syscall
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1600] 0 1600 2895 720 65536 0 0 smartd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1601] 0 1601 1766 291 49152 0 0 ksmtuned
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1605] 0 1605 1327 288 49152 0 0 qmeventd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1607] 0 1607 12489 912 86016 0 0 systemd-logind
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1613] 0 1613 583 192 36864 0 -1000 watchdog-mux
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1617] 0 1617 60164 816 90112 0 0 zed
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1633] 0 1633 618 240 49152 0 0 atopacctd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1660] 0 1660 5212 4819 86016 0 -1000 atop
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1930] 0 1930 1256 384 49152 0 0 lxc-monitord
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1947] 0 1947 1468 432 49152 0 0 agetty
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1952] 0 1952 3853 1152 73728 0 -1000 sshd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1972] 100 1972 4714 634 57344 0 0 chronyd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 1977] 100 1977 2632 496 57344 0 0 chronyd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2016] 0 2016 126542 691 151552 0 0 rrdcached
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2031] 0 2031 144103 19819 471040 0 0 pmxcfs
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2123] 0 2123 10664 772 69632 0 0 master
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2125] 104 2125 10776 960 69632 0 0 qmgr
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2130] 0 2130 139855 41667 401408 0 0 corosync
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2131] 0 2131 1652 480 53248 0 0 cron
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2149] 0 2149 39286 24713 319488 0 0 pvestatd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2150] 0 2150 39311 24587 294912 0 0 pve-firewall
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2154] 0 2154 615 240 45056 0 0 bpfilter_umh
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2177] 0 2177 58307 34274 401408 0 0 pvedaemon
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2178] 0 2178 60436 34515 425984 0 0 pvedaemon worke
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2179] 0 2179 60439 34563 425984 0 0 pvedaemon worke
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2180] 0 2180 60439 34563 425984 0 0 pvedaemon worke
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2185] 0 2185 54841 27683 360448 0 0 pve-ha-crm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2187] 33 2187 58652 34385 421888 0 0 pveproxy
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2193] 33 2193 20199 13189 180224 0 0 spiceproxy
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2194] 33 2194 20260 13286 180224 0 0 spiceproxy work
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2195] 0 2195 54707 27688 376832 0 0 pve-ha-lrm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2209] 0 2209 2670274 2096191 19308544 0 0 kvm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2353] 0 2353 4741529 4214387 36196352 0 0 kvm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2463] 0 2463 3287 919 57344 0 0 swtpm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2471] 0 2471 2687472 2146671 19619840 0 0 kvm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2686] 0 2686 3287 871 61440 0 0 swtpm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2719] 0 2719 2654217 2119771 19456000 0 0 kvm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2821] 0 2821 53610 28237 360448 0 0 pvescheduler
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 29066] 33 29066 60818 35057 438272 0 0 pveproxy worker
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 36482] 33 36482 60818 35201 438272 0 0 pveproxy worker
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 37138] 33 37138 60818 34961 438272 0 0 pveproxy worker
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 88453] 0 88453 4934 1536 81920 0 100 systemd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 88454] 0 88454 42439 1344 94208 0 100 (sd-pam)
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 89354] 0 89354 4753 1853 77824 0 0 sshd
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 89410] 0 89410 53010 28845 409600 0 0 qm
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 97689] 104 97689 10764 960 69632 0 0 pickup
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 120630] 0 120630 57986 29321 397312 0 0 task UPID:vsvr-
Jan 14 12:11:15 vsvr-7865-9 kernel: [ 125959] 0 125959 1366 336 49152 0 0 sleep
Jan 14 12:11:15 vsvr-7865-9 kernel: oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=corosync.service,mems_allowed=0,global_oom,task_memcg=/qemu.slice/103.scope,task=kvm,pid=2353,uid=0
Jan 14 12:11:15 vsvr-7865-9 kernel: Out of memory: Killed process 2353 (kvm) total-vm:18966116kB, anon-rss:16853764kB, file-rss:3784kB, shmem-rss:0kB, UID:0 pgtables:35348kB oom_score_adj:0
Jan 14 12:11:15 vsvr-7865-9 systemd[1]: 103.scope: A process of this unit has been killed by the OOM killer.
Jan 14 12:11:15 vsvr-7865-9 systemd[1]: 103.scope: Failed with result 'oom-kill'.
Jan 14 12:11:15 vsvr-7865-9 systemd[1]: 103.scope: Consumed 30min 31.305s CPU time.
Jan 14 12:11:17 vsvr-7865-9 kernel: oom_reaper: reaped process 2353 (kvm), now anon-rss:0kB, file-rss:744kB, shmem-rss:0kB
Jan 14 12:11:19 vsvr-7865-9 kernel: zd64: p1 p2
Jan 14 12:11:19 vsvr-7865-9 kernel: vmbr1: port 3(tap103i0) entered disabled state
Jan 14 12:11:19 vsvr-7865-9 kernel: tap103i0 (unregistering): left allmulticast mode
Jan 14 12:11:19 vsvr-7865-9 kernel: vmbr1: port 3(tap103i0) entered disabled state
Jan 14 12:11:19 vsvr-7865-9 kernel: zd128: p1
Jan 14 12:11:19 vsvr-7865-9 qmeventd[126402]: Starting cleanup for 103
Jan 14 12:11:19 vsvr-7865-9 qmeventd[126402]: Finished cleanup for 103
Jan 14 12:12:37 vsvr-7865-9 pvescheduler[120630]: INFO: Finished Backup of VM 221 (00:02:45)
Jan 14 12:12:37 vsvr-7865-9 pvescheduler[120630]: INFO: Starting Backup of VM 223 (qemu)
Jan 14 12:14:37 vsvr-7865-9 pvescheduler[120630]: INFO: Finished Backup of VM 223 (00:02:00)
Jan 14 12:14:37 vsvr-7865-9 pvescheduler[120630]: INFO: Backup job finished successfully
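To make the "Tasks state" table above easier to read, here is a rough Python sketch that totals the rss column per process name. It assumes 4 kB pages (the table says memory values are in pages) and that each row keeps the syslog layout shown above. The names `ROW`, `PAGE_KB` and `rss_by_name` are made up for this example, and only three rows are pasted in as sample input.

```python
import re
from collections import defaultdict

PAGE_KB = 4  # assumption: 4 kB pages, the usual x86-64 default

# One row of the OOM "Tasks state" table:
# [ pid ] uid tgid total_vm rss pgtables_bytes swapents oom_score_adj name
ROW = re.compile(
    r"kernel: \[\s*(\d+)\]\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(-?\d+)\s+(.+)$"
)


def rss_by_name(lines):
    """Return {process name: total rss in kB} for the task-table rows in lines."""
    totals = defaultdict(int)
    for line in lines:
        m = ROW.search(line.rstrip())
        if m:  # the header row and other kernel lines do not match
            totals[m.group(9)] += int(m.group(5)) * PAGE_KB
    return dict(totals)


# Three rows copied from the log above; feed the whole journal for a full picture.
sample = [
    "Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2209] 0 2209 2670274 2096191 19308544 0 0 kvm",
    "Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2353] 0 2353 4741529 4214387 36196352 0 0 kvm",
    "Jan 14 12:11:15 vsvr-7865-9 kernel: [ 2130] 0 2130 139855 41667 401408 0 0 corosync",
]

totals = rss_by_name(sample)
for name, kb in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(f"{name:<12} {kb / 1024 / 1024:.1f} GiB")
```

Run over the full table, this gives the resident memory held by the kvm processes, which can then be compared with the "pages RAM" figure in the same report.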
Any ideas would be appreciated.
Thank you,