Over the past two weeks, a VM that had previously run quite reliably has crashed two or three times.
The VM runs on an AMD Ryzen node.
Error message:
Code:
Jul 09 06:50:47 evcc kernel: BUG: unable to handle page fault for address: ffffffffaf67d513
Jul 09 06:50:47 evcc kernel: #PF: supervisor write access in kernel mode
Jul 09 06:50:47 evcc kernel: #PF: error_code(0x0003) - permissions violation
Jul 09 06:50:47 evcc kernel: PGD d015067 P4D d015067 PUD d016063 PMD b6001e1
Jul 09 06:50:47 evcc kernel: Oops: 0003 [#2] PREEMPT SMP PTI
Jul 09 06:50:47 evcc kernel: CPU: 0 PID: 620 Comm: evcc Tainted: G D 6.1.0-37-amd64 #1 Debian 6.1.140-1
Jul 09 06:50:47 evcc kernel: Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 4.2025.02-3 04/03/2025
Jul 09 06:50:47 evcc kernel: RIP: 0010:kvm_kick_cpu+0x23/0x30
Jul 09 06:50:47 evcc kernel: Code: 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 63 ff 53 48 c7 c0 f8 99 01 00 31 db 48 8b 14 fd 60 8b a1 b0 0f b7 0c 02 b8 05 00 00 00 <0f> 01 c1 5b e9 04 48 d8 00 0f 1f 40 00 0f 1f>
Jul 09 06:50:47 evcc kernel: RSP: 0018:ffffa77d8227baf8 EFLAGS: 00010046
Jul 09 06:50:47 evcc kernel: RAX: 0000000000000005 RBX: 0000000000000000 RCX: 0000000000000001
Jul 09 06:50:47 evcc kernel: RDX: ffff8bf71e300000 RSI: 00000000000000ff RDI: 0000000000000001
Jul 09 06:50:47 evcc kernel: RBP: 0000000000031a80 R08: ffff8bf71febfec0 R09: 000000000000015d
Jul 09 06:50:47 evcc kernel: R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000000
Jul 09 06:50:47 evcc kernel: R13: 0000000000000283 R14: ffff8bf6c7a124f4 R15: 0000000000000001
Jul 09 06:50:47 evcc kernel: FS: 000000c000e40098(0000) GS:ffff8bf71e200000(0000) knlGS:0000000000000000
Jul 09 06:50:47 evcc kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 09 06:50:47 evcc kernel: CR2: ffffffffaf67d513 CR3: 00000000032bc000 CR4: 00000000000006f0
Jul 09 06:50:47 evcc kernel: Call Trace:
Jul 09 06:50:47 evcc kernel: <TASK>
Jul 09 06:50:47 evcc kernel: __pv_queued_spin_unlock_slowpath+0x98/0xd0
Jul 09 06:50:47 evcc kernel: __raw_callee_save___pv_queued_spin_unlock_slowpath+0x11/0x24
Jul 09 06:50:47 evcc kernel: .slowpath+0x9/0x16
Jul 09 06:50:47 evcc kernel: _raw_spin_unlock_irqrestore+0xa/0x40
Jul 09 06:50:47 evcc kernel: try_to_wake_up+0xd9/0x540
Jul 09 06:50:47 evcc kernel: wake_up_q+0x4a/0x90
Jul 09 06:50:47 evcc kernel: futex_wake+0x151/0x180
Jul 09 06:50:47 evcc kernel: do_futex+0xda/0x1b0
Jul 09 06:50:47 evcc kernel: __x64_sys_futex+0x8e/0x1d0
Jul 09 06:50:47 evcc kernel: do_syscall_64+0x55/0xb0
Jul 09 06:50:47 evcc kernel: ? ksys_write+0xd4/0xf0
Jul 09 06:50:47 evcc kernel: ? exit_to_user_mode_prepare+0x40/0x1e0
Jul 09 06:50:47 evcc kernel: ? syscall_exit_to_user_mode+0x1e/0x40
Jul 09 06:50:47 evcc kernel: ? do_syscall_64+0x61/0xb0
Jul 09 06:50:47 evcc kernel: ? _raw_spin_unlock+0xa/0x30
Jul 09 06:50:47 evcc kernel: ? finish_task_switch.isra.0+0x90/0x2d0
Jul 09 06:50:47 evcc kernel: ? __schedule+0x355/0x9e0
Jul 09 06:50:47 evcc kernel: ? switch_fpu_return+0x4c/0xd0
Jul 09 06:50:47 evcc kernel: ? exit_to_user_mode_prepare+0x14b/0x1e0
Jul 09 06:50:47 evcc kernel: ? syscall_exit_to_user_mode+0x1e/0x40
Jul 09 06:50:47 evcc kernel: ? do_syscall_64+0x61/0xb0
Jul 09 06:50:47 evcc kernel: ? swake_up_one+0x36/0x60
Jul 09 06:50:47 evcc kernel: ? _raw_spin_unlock_irqrestore+0xa/0x40
Jul 09 06:50:47 evcc kernel: ? rcu_core+0x1f2/0x4d0
Jul 09 06:50:47 evcc kernel: ? handle_softirqs+0xd7/0x280
Jul 09 06:50:47 evcc kernel: ? __irq_exit_rcu+0x3b/0xe0
Jul 09 06:50:47 evcc kernel: ? exit_to_user_mode_prepare+0x40/0x1e0
Jul 09 06:50:47 evcc kernel: entry_SYSCALL_64_after_hwframe+0x6e/0xd8
Jul 09 06:50:47 evcc kernel: RIP: 0033:0x484143
Jul 09 06:50:47 evcc kernel: Code: 24 20 c3 cc cc cc cc 48 8b 7c 24 08 8b 74 24 10 8b 54 24 14 4c 8b 54 24 18 4c 8b 44 24 20 44 8b 4c 24 28 b8 ca 00 00 00 0f 05 <89> 44 24 30 c3 cc cc cc cc cc cc cc cc cc cc>
Jul 09 06:50:47 evcc kernel: RSP: 002b:000000c0000b9e38 EFLAGS: 00000202 ORIG_RAX: 00000000000000ca
Jul 09 06:50:47 evcc kernel: RAX: ffffffffffffffda RBX: 0000000000000001 RCX: 0000000000484143
Jul 09 06:50:47 evcc kernel: RDX: 0000000000000001 RSI: 0000000000000081 RDI: 000000c001382148
Jul 09 06:50:47 evcc kernel: RBP: 000000c0000b9e88 R08: 0000000000000000 R09: 0000000000000000
Jul 09 06:50:47 evcc kernel: R10: 0000000000000000 R11: 0000000000000202 R12: 000000c0000b9ea0
Jul 09 06:50:47 evcc kernel: R13: 000000c001c00ae0 R14: 000000c001156540 R15: 0000000000000000
Jul 09 06:50:47 evcc kernel: </TASK>
Jul 09 06:50:47 evcc kernel: Modules linked in: binfmt_misc nls_ascii nls_cp437 bochs drm_vram_helper vfat cfg80211 drm_ttm_helper ttm drm_kms_helper virtio_console virtio_balloon fat rfkill button evdev joy>
Jul 09 06:50:47 evcc kernel: CR2: ffffffffaf67d513
Jul 09 06:50:47 evcc kernel: ---[ end trace 0000000000000000 ]---
Jul 09 06:50:47 evcc kernel: RIP: 0010:kvm_kick_cpu+0x23/0x30
Jul 09 06:50:47 evcc kernel: Code: 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 63 ff 53 48 c7 c0 f8 99 01 00 31 db 48 8b 14 fd 60 8b a1 b0 0f b7 0c 02 b8 05 00 00 00 <0f> 01 c1 5b e9 04 48 d8 00 0f 1f 40 00 0f 1f>
Jul 09 06:50:47 evcc kernel: RSP: 0018:ffffa77d800a3da0 EFLAGS: 00010046
Jul 09 06:50:47 evcc kernel: RAX: 0000000000000005 RBX: 0000000000000000 RCX: 0000000000000001
Jul 09 06:50:47 evcc kernel: RDX: ffff8bf71e300000 RSI: 00000000000000ff RDI: 0000000000000001
Jul 09 06:50:47 evcc kernel: RBP: ffff8bf71e338500 R08: ffff8bf71febfec0 R09: 00000000000001c9
Jul 09 06:50:47 evcc kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff8bf6c19d7a00
Jul 09 06:50:47 evcc kernel: R13: 0000000000000001 R14: 0000000000000001 R15: ffff8bf71e331480
Jul 09 06:50:47 evcc kernel: FS: 000000c000e40098(0000) GS:ffff8bf71e200000(0000) knlGS:0000000000000000
Jul 09 06:50:47 evcc kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 09 06:50:47 evcc kernel: CR2: ffffffffaf67d513 CR3: 00000000032bc000 CR4: 00000000000006f0
Jul 09 06:50:47 evcc kernel: note: evcc[620] exited with irqs disabled
Jul 09 06:50:47 evcc kernel: note: evcc[620] exited with preempt_count 2
Jul 09 06:50:47 evcc kernel: rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
Jul 09 06:50:47 evcc kernel: rcu: 1-...!: (0 ticks this GP) idle=0324/1/0x4000000000000002 softirq=839512/839512 fqs=0
Jul 09 06:50:47 evcc kernel: (detected by 0, t=5253 jiffies, g=1204569, q=56 ncpus=2)
Jul 09 06:50:47 evcc kernel: Sending NMI from CPU 0 to CPUs 1:
Jul 09 06:50:47 evcc kernel: NMI backtrace for cpu 1 skipped: idling at native_halt+0xa/0x10
Jul 09 06:50:47 evcc kernel: rcu: rcu_preempt kthread timer wakeup didn't happen for 5253 jiffies! g1204569 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
Jul 09 06:50:47 evcc kernel: rcu: Possible timer handling issue on cpu=1 timer-softirq=874118
Until now, the VM ran on a different node (Intel N95).
It is not clear to me why the VM crashes.
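To double-check how I read the `error_code(0x0003)` line in the trace, here is a minimal Python sketch that decodes the bits. The helper name `decode_pf_error_code` is my own, and the bit layout assumed is the standard x86 page-fault error code. It only restates what the kernel already prints (a supervisor-mode write that hit a permissions violation) and does not explain the cause.

```python
# Decode an x86 page-fault error code such as the 0x0003 from the log.
# The function name is hypothetical; the bit meanings follow the x86
# page-fault error code layout.

def decode_pf_error_code(code: int) -> dict:
    """Return the individual flags encoded in a page-fault error code."""
    return {
        "protection_violation": bool(code & 0x01),  # False = page not present
        "write_access": bool(code & 0x02),          # False = read access
        "user_mode": bool(code & 0x04),             # False = supervisor (kernel)
        "reserved_bit_set": bool(code & 0x08),
        "instruction_fetch": bool(code & 0x10),
    }


if __name__ == "__main__":
    # Value taken from the "#PF: error_code(0x0003)" line in the trace.
    for name, value in decode_pf_error_code(0x0003).items():
        print(f"{name}: {value}")
```

For 0x0003, only `protection_violation` and `write_access` are set, which matches the "supervisor write access in kernel mode" and "permissions violation" lines in the log.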