Kernal Errors Related to KVM

killerherts

Member
Apr 12, 2020
14
2
8
34
One of my virtual machines keeps rebooting randomly at night, but i cannot seem to make head or tails of the error log.
12/18 @ 2:03
12/18 @ 5:14
these are my crash times from today

I have attached my kern.log
 

Attachments

hi,

in your log i see:
Code:
Dec 16 12:00:13 Hades kernel: [    1.307377] mce: [Hardware Error]: CPU 19: Machine Check: 0 Bank 1: a000000000000000
Dec 16 12:00:13 Hades kernel: [    1.307377] mce: [Hardware Error]: TSC 0
Dec 16 12:00:13 Hades kernel: [    1.307377] mce: [Hardware Error]: PROCESSOR 0:f61 TIME 1608148787 SOCKET 1 APIC 19 microcode 1

you should check your hardware.. or is this from the VM? if yes then qm config VMID can shed some more light
 
hi,

in your log i see:
Code:
Dec 16 12:00:13 Hades kernel: [    1.307377] mce: [Hardware Error]: CPU 19: Machine Check: 0 Bank 1: a000000000000000
Dec 16 12:00:13 Hades kernel: [    1.307377] mce: [Hardware Error]: TSC 0
Dec 16 12:00:13 Hades kernel: [    1.307377] mce: [Hardware Error]: PROCESSOR 0:f61 TIME 1608148787 SOCKET 1 APIC 19 microcode 1

you should check your hardware.. or is this from the VM? if yes then qm config VMID can shed some more light
When you say from the vm you mean the log from inside that installation correct if so no. These are displayed in my node kernel log I did go through and make sure all the memory DIMMs are installed correctly this briefly stopped the error from coming back but 24 hours later it was back. Would the next step be to do a long memtest on all the memory stick in the sever?