VM/Ubuntu Server random running issues

Dec 29, 2020
24
0
1
45
Hello guys,
Im running into issues with running VM's(ubuntu server).
Sometimes this happens at the beginning when I start to install OS, sometimes when I run 2 VMs .. completely randomly.
Screen Shot 2020-12-28 at 10.05.38 PM.png

in syslogs I see:
Code:
Dec 28 21:15:36 orion QEMU[10820]: KVM: entry failed, hardware error 0x5
Dec 28 21:15:36 orion QEMU[10820]: RAX=ffffffffa82da270 RBX=000000000000000b RCX=0000000000000001 RDX=000000000001598e
Dec 28 21:15:36 orion QEMU[10820]: RSI=00000043c991fd61 RDI=0000000000000082 RBP=ffffb136c00cbe90 RSP=ffffb136c00cbe70
Dec 28 21:15:36 orion QEMU[10820]: R8 =000000cd42e4dffb R9 =00000043d0f61461 R10=0000000000100000 R11=0000000000000000
Dec 28 21:15:36 orion QEMU[10820]: R12=000000000000000b R13=ffff97b27d289600 R14=0000000000000000 R15=0000000000000000
Dec 28 21:15:36 orion QEMU[10820]: RIP=ffffffffa82da65e RFL=00000246 [---Z-P-] CPL=0 II=0 A20=1 SMM=0 HLT=0
Dec 28 21:15:36 orion QEMU[10820]: ES =0000 0000000000000000 ffffffff 00c00000
Dec 28 21:15:36 orion QEMU[10820]: CS =0010 0000000000000000 ffffffff 00a09b00 DPL=0 CS64 [-RA]
Dec 28 21:15:36 orion QEMU[10820]: SS =0018 0000000000000000 ffffffff 00c09300 DPL=0 DS   [-WA]
Dec 28 21:15:36 orion QEMU[10820]: DS =0000 0000000000000000 ffffffff 00c00000
Dec 28 21:15:36 orion QEMU[10820]: FS =0000 0000000000000000 ffffffff 00c00000
Dec 28 21:15:36 orion QEMU[10820]: GS =0000 ffff97b27dac0000 ffffffff 00c00000
Dec 28 21:15:36 orion QEMU[10820]: LDT=0000 0000000000000000 ffffffff 00c00000
Dec 28 21:15:36 orion QEMU[10820]: TR =0040 fffffe0000234000 0000206f 00008b00 DPL=0 TSS64-busy
Dec 28 21:15:36 orion QEMU[10820]: GDT=     fffffe0000232000 0000007f
Dec 28 21:15:36 orion QEMU[10820]: IDT=     fffffe0000000000 00000fff
Dec 28 21:15:36 orion QEMU[10820]: CR0=80050033 CR2=00007fa224c0c4c0 CR3=00000000749fc000 CR4=000006e0
Dec 28 21:15:36 orion QEMU[10820]: DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000
Dec 28 21:15:36 orion QEMU[10820]: DR6=00000000fffe0ff0 DR7=0000000000000400
Dec 28 21:15:36 orion QEMU[10820]: EFER=0000000000000d01
Dec 28 21:15:36 orion QEMU[10820]: Code=53 00 f4 c3 66 90 e9 07 00 00 00 0f 00 2d b6 10 53 00 fb f4 <c3> 90 0f 1f 44 00 00 55 48 89 e5 41 55 41 54 53 e8 7d e3 64 ff 65 44 8b 25 d5 5c d3 57 0f
Dec 28 21:15:36 orion kernel: set kvm_intel.dump_invalid_vmcs=1 to dump internal KVM state.

Any help is appreciated.

Thanks!
 
Hi again,
did memory check - everything looks good.
Ive reinstalled proxmox.
the same issue :(

any ideas where might be the issue.
thanks in advance.
please let me know if I need to provide more specific info.

Screen Shot 2020-12-30 at 23.39.41.png
 
Please post the output of pveversion -v.
 
Code:
root@orion:~# pveversion -v
proxmox-ve: 6.3-1 (running kernel: 5.4.78-2-pve)
pve-manager: 6.3-3 (running version: 6.3-3/eee5f901)
pve-kernel-5.4: 6.3-3
pve-kernel-helper: 6.3-3
pve-kernel-5.4.78-2-pve: 5.4.78-2
pve-kernel-5.4.73-1-pve: 5.4.73-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.0.4-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.16-pve1
libproxmox-acme-perl: 1.0.5
libproxmox-backup-qemu0: 1.0.2-1
libpve-access-control: 6.1-3
libpve-apiclient-perl: 3.1-3
libpve-common-perl: 6.3-2
libpve-guest-common-perl: 3.1-3
libpve-http-server-perl: 3.0-6
libpve-storage-perl: 6.3-3
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.3-1
lxcfs: 4.0.3-pve3
novnc-pve: 1.1.0-1
proxmox-backup-client: 1.0.6-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.4-3
pve-cluster: 6.2-1
pve-container: 3.3-1
pve-docs: 6.3-1
pve-edk2-firmware: 2.20200531-1
pve-firewall: 4.1-3
pve-firmware: 3.1-3
pve-ha-manager: 3.1-1
pve-i18n: 2.2-2
pve-qemu-kvm: 5.1.0-7
pve-xtermjs: 4.7.0-3
qemu-server: 6.3-2
smartmontools: 7.1-pve2
spiceterm: 3.1-1
vncterm: 1.6-2
zfsutils-linux: 0.8.5-pve1
root@orion:~#
 
Last edited:
Hello again,
posting here more info about my checks:

Code:
root@orion:~# dmesg | grep -e DMAR -e IOMMU -e VT-d
[    0.010235] ACPI: DMAR 0x00000000798BFD30 0000C4 (v01 SUPERM SMCI--MB 00000001 INTL 20091013)
[    0.419550] DMAR: IOMMU enabled
[    0.816202] DMAR: Host address width 46
[    0.816203] DMAR: DRHD base: 0x000000fbffc000 flags: 0x1
[    0.816208] DMAR: dmar0: reg_base_addr fbffc000 ver 1:0 cap 8d2078c106f0466 ecap f020de
[    0.816209] DMAR: RMRR base: 0x0000007ba59000 end: 0x0000007ba68fff
[    0.816209] DMAR: ATSR flags: 0x0
[    0.816210] DMAR: RHSA base: 0x000000fbffc000 proximity domain: 0x0
[    0.816212] DMAR-IR: IOAPIC id 1 under DRHD base  0xfbffc000 IOMMU 0
[    0.816213] DMAR-IR: IOAPIC id 2 under DRHD base  0xfbffc000 IOMMU 0
[    0.816214] DMAR-IR: HPET id 0 under DRHD base 0xfbffc000
[    0.816215] DMAR-IR: x2apic is disabled because BIOS sets x2apic opt out bit.
[    0.816215] DMAR-IR: Use 'intremap=no_x2apic_optout' to override the BIOS setting.
[    0.816511] DMAR-IR: Enabled IRQ remapping in xapic mode
[    1.811241] DMAR: dmar0: Using Queued invalidation
[    1.826946] DMAR: Intel(R) Virtualization Technology for Directed I/O
root@orion:~#


Code:
root@orion:~# kvm-ok
INFO: /dev/kvm exists
KVM acceleration can be used
root@orion:~#
 
Have you tried other OSes as well or does the issue specifically appear on Ubuntu?
 
Have you tried other OSes as well or does the issue specifically appear on Ubuntu?
thanks for reply. Just tried CentOS, the same issue:
Code:
Jan 03 15:38:23 orion QEMU[14332]: KVM: entry failed, hardware error 0x5
Jan 03 15:38:23 orion kernel: set kvm_intel.dump_invalid_vmcs=1 to dump internal KVM state.
Jan 03 15:38:23 orion QEMU[14332]: RAX=ffffffffa3ad62d0 RBX=0000000000000004 RCX=0000000000000000 RDX=0000000000000001
Jan 03 15:38:23 orion QEMU[14332]: RSI=0000000000000000 RDI=00000001a07528c0 RBP=0000000000000004 RSP=ffffb197c039bea0
Jan 03 15:38:23 orion QEMU[14332]: R8 =0000000a7f25287e R9 =ffff994f4f2ed000 R10=0000000000000000 R11=0000000000018f38
Jan 03 15:38:23 orion QEMU[14332]: R12=ffffffffffffffff R13=0000000000000000 R14=0000000000000000 R15=0000000000000000
Jan 03 15:38:23 orion QEMU[14332]: RIP=ffffffffa3ad667e RFL=00000246 [---Z-P-] CPL=0 II=0 A20=1 SMM=0 HLT=0
Jan 03 15:38:23 orion QEMU[14332]: ES =0000 0000000000000000 ffffffff 00c00000
Jan 03 15:38:23 orion QEMU[14332]: CS =0010 0000000000000000 ffffffff 00a09b00 DPL=0 CS64 [-RA]
Jan 03 15:38:23 orion QEMU[14332]: SS =0018 0000000000000000 ffffffff 00c09300 DPL=0 DS   [-WA]
Jan 03 15:38:23 orion QEMU[14332]: DS =0000 0000000000000000 ffffffff 00c00000
Jan 03 15:38:23 orion QEMU[14332]: FS =0000 0000000000000000 ffffffff 00c00000
Jan 03 15:38:23 orion QEMU[14332]: GS =0000 ffff994fb8f00000 ffffffff 00c00000
Jan 03 15:38:23 orion QEMU[14332]: LDT=0000 0000000000000000 ffffffff 00c00000
Jan 03 15:38:23 orion QEMU[14332]: TR =0040 fffffe00000af000 0000206f 00008b00 DPL=0 TSS64-busy
Jan 03 15:38:23 orion QEMU[14332]: GDT=     fffffe00000ad000 0000007f
Jan 03 15:38:23 orion QEMU[14332]: IDT=     fffffe0000000000 00000fff
Jan 03 15:38:23 orion QEMU[14332]: CR0=80050033 CR2=00007f35d65e5000 CR3=000000007eef2000 CR4=000006e0
Jan 03 15:38:23 orion QEMU[14332]: DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000
Jan 03 15:38:23 orion QEMU[14332]: DR6=00000000fffe0ff0 DR7=0000000000000400
Jan 03 15:38:23 orion QEMU[14332]: EFER=0000000000000d01
Jan 03 15:38:23 orion QEMU[14332]: Code=08 75 c4 eb 80 90 e9 07 00 00 00 0f 00 2d a6 2c 53 00 fb f4 <c3> 90 e9 07 00 00 00 0f 00 2d 96 2c 53 00 f4 c3 90 90 0f 1f 44 00 00 41 55 41 54 55 53 e8
 
have you more details about your CPU (Stepping Code etc.)?
I do recall we had some very very strange issues with some gear purchased "somewhere" and it turned out that these have been ES (engineering sample) CPUs. After replacing those with "regular" ones all these strange things went away. Those also had been V4-Xeons IIRC ...
 
have you more details about your CPU (Stepping Code etc.)?
I do recall we had some very very strange issues with some gear purchased "somewhere" and it turned out that these have been ES (engineering sample) CPUs. After replacing those with "regular" ones all these strange things went away. Those also had been V4-Xeons IIRC ...

Thanks for reply. Do I need to run another command? Please let me know.

Code:
root@orion:~# dmidecode --type processor
# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 3.0 present.

Handle 0x003E, DMI type 4, 42 bytes
Processor Information
        Socket Designation: CPU1
        Type: Central Processor
        Family: Xeon
        Manufacturer: Intel
        ID: F1 06 04 00 FF FB EB BF
        Signature: Type 0, Family 6, Model 79, Stepping 1
        Flags:
                FPU (Floating-point unit on-chip)
                VME (Virtual mode extension)
                DE (Debugging extension)
                PSE (Page size extension)
                TSC (Time stamp counter)
                MSR (Model specific registers)
                PAE (Physical address extension)
                MCE (Machine check exception)
                CX8 (CMPXCHG8 instruction supported)
                APIC (On-chip APIC hardware supported)
                SEP (Fast system call)
                MTRR (Memory type range registers)
                PGE (Page global enable)
                MCA (Machine check architecture)
                CMOV (Conditional move instruction supported)
                PAT (Page attribute table)
                PSE-36 (36-bit page size extension)
                CLFSH (CLFLUSH instruction supported)
                DS (Debug store)
                ACPI (ACPI supported)
                MMX (MMX technology supported)
                FXSR (FXSAVE and FXSTOR instructions supported)
                SSE (Streaming SIMD extensions)
                SSE2 (Streaming SIMD extensions 2)
                SS (Self-snoop)
                HTT (Multi-threading)
                TM (Thermal monitor supported)
                PBE (Pending break enabled)
        Version: Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
        Voltage: 1.8 V
        External Clock: 100 MHz
        Max Speed: 4000 MHz
        Current Speed: 2400 MHz
        Status: Populated, Enabled
        Upgrade: Socket LGA2011-3
        L1 Cache Handle: 0x003B
        L2 Cache Handle: 0x003C
        L3 Cache Handle: 0x003D
        Serial Number: Not Specified
        Asset Tag: Not Specified
        Part Number: Not Specified
        Core Count: 14
        Core Enabled: 14
        Thread Count: 28
        Characteristics:
                64-bit capable
                Multi-Core
                Hardware Thread
                Execute Protection
                Enhanced Virtualization
                Power/Performance Control
 
Code:
root@orion:~# cat /proc/cpuinfo
processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2287.992
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 0
cpu cores       : 14
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 1
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2299.600
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 1
cpu cores       : 14
apicid          : 2
initial apicid  : 2
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 2
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2258.174
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 2
cpu cores       : 14
apicid          : 4
initial apicid  : 4
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 3
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2903.934
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 3
cpu cores       : 14
apicid          : 6
initial apicid  : 6
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 4
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2901.361
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 4
cpu cores       : 14
apicid          : 8
initial apicid  : 8
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 5
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2900.737
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 5
cpu cores       : 14
apicid          : 10
initial apicid  : 10
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 6
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2902.799
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 6
cpu cores       : 14
apicid          : 12
initial apicid  : 12
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 7
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2903.215
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 8
cpu cores       : 14
apicid          : 16
initial apicid  : 16
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 8
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2903.682
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 9
cpu cores       : 14
apicid          : 18
initial apicid  : 18
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:

processor       : 9
vendor_id       : GenuineIntel
cpu family      : 6
model           : 79
model name      : Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
stepping        : 1
microcode       : 0xb000038
cpu MHz         : 2903.921
cache size      : 35840 KB
physical id     : 0
siblings        : 28
core id         : 10
cpu cores       : 14
apicid          : 20
initial apicid  : 20
fpu             : yes
fpu_exception   : yes
cpuid level     : 20
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts md_clear flush_l1d
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs taa itlb_multihit
bogomips        : 4800.47
clflush size    : 64
cache_alignment : 64
address sizes   : 46 bits physical, 48 bits virtual
power management:
....
 
Looks ok from what I recall and what I have found here:
https://superuser.com/questions/635565/fake-intel-cpus

are you running the latest Bios/UEFI on your mainboard?
Yeah, https://www.supermicro.com/Bios/softfiles/10741/X10SRi-F_BIOS_3_2_release_notes.pdf
Code:
# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 3.0 present.
70 structures occupying 3499 bytes.
Table at 0x000EC9B0.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
        Vendor: American Megatrends Inc.
        Version: 3.2
        Release Date: 11/22/2019
        Address: 0xF0000
        Runtime Size: 64 kB
        ROM Size: 16 MB
        Characteristics:
                PCI is supported
                BIOS is upgradeable
                BIOS shadowing is allowed
                Boot from CD is supported
                Selectable boot is supported
                BIOS ROM is socketed
                EDD is supported
                5.25"/1.2 MB floppy services are supported (int 13h)
                3.5"/720 kB floppy services are supported (int 13h)
                3.5"/2.88 MB floppy services are supported (int 13h)
                Print screen service is supported (int 5h)
                8042 keyboard services are supported (int 9h)
                Serial services are supported (int 14h)
                Printer services are supported (int 17h)
                ACPI is supported
                USB legacy is supported
                BIOS boot specification is supported
                Targeted content distribution is supported
                UEFI is supported
        BIOS Revision: 5.6
Handle 0x0001, DMI type 1, 27 bytes
System Information
        Manufacturer: Supermicro
        Product Name: SYS-5018R-M
        Version: 0123456789
        Serial Number: S16231410B05598
        UUID: 00000000-0000-0000-0000-3cecef03f2ea
        Wake-up Type: Power Switch
        SKU Number: Default string
        Family: Default string

Handle 0x0002, DMI type 2, 15 bytes
Base Board Information
        Manufacturer: Supermicro
        Product Name: X10SRi-F
        Version: 1.01B
        Serial Number: NM206S007342
        Asset Tag: Default string
        Features:
                Board is a hosting board
                Board is replaceable
        Location In Chassis: Default string
        Chassis Handle: 0x0003
        Type: Motherboard
        Contained Object Handles: 0

Handle 0x0003, DMI type 3, 22 bytes
Chassis Information
        Manufacturer: Supermicro
        Type: Other
        Lock: Not Present
        Version: 0123456789
        Serial Number: C813MLJ27P10183
        Asset Tag: Default string
        Boot-up State: Safe
        Power Supply State: Safe
        Thermal State: Safe
        Security Status: None
        OEM Information: 0x00000000
        Height: Unspecified
and so on ...
 
You could try installing the latest microcode package (intel-microcode) which is available in the non-free repository.
 
You could try installing the latest microcode package (intel-microcode) which is available in the non-free repository.
thanks for reply Mira.
Looks like I have the latest microcode packages.
Code:
root@orion:~# dmesg | grep microcode
[    1.907325] microcode: sig=0x406f1, pf=0x1, revision=0xb000038
[    1.908139] microcode: Microcode Update Driver: v2.2.


root@orion:~# apt install intel-microcode
Reading package lists... Done
Building dependency tree       
Reading state information... Done
Package intel-microcode is not available, but is referred to by another package.
This may mean that the package is missing, has been obsoleted, or
is only available from another source

E: Package 'intel-microcode' has no installation candidate
root@orion:~#
 
Seems you missed to configure non-free repo.

See https://wiki.debian.org/Microcode
thanks for reply.
added

Code:
deb http://security.debian.org/ buster/updates main contrib non-free
deb-src http://security.debian.org/ buster/updates main contrib non-free
deb  http://deb.debian.org/debian buster main contrib non-free
deb-src  http://deb.debian.org/debian buster main contrib non-free

then

Code:
apt-get update
apt-get install intel-microcode

getting

Code:
Jan 04 09:57:45 orion QEMU[8878]: KVM: entry failed, hardware error 0x7
Jan 04 09:57:45 orion QEMU[8878]: EAX=00000000 EBX=00000000 ECX=00000000 EDX=00000600
Jan 04 09:57:45 orion QEMU[8878]: ESI=00000000 EDI=00000000 EBP=00000000 ESP=00000000
Jan 04 09:57:45 orion QEMU[8878]: EIP=00000000 EFL=00000002 [-------] CPL=0 II=0 A20=1 SMM=0 HLT=0
Jan 04 09:57:45 orion QEMU[8878]: ES =0000 00000000 0000ffff 00009300
Jan 04 09:57:45 orion QEMU[8878]: CS =1000 00010000 0000ffff 00009b00
Jan 04 09:57:45 orion QEMU[8878]: SS =0000 00000000 0000ffff 00009300
Jan 04 09:57:45 orion QEMU[8878]: DS =0000 00000000 0000ffff 00009300
Jan 04 09:57:45 orion QEMU[8878]: FS =0000 00000000 0000ffff 00009300
Jan 04 09:57:45 orion QEMU[8878]: GS =0000 00000000 0000ffff 00009300
Jan 04 09:57:45 orion QEMU[8878]: LDT=0000 00000000 0000ffff 00008200
Jan 04 09:57:45 orion QEMU[8878]: TR =0000 00000000 0000ffff 00008b00
Jan 04 09:57:45 orion QEMU[8878]: GDT=     00000000 0000ffff
Jan 04 09:57:45 orion QEMU[8878]: IDT=     00000000 0000ffff
Jan 04 09:57:45 orion QEMU[8878]: CR0=60000010 CR2=00000000 CR3=00000000 CR4=00000000
Jan 04 09:57:45 orion QEMU[8878]: DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000
Jan 04 09:57:45 orion QEMU[8878]: DR6=00000000ffff0ff0 DR7=0000000000000400
Jan 04 09:57:45 orion QEMU[8878]: EFER=0000000000000000
Jan 04 09:57:45 orion QEMU[8878]: Code=<ea> f2 d0 00 f0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
 
KVM: entry failed, hardware error 0x5
If you search for this in the www you will find that (old) thread:
https://ubuntuforums.org/showthread.php?t=2385050&s=90e18f1577a2f621f833431d07d9af73&page=2

Also those:
https://bugzilla.kernel.org/show_bug.cgi?id=197813
https://bugzilla.redhat.com/show_bug.cgi?id=1695596

All seem to be related to the v4 xeons. No one seems to have found a solution

I am sorry but I think there is nothing I can do to help.
You could just try things, use a different CPU or down core mode. But that's all guesswork
 
If you search for this in the www you will find that (old) thread:
https://ubuntuforums.org/showthread.php?t=2385050&s=90e18f1577a2f621f833431d07d9af73&page=2

Also those:
https://bugzilla.kernel.org/show_bug.cgi?id=197813
https://bugzilla.redhat.com/show_bug.cgi?id=1695596

All seem to be related to the v4 xeons. No one seems to have found a solution

I am sorry but I think there is nothing I can do to help.
You could just try things, use a different CPU or down core mode. But that's all guesswork
Thanks for reply.

Not sure if this is the root cause or not... Now I create VM's allocating 1-2 cores(ONLY).. before I was adding 28(max I have)
I hare 3 VMs green. installing 4th VM.

was running virt-host-validate and getting this:
 

Attachments

  • Screen Shot 2021-01-04 at 16.31.38.png
    Screen Shot 2021-01-04 at 16.31.38.png
    207.3 KB · Views: 17

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!