[SOLVED] PVE 6.0.x doesn't load kvm_amd (EPYC 7551)

Nov 5, 2018
Please advise as I need to get production up and running again. Should I just revert back to the PVE 5.x version?

I had to R&R because of a hardware failure with no backups. I installed PVE 6.0.1 on an EPYC 7551, but it is not loading the virtualization module, and loading it manually does not work either.

# lscpu
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
Address sizes:       43 bits physical, 48 bits virtual
CPU(s):              64
On-line CPU(s) list: 0-63
Thread(s) per core:  2
Core(s) per socket:  32
Socket(s):           1
NUMA node(s):        4
Vendor ID:           AuthenticAMD
CPU family:          23
Model:               1
Model name:          AMD EPYC 7551P 32-Core Processor
Stepping:            2
CPU MHz:             2553.157
BogoMIPS:            3992.46
Virtualization:      AMD-V
L1d cache:           32K
L1i cache:           64K
L2 cache:            512K
L3 cache:            8192K
NUMA node0 CPU(s):   0,4,8,12,16,20,24,28,32,36,40,44,48,52,56,60
NUMA node1 CPU(s):   1,5,9,13,17,21,25,29,33,37,41,45,49,53,57,61
NUMA node2 CPU(s):   2,6,10,14,18,22,26,30,34,38,42,46,50,54,58,62
NUMA node3 CPU(s):   3,7,11,15,19,23,27,31,35,39,43,47,51,55,59,63
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid amd_dcm aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb hw_pstate sme ssbd ibpb vmmcall fsgsbase bmi1 avx2 smep bmi2 rdseed adx smap clflushopt sha_ni xsaveopt xsavec xgetbv1 xsaves clzero irperf xsaveerptr arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif overflow_recov succor smca

It's enabled in the BIOS and it was working fine in 5.x. I just don't want to reinstall and start over, so I am trying to find out whether I have hit a software bug in PVE 6.0.1.
root@C14G-H1-PVE:~# lsmod | grep kvm
root@C14G-H1-PVE:~# lsmod | grep -i kvm

Trying to load it myself:
root@C14G-H1-PVE:~# modprobe kvm_amd
modprobe: ERROR: ../libkmod/libkmod.c:586 kmod_search_moddep() could not open moddep file '/lib/modules/4.15.18-18-pve/modules.dep.bin'
modprobe: FATAL: Module kvm_amd not found in directory /lib/modules/4.15.18-18-pve
root@C14G-H1-PVE:~# modprobe kvm
modprobe: ERROR: ../libkmod/libkmod.c:586 kmod_search_moddep() could not open moddep file '/lib/modules/4.15.18-18-pve/modules.dep.bin'
modprobe: FATAL: Module kvm not found in directory /lib/modules/4.15.18-18-pve
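
Both errors point at /lib/modules/4.15.18-18-pve, so a quick thing to check is whether the module tree for the running kernel exists at all. A minimal sketch, assuming a POSIX shell; the function name check_module_tree and its output strings are made up for illustration and are not part of any Proxmox tool:

```shell
#!/bin/sh
# Hypothetical helper: report whether the module tree for a kernel release
# is present and has a dependency index (modules.dep.bin).
# $1 = kernel release (default: the running kernel, from uname -r)
# $2 = modules root   (default: /lib/modules)
check_module_tree() {
    release="${1:-$(uname -r)}"
    root="${2:-/lib/modules}"
    if [ ! -d "$root/$release" ]; then
        echo "missing: $root/$release"
        return 1
    fi
    if [ ! -f "$root/$release/modules.dep.bin" ]; then
        echo "no-index: $root/$release/modules.dep.bin"
        return 2
    fi
    echo "ok: $root/$release"
}

# Check the running kernel, then list the module trees that are installed.
check_module_tree || true
ls /lib/modules 2>/dev/null || true
```

If this reports "missing" for the running kernel while other releases are listed, the machine has probably booted a kernel that does not belong to the installed system.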

And of course the details:
root@C14G-H1-PVE:~# apt update
Hit:1 http://ftp.us.debian.org/debian buster InRelease
Get:2 http://ftp.us.debian.org/debian buster-updates InRelease [49.3 kB]     
Hit:3 http://security.debian.org buster/updates InRelease                                         
Hit:4 http://download.proxmox.com/debian/ceph-nautilus buster InRelease                           
Hit:5 https://enterprise.proxmox.com/debian/pve buster InRelease                             
Fetched 49.3 kB in 1s (61.1 kB/s)
Reading package lists... Done
Building dependency tree       
Reading state information... Done
All packages are up to date.
root@C14G-H1-PVE:~# apt upgrade
Reading package lists... Done
Building dependency tree       
Reading state information... Done
Calculating upgrade... Done
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
root@C14G-H1-PVE:~# uname -a
Linux C14G-H1-PVE 4.15.18-18-pve #1 SMP PVE 4.15.18-44 (Wed, 03 Jul 2019 11:19:13 +0200) x86_64 GNU/Linux

Please send the output of pveversion -v.
There is something wrong with your installation because Proxmox VE 6 uses the Linux Kernel version 5.0 and not 4.15.
root@C14G-H1-PVE:~# pveversion
pve-manager/6.0-7/28984024 (running kernel: 4.15.18-18-pve)
root@C14G-H1-PVE:~# pveversion -v
proxmox-ve: 6.0-2 (running kernel: 4.15.18-18-pve)
pve-manager: 6.0-7 (running version: 6.0-7/28984024)
pve-kernel-5.0: 6.0-8
pve-kernel-helper: 6.0-8
pve-kernel-5.0.21-2-pve: 5.0.21-3
pve-kernel-5.0.21-1-pve: 5.0.21-2
pve-kernel-5.0.15-1-pve: 5.0.15-1
ceph: 14.2.4-pve1
ceph-fuse: 14.2.4-pve1
corosync: 3.0.2-pve2
criu: 3.11-3
glusterfs-client: 5.5-3
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.12-pve1
libpve-access-control: 6.0-2
libpve-apiclient-perl: 3.0-2
libpve-common-perl: 6.0-4
libpve-guest-common-perl: 3.0-1
libpve-http-server-perl: 3.0-2
libpve-storage-perl: 6.0-8
libqb0: 1.0.5-1
lvm2: 2.03.02-pve3
lxc-pve: 3.1.0-65
lxcfs: 3.0.3-pve60
novnc-pve: 1.0.0-60
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.0-7
pve-cluster: 6.0-7
pve-container: 3.0-7
pve-docs: 6.0-4
pve-edk2-firmware: 2.20190614-1
pve-firewall: 4.0-7
pve-firmware: 3.0-2
pve-ha-manager: 3.0-2
pve-i18n: 2.0-3
pve-qemu-kvm: 4.0.0-5
pve-xtermjs: 3.13.2-1
qemu-server: 6.0-7
smartmontools: 7.0-pve2
spiceterm: 3.1-1
vncterm: 1.6-1
zfsutils-linux: 0.8.1-pve2
Even though I just installed fresh (the installer detected an old version, and I told it not to use it), I am guessing I'll need to reinstall again, unless someone has some magic for getting the 5.x kernel chugging along.
I have found the problem: the boot order was set to ISDM first, which had the old kernel version. I removed that boot option and booted from the SSD RAID. Now lsmod | grep -i kvm shows the appropriate virtualization modules loaded, and performance is back to "normal". Thank you!
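
For anyone hitting the same symptom, a rough way to spot a stale boot device is to compare the running kernel with the newest kernel image installed on the root filesystem. This is only a sketch: it assumes kernel images are named vmlinuz-<release> under /boot (the usual Debian layout) and that GNU sort with -V is available; the function names are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print the highest kernel release that has a
# vmlinuz-<release> image in the given boot directory (default: /boot).
newest_installed_kernel() {
    boot_dir="${1:-/boot}"
    ls "$boot_dir" 2>/dev/null | sed -n 's/^vmlinuz-//p' | sort -V | tail -n 1
}

# Hypothetical helper: compare a kernel release (default: the running one)
# against the newest installed image and report the result.
kernel_matches_install() {
    running="${1:-$(uname -r)}"
    newest="$(newest_installed_kernel "${2:-/boot}")"
    if [ "$running" = "$newest" ]; then
        echo "match: $running"
    else
        echo "mismatch: running $running, newest installed ${newest:-none}"
        return 1
    fi
}

kernel_matches_install || true
```

A mismatch is not always a fault (a newer kernel may simply be waiting for a reboot), but a running 4.15 kernel next to installed 5.0 images, as in this thread, suggests the firmware is booting from a different disk.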
Glad that you found the problem.
Please mark the thread as solved.


