vGPU driver can't find the kernel

lu549772129

Jan 12, 2020
Hi everyone,
I downloaded the vGPU driver from the NVIDIA enterprise website and tried to install it, but the installer fails with the error below. I don't know what to do, please help :)
Proxmox VE version: 6.1
vGPU driver version: 410.137

ERROR: Unable to find the kernel source tree for the currently running kernel. Please make sure you have installed the kernel source files for your kernel and that they are properly configured; on Red Hat Linux systems, for example, be sure you have the 'kernel-source' or 'kernel-devel' RPM installed. If you know the correct kernel source files are installed, you may specify the kernel source path with the '--kernel-source-path' command line option.

 
Did you try installing the pve-headers package?

Code:
apt install pve-headers
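
A minimal sketch of requesting the headers for the kernel that is actually running, rather than relying on the meta package alone. It assumes the PVE 6.x package naming `pve-headers-<release>`; the helper name `headers_pkg` is hypothetical and only builds the package name, it does not install anything.

```shell
#!/bin/sh
# Build the header package name for a kernel release.
# Assumption: PVE 6.x naming scheme, pve-headers-<release>.
headers_pkg() {
    # $1: kernel release; defaults to the running kernel (uname -r)
    release="${1:-$(uname -r)}"
    printf 'pve-headers-%s\n' "$release"
}

# Usage (as root), shown as a comment so nothing is installed by accident:
#   apt update && apt install "$(headers_pkg)"
```

If the installer still cannot find the kernel source afterwards, comparing the output of `uname -r` with the installed header packages is a reasonable next check.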
 
Hi,
I got exactly the same issue (PVE 6.1, Nvidia 440.59).
At least for me installing the pve-headers package did not change anything.

Do you have any other idea?

Thanks

Edit:
These headers are installed:
Code:
dpkg -l pve-headers*
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name                     Version      Architecture Description
+++-========================-============-============-=================================
ii  pve-headers              6.1-2        all          Default Proxmox VE Kernel Headers
ii  pve-headers-5.3          6.1-4        all          Latest Proxmox VE Kernel Headers
ii  pve-headers-5.3.18-1-pve 5.3.18-1     amd64        The Proxmox PVE Kernel Headers
 
Well, if the installer does not find the source automatically, you can always pass the --kernel-source-path flag, as the error message says.
(The headers are in /usr/src/linux-headers-{version}.)
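
As a rough sketch, the header directory can be derived from the kernel release instead of typed by hand. The helper name `kernel_source_path` is hypothetical, and the `/usr/src/linux-headers-<release>` layout is taken from the paths used in this thread.

```shell
#!/bin/sh
# Derive the header directory for a kernel release.
# Assumption: headers are unpacked under /usr/src/linux-headers-<release>.
kernel_source_path() {
    # $1: kernel release; defaults to the running kernel (uname -r)
    release="${1:-$(uname -r)}"
    printf '/usr/src/linux-headers-%s\n' "$release"
}

# Usage, shown as comments (replace <version> with your driver version):
#   src="$(kernel_source_path)"
#   [ -d "$src" ] || echo "headers missing: $src" >&2
#   bash NVIDIA-Linux-x86_64-<version>.run --kernel-source-path "$src"
```

Defaulting to `uname -r` keeps the path tied to the running kernel, which is the kernel the module will be loaded into.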
 
Thank you very much!

This brought me a little bit further: bash NVIDIA-Linux-x86_64-440.59.run --kernel-source-path /usr/src/linux-headers-5.3.18-1-pve/

The kernel module now compiles, but I get this error (attached screenshot: 1581340341016.png):

I already blacklisted the conflicting drivers in /etc/modprobe.d/blacklist.conf:
Code:
    blacklist vga16fb
    blacklist nouveau
    blacklist rivafb
    blacklist nvidiafb
    blacklist rivatv

This is the output of the nvidia log at the end:
Code:
[   36.653239] vfio-pci 0000:1a:00.0: vfio_ecap_init: hiding ecap 0x19@0x168
[   97.544769] nvidia: disagrees about version of symbol module_layout

What can be the reason for this?
 
What kernel are you running at the moment?
Please post the output of pveversion -v (or uname -a).
 
Thanks for the help!
Here is the output of pveversion -v :
Code:
proxmox-ve: 6.1-2 (running kernel: 5.3.10-1-pve)
pve-manager: 6.1-3 (running version: 6.1-3/37248ce6)
pve-kernel-5.3: 6.0-12
pve-kernel-helper: 6.0-12
pve-kernel-5.3.10-1-pve: 5.3.10-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.0.2-pve4
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.13-pve1
libpve-access-control: 6.0-5
libpve-apiclient-perl: 3.0-2
libpve-common-perl: 6.0-9
libpve-guest-common-perl: 3.0-3
libpve-http-server-perl: 3.0-3
libpve-storage-perl: 6.1-2
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve3
lxc-pve: 3.2.1-1
lxcfs: 3.0.3-pve60
novnc-pve: 1.1.0-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.1-1
pve-cluster: 6.1-2
pve-container: 3.0-14
pve-docs: 6.1-3
pve-edk2-firmware: 2.20191002-1
pve-firewall: 4.0-9
pve-firmware: 3.0-4
pve-ha-manager: 3.0-8
pve-i18n: 2.0-3
pve-qemu-kvm: 4.1.1-2
pve-xtermjs: 3.13.2-1
qemu-server: 6.1-2
smartmontools: 7.0-pve2
spiceterm: 3.1-1
vncterm: 1.6-1
zfsutils-linux: 0.8.2-pve2
 
proxmox-ve: 6.1-2 (running kernel: 5.3.10-1-pve)
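You are not running that kernel at the moment: the headers you built against are for 5.3.18-1-pve, but the running kernel is 5.3.10-1-pve. You have to upgrade your packages and reboot into the new kernel, then try to build the driver again.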
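
A small sketch of how this mismatch could be caught before starting the installer: compare the release encoded in the header directory passed to --kernel-source-path with the running kernel. The function name `headers_match_kernel` is hypothetical.

```shell
#!/bin/sh
# Return 0 if the header directory belongs to the given kernel release.
headers_match_kernel() {
    src="${1%/}"                   # header directory, trailing slash removed
    release="${2:-$(uname -r)}"    # defaults to the running kernel
    # Strip everything up to and including "/linux-headers-" and compare.
    [ "${src##*/linux-headers-}" = "$release" ]
}

# Usage, shown as comments:
#   headers_match_kernel /usr/src/linux-headers-5.3.18-1-pve/ \
#       || echo "headers do not match running kernel $(uname -r)" >&2
```

A module built against headers for a different kernel typically fails to load with a message like the "disagrees about version of symbol module_layout" line posted above.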
 
Thanks a lot!
Stupid error ;)

I also had to install libglvnd-dev: apt-get install -y libglvnd-dev
The full line for installing was then: bash NVIDIA-Linux-x86_64-440.59.run --kernel-source-path /usr/src/linux-headers-5.3.18-1-pve/ --glvnd-egl-config-path /usr/include/glvnd
 
Hey. I think I am running into pretty much the same problem, but the solutions mentioned here do not seem to work for me. I am still getting the "nvidia.ko" error you attached above.

My pveversion output is: pve-manager/8.0.4/d258a813cfa6b390 (running kernel: 6.2.16-10-pve)

Even after installing libglvnd-dev and running the command you mentioned above, I still get the nvidia.ko error.

I have both an iGPU (i9-11900k) and an RTX 4070. I am looking to use my RTX 4070.
 
On a normal PVE kernel with the normal NVIDIA packages, you shouldn't have to specify any of those options (except maybe --dkms). See: https://pve.proxmox.com/wiki/NVIDIA_vGPU_on_Proxmox_VE_7.x#Host_Driver_Installation
 
Hmm. I have a normal pve kernel and I am getting exactly this error message trying to install the vGPU driver.

pveversion -v:
Code:
proxmox-ve: 8.1.0 (running kernel: 6.5.11-4-pve)
pve-manager: 8.1.3 (running version: 8.1.3/b46aac3b42da5d15)
proxmox-kernel-helper: 8.0.9
proxmox-kernel-6.5.11-4-pve-signed: 6.5.11-4
proxmox-kernel-6.5: 6.5.11-4
ceph-fuse: 17.2.7-pve1
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown2: 3.2.0-1+pmx7
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-4
libknet1: 1.28-pve1
libproxmox-acme-perl: 1.5.0
libproxmox-backup-qemu0: 1.4.0
libproxmox-rs-perl: 0.3.1
libpve-access-control: 8.0.7
libpve-apiclient-perl: 3.3.1
libpve-common-perl: 8.1.0
libpve-guest-common-perl: 5.0.6
libpve-http-server-perl: 5.0.5
libpve-network-perl: 0.9.4
libpve-rs-perl: 0.8.7
libpve-storage-perl: 8.0.5
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 5.0.2-4
lxcfs: 5.0.3-pve3
novnc-pve: 1.4.0-3
proxmox-backup-client: 3.0.4-1
proxmox-backup-file-restore: 3.0.4-1
proxmox-kernel-helper: 8.0.9
proxmox-mail-forward: 0.2.2
proxmox-mini-journalreader: 1.4.0
proxmox-offline-mirror-helper: 0.6.2
proxmox-widget-toolkit: 4.1.3
pve-cluster: 8.0.5
pve-container: 5.0.8
pve-docs: 8.1.3
pve-edk2-firmware: 4.2023.08-1
pve-firewall: 5.0.3
pve-firmware: 3.9-1
pve-ha-manager: 4.0.3
pve-i18n: 3.1.2
pve-qemu-kvm: 8.1.2-4
pve-xtermjs: 5.3.0-2
qemu-server: 8.0.10
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.2.0-pve3
 
Which driver version did you try to install (and with which exact command), and what exact error did you get? There are at least two different errors in this thread.
 
Sorry, my bad.

The error I was getting was the same as in the OP.

I misread the comment about not having to specify any options as not having to install the headers. After installing the headers, the error message went away.
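
A hedged pre-flight sketch for this situation: before running the installer, check whether a build tree exists for the running kernel. It assumes the usual Linux convention that header packages provide `/lib/modules/<release>/build`, which has not been verified for every Proxmox release. The function name `have_kernel_build_tree` and its second argument (an alternate root directory, useful for testing) are hypothetical.

```shell
#!/bin/sh
# Return 0 if a kernel build tree is reachable for the given release.
# Assumption: header packages provide /lib/modules/<release>/build.
have_kernel_build_tree() {
    release="${1:-$(uname -r)}"    # defaults to the running kernel
    root="${2:-}"                  # optional alternate root, empty means /
    [ -d "$root/lib/modules/$release/build" ]
}

# Usage, shown as comments:
#   have_kernel_build_tree \
#       || echo "no headers found for $(uname -r), install them first" >&2
```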
 
