Nvidia Driver Tomfoolery, unable to apt install anything.

jw6677

I spent a chunk of time trying to get CUDA / OpenCL working in Python, and seem to have broken something I should not have touched.

I'll preface this by saying that my plan is to reinstall Proxmox, but I hope to learn what happened by diagnosing the issue first.

Whenever I try to 'apt-get install' anything, I get the same DKMS error. An example with nvidia-driver is shown below (this is what I was working on when I first ran into the issue):

Code:
root@server:~# apt install nvidia-driver
Reading package lists... Done
Building dependency tree
Reading state information... Done
nvidia-driver is already the newest version (418.74-1).
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
2 not fully installed or removed.
After this operation, 0 B of additional disk space will be used.
Do you want to continue? [Y/n]
Setting up nvidia-kernel-dkms (418.74-1) ...
Removing old nvidia-current-418.74 DKMS files...

------------------------------
Deleting module version: 418.74
completely from the DKMS tree.
------------------------------
Done.
Loading new nvidia-current-418.74 DKMS files...
Building for 5.3.13-1-pve
Building initial module for 5.3.13-1-pve
Error! Bad return status for module build on kernel: 5.3.13-1-pve (x86_64)
Consult /var/lib/dkms/nvidia-current/418.74/build/make.log for more information.
dpkg: error processing package nvidia-kernel-dkms (--configure):
installed nvidia-kernel-dkms package post-installation script subprocess returned error exit status 10
dpkg: dependency problems prevent configuration of nvidia-driver:
nvidia-driver depends on nvidia-kernel-dkms (= 418.74-1) | nvidia-kernel-418.74; however:
  Package nvidia-kernel-dkms is not configured yet.
  Package nvidia-kernel-418.74 is not installed.
  Package nvidia-kernel-dkms which provides nvidia-kernel-418.74 is not configured yet.

dpkg: error processing package nvidia-driver (--configure):
dependency problems - leaving unconfigured
Errors were encountered while processing:
nvidia-kernel-dkms
nvidia-driver
E: Sub-process /usr/bin/dpkg returned an error code (1)

However, I get this same issue for any 'apt-get install' I try to run...

e.g.
Code:
root@server:~# apt-get install iperf
Reading package lists... Done
Building dependency tree
Reading state information... Done
iperf is already the newest version (2.0.12+dfsg1-2).
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
2 not fully installed or removed.
After this operation, 0 B of additional disk space will be used.
Do you want to continue? [Y/n]
Setting up nvidia-kernel-dkms (418.74-1) ...
Removing old nvidia-current-418.74 DKMS files...

------------------------------
Deleting module version: 418.74
completely from the DKMS tree.
------------------------------
Done.
Loading new nvidia-current-418.74 DKMS files...
Building for 5.3.13-1-pve
Building initial module for 5.3.13-1-pve
Error! Bad return status for module build on kernel: 5.3.13-1-pve (x86_64)
Consult /var/lib/dkms/nvidia-current/418.74/build/make.log for more information.
dpkg: error processing package nvidia-kernel-dkms (--configure):
installed nvidia-kernel-dkms package post-installation script subprocess returned error exit status 10
dpkg: dependency problems prevent configuration of nvidia-driver:
nvidia-driver depends on nvidia-kernel-dkms (= 418.74-1) | nvidia-kernel-418.74; however:
  Package nvidia-kernel-dkms is not configured yet.
  Package nvidia-kernel-418.74 is not installed.
  Package nvidia-kernel-dkms which provides nvidia-kernel-418.74 is not configured yet.

dpkg: error processing package nvidia-driver (--configure):
dependency problems - leaving unconfigured
Errors were encountered while processing:
nvidia-kernel-dkms
nvidia-driver
E: Sub-process /usr/bin/dpkg returned an error code (1)


I've spent a couple of hours trying to trace the errors shown, but I generally keep stumbling upon unrelated bugs and am not making progress in understanding what exactly went wrong. I am hoping someone can point me in the right direction.
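A note for readers on why unrelated installs such as iperf hit the same error: the "2 not fully installed or removed" line means dpkg still has packages waiting to be configured, and apt retries that configuration step at the end of every run. Below is a minimal sketch for listing those pending packages. The helper name pending_packages is made up for illustration, and the filter assumes the three-character status codes that dpkg-query prints for ${db:Status-Abbrev}.

```shell
#!/bin/sh
# Reads "STATUS PACKAGE" lines on stdin and prints the packages whose dpkg
# state is not fully installed: half-installed (H), unpacked (U),
# half-configured (F), or waiting on / pending triggers (W, t).
# pending_packages is a hypothetical helper name, not a dpkg command.
pending_packages() {
    awk 'substr($1, 2, 1) ~ /[HUFWt]/ { print $2 }'
}

# Typical use on the affected host (not run here):
#   dpkg-query -W -f='${db:Status-Abbrev} ${Package}\n' | pending_packages
```

Running "dpkg --audit" should give a similar overview with more explanation per package.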
 
Pastebin of cat /var/lib/dkms/nvidia-current/418.74/build/make.log:
https://pastebin.com/i7fnCyh3


Edit: I'm having trouble figuring out the proper commands to provide information from my end. Is there a Proxmox forum how-to or similar that helps beginners identify what they should attach to support requests?
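One thing that may help when attaching a DKMS build log: rather than pasting the whole make.log, pull out only the lines that mention an error. This is a rough sketch, not an official procedure. The function name first_build_errors is hypothetical, and matching on "error" or "fatal" is only a heuristic that can miss or over-report lines.

```shell
#!/bin/sh
# Print the first lines of a build log that mention an error, prefixed with
# their line numbers, so the relevant part can be quoted in a post.
# $1 = path to the log, $2 = maximum number of lines to print (default 20).
# first_build_errors is a hypothetical helper name.
first_build_errors() {
    grep -n -i -E 'error|fatal' "$1" | head -n "${2:-20}"
}

# Typical use (path taken from the DKMS message above, not run here):
#   first_build_errors /var/lib/dkms/nvidia-current/418.74/build/make.log
```

The output of "uname -r" and "dkms status" alongside it shows which kernel the module was being built for.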
 
Thanks Tom, I came across that thread when trying to debug on my own; however, the very first command is blocked by the same error.

See:
Code:
root@server:/mnt/lizardfs/ssd# apt install devscripts
Reading package lists... Done
Building dependency tree
Reading state information... Done
devscripts is already the newest version (2.19.5+deb10u1).
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
2 not fully installed or removed.
After this operation, 0 B of additional disk space will be used.
Do you want to continue? [Y/n] Y
Setting up nvidia-kernel-dkms (418.74-1) ...
Removing old nvidia-current-418.74 DKMS files...

------------------------------
Deleting module version: 418.74
completely from the DKMS tree.
------------------------------
Done.
Loading new nvidia-current-418.74 DKMS files...
Building for 5.3.13-1-pve
Building initial module for 5.3.13-1-pve
Error! Bad return status for module build on kernel: 5.3.13-1-pve (x86_64)
Consult /var/lib/dkms/nvidia-current/418.74/build/make.log for more information.
dpkg: error processing package nvidia-kernel-dkms (--configure):
 installed nvidia-kernel-dkms package post-installation script subprocess returned error exit status 10
dpkg: dependency problems prevent configuration of nvidia-driver:
 nvidia-driver depends on nvidia-kernel-dkms (= 418.74-1) | nvidia-kernel-418.74; however:
  Package nvidia-kernel-dkms is not configured yet.
  Package nvidia-kernel-418.74 is not installed.
  Package nvidia-kernel-dkms which provides nvidia-kernel-418.74 is not configured yet.

dpkg: error processing package nvidia-driver (--configure):
 dependency problems - leaving unconfigured
Errors were encountered while processing:
 nvidia-kernel-dkms
 nvidia-driver
E: Sub-process /usr/bin/dpkg returned an error code (1)

If I keep following along and dget the (slightly newer) ...430.64-4.dsc, it eventually leads me to:


Code:
root@server:~/nvidia-graphics-drivers-430.64# mk-build-deps --install
dh_testdir
dh_testroot
dh_prep
dh_testdir
dh_testroot
dh_install
dh_installdocs
dh_installchangelogs
dh_compress
dh_fixperms
dh_installdeb
dh_gencontrol
dh_md5sums
dh_builddeb
dpkg-deb: building package 'nvidia-graphics-drivers-build-deps' in '../nvidia-graphics-drivers-build-deps_430.64-4_all.deb'.

The package has been created.
Attention, the package has been created in the current directory,
not in ".." as indicated by the message above!
Selecting previously unselected package nvidia-graphics-drivers-build-deps.
(Reading database ... 276775 files and directories currently installed.)
Preparing to unpack nvidia-graphics-drivers-build-deps_430.64-4_all.deb ...
Unpacking nvidia-graphics-drivers-build-deps (430.64-4) ...
Reading package lists... Done
Building dependency tree
Reading state information... Done
Starting pkgProblemResolver with broken count: 0
Starting 2 pkgProblemResolver with broken count: 0
Done
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
3 not fully installed or removed.
After this operation, 0 B of additional disk space will be used.
Setting up nvidia-kernel-dkms (418.74-1) ...
Removing old nvidia-current-418.74 DKMS files...

------------------------------
Deleting module version: 418.74
completely from the DKMS tree.
------------------------------
Done.
Loading new nvidia-current-418.74 DKMS files...
Building for 5.3.13-1-pve
Building initial module for 5.3.13-1-pve
Error! Bad return status for module build on kernel: 5.3.13-1-pve (x86_64)
Consult /var/lib/dkms/nvidia-current/418.74/build/make.log for more information.
dpkg: error processing package nvidia-kernel-dkms (--configure):
 installed nvidia-kernel-dkms package post-installation script subprocess returned error exit status 10
Setting up nvidia-graphics-drivers-build-deps (430.64-4) ...
dpkg: dependency problems prevent configuration of nvidia-driver:
 nvidia-driver depends on nvidia-kernel-dkms (= 418.74-1) | nvidia-kernel-418.74; however:
  Package nvidia-kernel-dkms is not configured yet.
  Package nvidia-kernel-418.74 is not installed.
  Package nvidia-kernel-dkms which provides nvidia-kernel-418.74 is not configured yet.

dpkg: error processing package nvidia-driver (--configure):
 dependency problems - leaving unconfigured
Errors were encountered while processing:
 nvidia-kernel-dkms
 nvidia-driver
E: Sub-process /usr/bin/dpkg returned an error code (1)
(Reading database ... 276778 files and directories currently installed.)
Removing nvidia-graphics-drivers-build-deps (430.64-4) ...
mk-build-deps: Unable to install all build-dep packages


There seems to be a lower-level issue I've created for myself. The make.log is beyond my ability to understand, but I am still hoping to avoid reinstalling this node. (Though I have begun the slow process of migrating everything to other nodes...)

Is there anything else I might try or investigate?
 
I followed the bug report in the provided link, and I see that 430.64-4~bpo10+1 is now available!

For the other noobs who need a solution spelt out for them, here is what worked for me:


Code:
echo "deb http://deb.debian.org/debian buster-backports main non-free" >> /etc/apt/sources.list
apt update
apt-get -t buster-backports install nvidia-kernel-dkms=430.64-4~bpo10+1
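One caveat with the echo line above: because it uses >>, running it a second time adds a duplicate entry to sources.list. A small sketch of a guard that appends the line only when it is missing follows. The helper name append_once is made up for illustration.

```shell
#!/bin/sh
# Append a line to a file only if that exact line is not already present.
# $1 = file, $2 = line to add.
# append_once is a hypothetical helper name.
append_once() {
    grep -qxF -- "$2" "$1" 2>/dev/null || printf '%s\n' "$2" >> "$1"
}

# Typical use as root (not run here):
#   append_once /etc/apt/sources.list \
#       "deb http://deb.debian.org/debian buster-backports main non-free"
```

The apt update and apt-get -t buster-backports steps stay the same after that.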


Thank you to Tom for taking the time to help!