Upgrade problem to 8.2 (6.8 kernel - nvidia drivers)

Hi,

I'm having trouble upgrading to PVE 8.2 (kernel 6.8) because the NVIDIA drivers fail to compile. Is there a way to solve this?

Code:
Updating file-restore initramfs...
12685 blocks
Setting up proxmox-widget-toolkit (4.2.1) ...
Setting up proxmox-kernel-6.8.4-2-pve-signed (6.8.4-2) ...
Examining /etc/kernel/postinst.d.
run-parts: executing /etc/kernel/postinst.d/dkms 6.8.4-2-pve /boot/vmlinuz-6.8.4-2-pve
dkms: running auto installation service for kernel 6.8.4-2-pve.
Sign command: /lib/modules/6.8.4-2-pve/build/scripts/sign-file
Signing key: /var/lib/dkms/mok.key
Public certificate (MOK): /var/lib/dkms/mok.pub


Building module:
Cleaning build area...
'make' -j32 NV_EXCLUDE_BUILD_MODULES='' KERNEL_UNAME=6.8.4-2-pve modules......(bad exit status: 2)
Error! Bad return status for module build on kernel: 6.8.4-2-pve (x86_64)
Consult /var/lib/dkms/nvidia/535.129.03/build/make.log for more information.
Error! One or more modules failed to install during autoinstall.
Refer to previous errors for more information.
dkms: autoinstall for kernel: 6.8.4-2-pve failed!
run-parts: /etc/kernel/postinst.d/dkms exited with return code 11
Failed to process /etc/kernel/postinst.d at /var/lib/dpkg/info/proxmox-kernel-6.8.4-2-pve-signed.postinst line 20.
dpkg: error processing package proxmox-kernel-6.8.4-2-pve-signed (--configure):
 installed proxmox-kernel-6.8.4-2-pve-signed package post-installation script subprocess returned error exit status 2
 
/var/lib/dkms/nvidia/535.129.03/build/make.log

Code:
DKMS make.log for nvidia-535.129.03 for kernel 6.8.4-2-pve (x86_64)
Wed Apr 24 18:29:57 CEST 2024
make[1]: Entering directory '/usr/src/linux-headers-6.8.4-2-pve'
warning: the compiler differs from the one used to build the kernel
  The kernel was built by: gcc (Debian 12.2.0-14) 12.2.0
  You are using:           cc (Debian 12.2.0-14) 12.2.0
  SYMLINK /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-kernel.o
 CONFTEST: hash__remap_4k_pfn
 CONFTEST: set_pages_uc
 CONFTEST: list_is_first
 CONFTEST: set_memory_uc
 CONFTEST: set_memory_array_uc
 CONFTEST: set_pages_array_uc
 CONFTEST: ioremap_cache
 CONFTEST: ioremap_wc
 CONFTEST: ioremap_driver_hardened
 CONFTEST: ioremap_driver_hardened_wc
 CONFTEST: ioremap_cache_shared
 CONFTEST: pci_get_domain_bus_and_slot
 CONFTEST: get_num_physpages
 CONFTEST: pde_data
 CONFTEST: xen_ioemu_inject_msi
 CONFTEST: phys_to_dma
 CONFTEST: get_dma_ops
 CONFTEST: dma_attr_macros
 CONFTEST: dma_map_page_attrs
 CONFTEST: write_cr4
 CONFTEST: of_find_node_by_phandle
 CONFTEST: of_node_to_nid
 CONFTEST: pnv_pci_get_npu_dev
 CONFTEST: of_get_ibm_chip_id
 CONFTEST: pci_bus_address
 CONFTEST: pci_stop_and_remove_bus_device
 CONFTEST: pci_rebar_get_possible_sizes
 CONFTEST: wait_for_random_bytes
 CONFTEST: register_cpu_notifier
 CONFTEST: cpuhp_setup_state
 CONFTEST: dma_map_resource
 CONFTEST: get_backlight_device_by_name
 CONFTEST: timer_setup
 CONFTEST: pci_enable_msix_range
 CONFTEST: kernel_read_has_pointer_pos_arg
 CONFTEST: kernel_write_has_pointer_pos_arg
 CONFTEST: dma_direct_map_resource
 CONFTEST: tegra_get_platform
 CONFTEST: tegra_bpmp_send_receive
 CONFTEST: flush_cache_all
 CONFTEST: vmf_insert_pfn
 CONFTEST: jiffies_to_timespec
 CONFTEST: ktime_get_raw_ts64
 CONFTEST: ktime_get_real_ts64
 CONFTEST: full_name_hash
 CONFTEST: pci_enable_atomic_ops_to_root
 CONFTEST: vga_tryget
 CONFTEST: cc_platform_has
 CONFTEST: seq_read_iter
 CONFTEST: unsafe_follow_pfn
 CONFTEST: drm_gem_object_get
 CONFTEST: drm_gem_object_put_unlocked
 CONFTEST: add_memory_driver_managed
 CONFTEST: device_property_read_u64
 CONFTEST: devm_of_platform_populate
 CONFTEST: of_dma_configure
 CONFTEST: of_property_count_elems_of_size
 CONFTEST: of_property_read_variable_u8_array
 CONFTEST: of_property_read_variable_u32_array
 CONFTEST: i2c_new_client_device
 CONFTEST: i2c_unregister_device
 CONFTEST: of_get_named_gpio
 CONFTEST: devm_gpio_request_one
 CONFTEST: gpio_direction_input
 CONFTEST: gpio_direction_output
 CONFTEST: gpio_get_value
 CONFTEST: gpio_set_value
 CONFTEST: gpio_to_irq
 CONFTEST: icc_get
 CONFTEST: icc_put
 CONFTEST: icc_set_bw
 CONFTEST: dma_buf_export_args
 CONFTEST: dma_buf_ops_has_kmap
 CONFTEST: dma_buf_ops_has_kmap_atomic
 CONFTEST: dma_buf_ops_has_map
 CONFTEST: dma_buf_ops_has_map_atomic
 CONFTEST: dma_buf_has_dynamic_attachment
 CONFTEST: dma_buf_attachment_has_peer2peer
 CONFTEST: dma_set_mask_and_coherent
 CONFTEST: devm_clk_bulk_get_all
 CONFTEST: get_task_ioprio
 CONFTEST: mdev_set_iommu_device
 CONFTEST: offline_and_remove_memory
 CONFTEST: vfio_register_notifier
 CONFTEST: vfio_register_emulated_iommu_dev
 CONFTEST: mdev_parent_dev
 CONFTEST: mdev_dev
 CONFTEST: mdev_get_type_group_id
 CONFTEST: mdev_uuid
 CONFTEST: mdev_from_dev
 CONFTEST: pci_irq_vector_helpers
 CONFTEST: kvmalloc
 CONFTEST: vfio_uninit_group_dev
 CONFTEST: vfio_pin_pages_has_vfio_device_arg
 CONFTEST: vfio_pin_pages_has_pages_arg
 CONFTEST: vfio_alloc_device
 CONFTEST: vfio_free_device
 CONFTEST: is_export_symbol_gpl_of_node_to_nid
 CONFTEST: is_export_symbol_gpl_sme_active
 CONFTEST: is_export_symbol_present_swiotlb_map_sg_attrs
 CONFTEST: is_export_symbol_present_swiotlb_dma_ops
 CONFTEST: is_export_symbol_present___close_fd
 CONFTEST: is_export_symbol_present_close_fd
 CONFTEST: is_export_symbol_present_get_unused_fd
 CONFTEST: is_export_symbol_present_get_unused_fd_flags
 CONFTEST: is_export_symbol_present_nvhost_get_default_device
 CONFTEST: is_export_symbol_present_nvhost_syncpt_unit_interface_get_byte_offset
 CONFTEST: is_export_symbol_present_nvhost_syncpt_unit_interface_get_aperture
 CONFTEST: is_export_symbol_present_tegra_dce_register_ipc_client
 CONFTEST: is_export_symbol_present_tegra_dce_unregister_ipc_client
 CONFTEST: is_export_symbol_present_tegra_dce_client_ipc_send_recv
 CONFTEST: is_export_symbol_present_dram_clk_to_mc_clk
 CONFTEST: is_export_symbol_present_get_dram_num_channels
 CONFTEST: is_export_symbol_present_tegra_dram_types
 CONFTEST: is_export_symbol_present_pxm_to_node
 CONFTEST: is_export_symbol_present_screen_info
 CONFTEST: is_export_symbol_present_i2c_bus_status
 CONFTEST: is_export_symbol_present_tegra_fuse_control_read
 CONFTEST: is_export_symbol_present_tegra_get_platform
 CONFTEST: is_export_symbol_present_pci_find_host_bridge
 CONFTEST: is_export_symbol_present_tsec_comms_send_cmd
 CONFTEST: is_export_symbol_present_tsec_comms_set_init_cb
 CONFTEST: is_export_symbol_present_tsec_comms_clear_init_cb
 CONFTEST: is_export_symbol_present_tsec_comms_alloc_mem_from_gscco
 CONFTEST: is_export_symbol_present_tsec_comms_free_gscco_mem
 CONFTEST: is_export_symbol_present_memory_block_size_bytes
 CONFTEST: crypto
 CONFTEST: dma_ops
 CONFTEST: swiotlb_dma_ops
 CONFTEST: noncoherent_swiotlb_dma_ops
 CONFTEST: vm_fault_has_address
 CONFTEST: vm_insert_pfn_prot
 CONFTEST: vmf_insert_pfn_prot
 CONFTEST: vm_ops_fault_removed_vma_arg
 CONFTEST: kmem_cache_has_kobj_remove_work
 CONFTEST: sysfs_slab_unlink
 CONFTEST: proc_ops
 CONFTEST: timespec64
 CONFTEST: vmalloc_has_pgprot_t_arg
 CONFTEST: mm_has_mmap_lock
 CONFTEST: pci_channel_state
 CONFTEST: pci_dev_has_ats_enabled
 CONFTEST: remove_memory_has_nid_arg
 CONFTEST: add_memory_driver_managed_has_mhp_flags_arg
 CONFTEST: num_registered_fb
 CONFTEST: pci_driver_has_driver_managed_dma
 CONFTEST: vm_area_struct_has_const_vm_flags
 CONFTEST: memory_failure_has_trapno_arg
 CONFTEST: enable_apicv
 CONFTEST: mdev_parent_ops
 CONFTEST: mdev_parent
 CONFTEST: vfio_info_add_capability_has_cap_type_id_arg
 CONFTEST: vfio_device_gfx_plane_info
 CONFTEST: vm_fault_t
 CONFTEST: vfio_device_migration_has_start_pfn
 CONFTEST: mdev_parent_ops_has_open_device
 CONFTEST: mdev_parent_ops_has_device_driver
 CONFTEST: vfio_device_mig_state
 CONFTEST: vfio_migration_ops
 CONFTEST: mdev_driver_has_supported_type_groups
 CONFTEST: vfio_device_ops_has_dma_unmap
 CONFTEST: vfio_device_ops_has_bind_iommufd
 CONFTEST: vfio_migration_ops_has_migration_get_data_size
 CONFTEST: vfio_log_ops
 CONFTEST: vfio_precopy_info
 CONFTEST: dom0_kernel_present
 CONFTEST: nvidia_vgpu_kvm_build
 CONFTEST: nvidia_grid_build
 CONFTEST: nvidia_grid_csp_build
 CONFTEST: get_user_pages
 CONFTEST: get_user_pages_remote
 CONFTEST: pin_user_pages
 CONFTEST: pin_user_pages_remote
 CONFTEST: pm_runtime_available
 CONFTEST: pci_class_multimedia_hd_audio
 CONFTEST: drm_available
 CONFTEST: vfio_pci_core_available
 CONFTEST: mdev_available
 CONFTEST: cmd_uphy_display_port_init
 CONFTEST: cmd_uphy_display_port_off
 CONFTEST: memory_failure_mf_sw_simulated_defined
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-pci.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-dmabuf.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-nano-timer.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-acpi.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-cray.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-dma.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-i2c.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-mmap.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-p2p.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-pat.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-procfs.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-usermap.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-vm.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-vtophys.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/os-interface.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/os-mlock.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/os-pci.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/os-registry.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/os-usermap.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-modeset-interface.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-pci-table.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-kthread-q.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-memdbg.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-ibmnpu.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-report-err.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-rsync.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-msi.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-caps.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-frontend.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv_uvm_interface.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_aead.o
/var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_aead.c:41:5: warning: no previous prototype for ‘libspdm_aead_prealloc’ [-Wmissing-prototypes]
   41 | int libspdm_aead_prealloc(void **context, char const *alg)
      |     ^~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_aead.c:171:5: warning: no previous prototype for ‘libspdm_aead_prealloced’ [-Wmissing-prototypes]
  171 | int libspdm_aead_prealloced(void *context,
      |     ^~~~~~~~~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_ecc.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_hkdf.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_rand.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_shash.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_aead_aes_gcm.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_sha.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_hmac_sha.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_hkdf_sha.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_ec.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/libspdm_x509.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-vgpu-vfio-interface.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nvlink_linux.o
  CC [M]  /var/lib/dkms/nvidia/535.129.03/build/nvidia/nvlink_caps.o
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-procfs.c:698:1: warning: no previous prototype for ‘exercise_error_forwarding_va’ [-Wmissing-prototypes]
  698 | exercise_error_forwarding_va(
      | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-ibmnpu.c:395:6: warning: no previous prototype for ‘nv_init_ibmnpu_info’ [-Wmissing-prototypes]
  395 | void nv_init_ibmnpu_info(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-ibmnpu.c:399:6: warning: no previous prototype for ‘nv_destroy_ibmnpu_info’ [-Wmissing-prototypes]
  399 | void nv_destroy_ibmnpu_info(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-ibmnpu.c:403:5: warning: no previous prototype for ‘nv_init_ibmnpu_devices’ [-Wmissing-prototypes]
  403 | int nv_init_ibmnpu_devices(nv_state_t *nv)
      |     ^~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-ibmnpu.c:408:6: warning: no previous prototype for ‘nv_unregister_ibmnpu_devices’ [-Wmissing-prototypes]
  408 | void nv_unregister_ibmnpu_devices(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-ibmnpu.c:428:5: warning: no previous prototype for ‘nv_get_ibmnpu_chip_id’ [-Wmissing-prototypes]
  428 | int nv_get_ibmnpu_chip_id(nv_state_t *nv)
      |     ^~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-ibmnpu.c:437:6: warning: no previous prototype for ‘nv_ibmnpu_cache_flush_numa_region’ [-Wmissing-prototypes]
  437 | void nv_ibmnpu_cache_flush_numa_region(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-dma.c:293:6: warning: no previous prototype for ‘nv_load_dma_map_scatterlist’ [-Wmissing-prototypes]
  293 | void nv_load_dma_map_scatterlist(
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-dma.c:489:23: warning: no previous prototype for ‘nv_dma_unmap_sgt’ [-Wmissing-prototypes]
  489 | NV_STATUS NV_API_CALL nv_dma_unmap_sgt(
      |                       ^~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/os-interface.c:370:7: warning: no previous prototype for ‘os_mem_copy_custom’ [-Wmissing-prototypes]
  370 | void *os_mem_copy_custom(
      |       ^~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-caps.c:269:5: warning: no previous prototype for ‘nv_cap_procfs_init’ [-Wmissing-prototypes]
  269 | int nv_cap_procfs_init(void)
      |     ^~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv.c:735:12: warning: no previous prototype for ‘nvidia_init_module’ [-Wmissing-prototypes]
  735 | int __init nvidia_init_module(void)
      |            ^~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv.c:855:6: warning: no previous prototype for ‘nvidia_exit_module’ [-Wmissing-prototypes]
  855 | void nvidia_exit_module(void)
      |      ^~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv.c:1124:23: warning: no previous prototype for ‘nv_get_num_dpaux_instances’ [-Wmissing-prototypes]
 1124 | NV_STATUS NV_API_CALL nv_get_num_dpaux_instances(nv_state_t *nv, NvU32 *num_instances)
      |                       ^~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-pci.c: In function ‘nv_pci_probe’:
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-pci.c:214:29: error: ‘const struct bus_type’ has no member named ‘iommu_ops’
  214 |         if (pci_dev->dev.bus->iommu_ops == NULL)
      |                             ^~
make[3]: *** [scripts/Makefile.build:243: /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv-pci.o] Error 1
make[3]: *** Waiting for unfinished jobs....
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nvlink_linux.c:313:12: warning: no previous prototype for ‘nvlink_core_init’ [-Wmissing-prototypes]
  313 | int __init nvlink_core_init(void)
      |            ^~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/535.129.03/build/nvidia/nvlink_linux.c:389:6: warning: no previous prototype for ‘nvlink_core_exit’ [-Wmissing-prototypes]
  389 | void nvlink_core_exit(void)
      |      ^~~~~~~~~~~~~~~~
make[2]: *** [/usr/src/linux-headers-6.8.4-2-pve/Makefile:1926: /var/lib/dkms/nvidia/535.129.03/build] Error 2
make[1]: *** [Makefile:240: __sub-make] Error 2
make[1]: Leaving directory '/usr/src/linux-headers-6.8.4-2-pve'
make: *** [Makefile:82: modules] Error 2
 
Ask NVIDIA to make their drivers compatible with Linux kernel 6.8 (only they have access to their sources), or use an open-source driver for your GPU instead.
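
The make.log above is mostly warnings, so the one fatal line is easy to miss. A minimal sketch for pulling it out, assuming the log path from the DKMS message; the function name `show_build_errors` is made up for illustration:

```shell
#!/bin/sh
# Print only the compiler errors from a DKMS make.log, skipping warnings.
# Default path: the one DKMS printed in the upgrade output; pass another as $1.
show_build_errors() {
    log="${1:-/var/lib/dkms/nvidia/535.129.03/build/make.log}"
    # gcc marks fatal diagnostics with " error: "; make's own "Error 2"
    # lines are capitalised and do not match this pattern.
    grep -n ' error: ' "$log"
}
```

Run with no argument on the affected host, this should surface the `iommu_ops` line from nv-pci.c, which is the only compile error in the log; everything else there is a `-Wmissing-prototypes` warning.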
 
Looks like we're on the same driver version. I'm hitting this as well, and I didn't heed the warning to pin the default kernel before starting the upgrade.

I'm not rebooting for now, just in case. My log looks very similar to yours. I was thinking of uninstalling the NVIDIA drivers, letting the kernel update go through as normal, and then installing the drivers under the new kernel, but I have a feeling that may not work out well.


Any ideas?
 
That is what I did, but I had to install the 550 driver directly from NVIDIA.

But I'm up to date and running.
 
Just to recap

1. you uninstalled the current nvidia drivers
2. then you ran the upgrade as normal
3. rebooted into new kernel
4. ran the .run 550 drivers under new kernel?

And everything was fine afterwards?
 
Yup. Purged nvidia before new driver install as well. But all good
 
Hi itguy327, could you help me with the steps to properly purge nvidia? I am having the same problem, but I have not been able to reinstall the nvidia drivers myself. I have run:

./NVIDIA-Linux-x86_64-550.54.10-vgpu-kvm-custom.run --uninstall

and

apt-get remove --purge '^nvidia-.*'

and

find /usr/lib -iname "*nvidia*" -delete

But still no luck.

I am getting this from the installation log:

make[1]: Entering directory '/usr/src/linux-headers-6.8.4-2-pve'
warning: the compiler differs from the one used to build the kernel
The kernel was built by: gcc (Debian 12.2.0-14) 12.2.0
You are using: cc (Debian 12.2.0-14) 12.2.0
CC [M] /tmp/selfgz41539/NVIDIA-Linux-x86_64-550.54.10-vgpu-kvm/kernel/nvidia/nv-pci.o
/tmp/selfgz41539/NVIDIA-Linux-x86_64-550.54.10-vgpu-kvm/kernel/nvidia/nv-pci.c: In function 'nv_pci_probe':
/tmp/selfgz41539/NVIDIA-Linux-x86_64-550.54.10-vgpu-kvm/kernel/nvidia/nv-pci.c:225:29: error: 'const struct bus_type' has no member named 'iommu_ops'
225 | if (pci_dev->dev.bus->iommu_ops == NULL)
| ^~
make[3]: *** [scripts/Makefile.build:243: /tmp/selfgz41539/NVIDIA-Linux-x86_64-550.54.10-vgpu-kvm/kernel/nvidia/nv-pci.o] Error 1
make[3]: Target '/tmp/selfgz41539/NVIDIA-Linux-x86_64-550.54.10-vgpu-kvm/kernel/' not remade because of errors.
make[2]: *** [/usr/src/linux-headers-6.8.4-2-pve/Makefile:1926: /tmp/selfgz41539/NVIDIA-Linux-x86_64-550.54.10-vgpu-kvm/kernel] Error 2
make[2]: Target 'modules' not remade because of errors.
make[1]: *** [Makefile:240: __sub-make] Error 2
make[1]: Target 'modules' not remade because of errors.
make[1]: Leaving directory '/usr/src/linux-headers-6.8.4-2-pve'
make: *** [Makefile:85: modules] Error 2
ERROR: The nvidia kernel module was not created.



It seems like a compiler error, but I do not know how to fix it.

Thank you very much
 
I did the following:
apt remove nvidia-driver
apt purge '*nvidia*'
apt autoremove
apt clean
Then I ran the NVIDIA installer from https://www.nvidia.com/Download/driverResults.aspx/224022/en-us
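Put together with the recap earlier in the thread, the order of operations can be sketched as a dry run that only prints the commands and executes nothing. Two things here are assumptions rather than something stated in the thread: `apt full-upgrade` stands in for "ran the upgrade as normal", and the installer file name is a placeholder for whatever file the download link provides.

```shell
#!/bin/sh
# Dry run only: prints the sequence described in this thread, runs nothing.
# INSTALLER is a placeholder name, not a file mentioned by anyone here.
INSTALLER="${INSTALLER:-./NVIDIA-Linux-x86_64-<version>.run}"

print_upgrade_plan() {
    # Steps 1-4 are the purge commands quoted above; "apt full-upgrade"
    # is an assumed stand-in for the normal Proxmox upgrade step.
    cat <<EOF
apt remove nvidia-driver
apt purge '*nvidia*'
apt autoremove
apt clean
apt full-upgrade
reboot
sh $INSTALLER
EOF
}

print_upgrade_plan
```

Printing the plan first makes it easy to check the order before anything is removed. The installer step comes after the reboot so that the module is built against the running 6.8 kernel.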
 
I'm having a similar issue. I've tried to reinstall and I just get an error in the installer. I've uninstalled the 535 drivers. Is there a way to revert to the previous kernel?
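On going back to an earlier kernel: as long as the older kernel package is still installed, it can usually be selected again. A small sketch for listing the candidates follows. The helper name other_kernels is invented for illustration, and it assumes the kernel images sit in /boot as vmlinuz-<version>, which matches the paths in the logs above:

```shell
#!/bin/sh
# List the kernel versions found in a boot directory, oldest first,
# leaving out the one that fails. other_kernels is a hypothetical helper.
other_kernels() {
    boot_dir="$1"
    skip="$2"
    for image in "$boot_dir"/vmlinuz-*; do
        [ -e "$image" ] || continue          # glob matched nothing
        version="${image##*/vmlinuz-}"       # strip directory and prefix
        [ "$version" = "$skip" ] || printf '%s\n' "$version"
    done | sort -V
}

# Example: other_kernels /boot 6.8.4-2-pve
# A version printed here can then be pinned with
#   proxmox-boot-tool kernel pin <version>
# and released again later with
#   proxmox-boot-tool kernel unpin
```

`proxmox-boot-tool kernel list` shows the same information from the boot tool's point of view. Whether pinning takes effect on a given host depends on how that host boots, so treat this as a starting point rather than a guaranteed fix.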
 
Hi everyone,


I think I have a kind of similar issue with r8168.


Code:
Setting up proxmox-kernel-6.8.4-2-pve-signed (6.8.4-2) ...
Examining /etc/kernel/postinst.d.
run-parts: executing /etc/kernel/postinst.d/dkms 6.8.4-2-pve /boot/vmlinuz-6.8.4-2-pve
dkms: running auto installation service for kernel 6.8.4-2-pve.
Sign command: /lib/modules/6.8.4-2-pve/build/scripts/sign-file
Signing key: /var/lib/dkms/mok.key
Public certificate (MOK): /var/lib/dkms/mok.pub


Building module:
Cleaning build area...
make -j6 KERNELRELEASE=6.8.4-2-pve -C /lib/modules/6.8.4-2-pve/build M=/var/lib/dkms/r8168/8.052.01-1/build........(bad exit status: 2)
Error! Bad return status for module build on kernel: 6.8.4-2-pve (x86_64)
Consult /var/lib/dkms/r8168/8.052.01-1/build/make.log for more information.
Error! One or more modules failed to install during autoinstall.
Refer to previous errors for more information.
dkms: autoinstall for kernel: 6.8.4-2-pve failed!
run-parts: /etc/kernel/postinst.d/dkms exited with return code 11
Failed to process /etc/kernel/postinst.d at /var/lib/dpkg/info/proxmox-kernel-6.8.4-2-pve-signed.postinst line 20.
dpkg: error processing package proxmox-kernel-6.8.4-2-pve-signed (--configure):
 installed proxmox-kernel-6.8.4-2-pve-signed package post-installation script subprocess returned error exit status 2
dpkg: dependency problems prevent configuration of proxmox-kernel-6.8:
 proxmox-kernel-6.8 depends on proxmox-kernel-6.8.4-2-pve-signed | proxmox-kernel-6.8.4-2-pve; however:
  Package proxmox-kernel-6.8.4-2-pve-signed is not configured yet.
  Package proxmox-kernel-6.8.4-2-pve is not installed.
  Package proxmox-kernel-6.8.4-2-pve-signed which provides proxmox-kernel-6.8.4-2-pve is not configured yet.


and the /var/lib/dkms/r8168/8.052.01-1/build/make.log content:

Code:
DKMS make.log for r8168-8.052.01-1 for kernel 6.8.4-2-pve (x86_64)
Fri Apr 26 06:54:34 CEST 2024
make: Entering directory '/usr/src/linux-headers-6.8.4-2-pve'
  CC [M]  /var/lib/dkms/r8168/8.052.01-1/build/r8168_n.o
  CC [M]  /var/lib/dkms/r8168/8.052.01-1/build/r8168_asf.o
  CC [M]  /var/lib/dkms/r8168/8.052.01-1/build/rtl_eeprom.o
  CC [M]  /var/lib/dkms/r8168/8.052.01-1/build/rtltool.o
/var/lib/dkms/r8168/8.052.01-1/build/rtl_eeprom.c:96:5: warning: no previous prototype for ‘rtl8168_eeprom_cmd_done’ [-Wmissing-prototypes]
   96 | int rtl8168_eeprom_cmd_done(struct rtl8168_private *tp)
      |     ^~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/r8168/8.052.01-1/build/r8168_n.c:1624:6: warning: no previous prototype for ‘_proc_dump_tx_desc’ [-Wmissing-prototypes]
 1624 | void _proc_dump_tx_desc(struct seq_file *m, struct TxDesc *desc_base,
      |      ^~~~~~~~~~~~~~~~~~
/var/lib/dkms/r8168/8.052.01-1/build/r8168_n.c:2949:5: warning: no previous prototype for ‘mdio_real_read’ [-Wmissing-prototypes]
 2949 | u32 mdio_real_read(struct rtl8168_private *tp,
      |     ^~~~~~~~~~~~~~
/var/lib/dkms/r8168/8.052.01-1/build/r8168_n.c:5726:1: warning: no previous prototype for ‘rtl8168_set_rx_vlan_filter’ [-Wmissing-prototypes]
 5726 | rtl8168_set_rx_vlan_filter(struct rtl8168_private *tp)
      | ^~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/r8168/8.052.01-1/build/r8168_n.c: In function ‘rtl8168_get_drvinfo’:
/var/lib/dkms/r8168/8.052.01-1/build/r8168_n.c:6296:17: error: implicit declaration of function ‘strlcpy’; did you mean ‘strscpy’? [-Werror=implicit-function-declaration]
 6296 |                 strlcpy(info->fw_version, rtl_fw->version,
      |                 ^~~~~~~
      |                 strscpy
/var/lib/dkms/r8168/8.052.01-1/build/r8168_n.c: At top level:
/var/lib/dkms/r8168/8.052.01-1/build/r8168_n.c:28087:5: warning: no previous prototype for ‘rtl8168_enable_msix’ [-Wmissing-prototypes]
28087 | int rtl8168_enable_msix(struct rtl8168_private *tp)
      |     ^~~~~~~~~~~~~~~~~~~
cc1: some warnings being treated as errors
make[2]: *** [scripts/Makefile.build:243: /var/lib/dkms/r8168/8.052.01-1/build/r8168_n.o] Error 1
make[1]: *** [/usr/src/linux-headers-6.8.4-2-pve/Makefile:1926: /var/lib/dkms/r8168/8.052.01-1/build] Error 2
make: *** [Makefile:240: __sub-make] Error 2
make: Leaving directory '/usr/src/linux-headers-6.8.4-2-pve'

So if I understand correctly, I must uninstall the r8168 driver and reinstall it afterwards, am I right?

Thanks in advance for your answer.

Kwenosaure
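The only hard error in the r8168 log is the call to `strlcpy`, which gcc suggests replacing with `strscpy`. As far as I know `strlcpy` was dropped from the kernel in 6.8, so reinstalling the same 8.052.01 source would most likely hit the same error. A rough way to check this against the installed headers is sketched below. The function name header_has_symbol is made up, and the header path in the example is an assumption based on the directory shown in the log:

```shell
#!/bin/sh
# Succeeds if the given header file mentions the symbol as a whole word.
# header_has_symbol is a hypothetical helper, not a kernel or dkms tool.
header_has_symbol() {
    header="$1"
    symbol="$2"
    grep -qw -- "$symbol" "$header"
}

# Example, assuming string.h lives under the headers directory from the log:
# header_has_symbol /usr/src/linux-headers-6.8.4-2-pve/include/linux/string.h strlcpy \
#     || echo "strlcpy is not declared; the driver source needs strscpy instead"
```

If the symbol is missing, the realistic options are a newer r8168 release that already uses `strscpy`, or the in-kernel r8169 driver.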
 
There's a need to differentiate between the vGPU host drivers and the "normal" ones. You are running a modified install script for an old version of the 17 series vGPU host drivers. That might be a frankendriver with part of the LTS series and the production series.

Since the Pascal architecture is not supported by the new 17 series of vGPU host drivers and the 16 series does not currently support kernel 6.8.x, I decided to skip vGPU for now and instead pass the GPU through to a single guest. That is not ideal.

As for the upgrade to Proxmox 8.2: I was able to reboot without kernel pinning, no issue, even though the build for the 6.8 kernel failed.
 
Ask NVidia to make their drivers compatible with Linux kernel 6.8 (as only they have access to their sources) or use an open source driver for your GPU instead.
I believe this is what most of us will need to do. Kernel magic is beyond my skill set, but I did find this thread where they are getting 6.8 working with nvidia on Fedora. If it's relevant or helps get things moving here, I'm happy to test.

nvidia-developers
 
I happened across this thread after running into errors trying to get nvidia vGPU 14 (nvidia 510) compiled under Proxmox 8.2. I have an older Tesla P4 card that I want to try with this. It seems I might be out of luck if the kernel module won't build.

It sucks too, looking at all the hardware now dropped from the v17 branch: https://docs.nvidia.com/grid/gpus-supported-by-vgpu.html



Edit: I recommend reading through https://gitlab.com/polloloco/vgpu-proxmox. It feels like useful info for our use cases (older cards, vGPU on Proxmox).
 
