Nvidia RTX 4090 Black Screen : only after OS boot w/ DisplayPort

comma

New Member
Aug 23, 2023
One day, while booting my Proxmox Windows VM, DisplayPort output stopped working. That sounds solvable, but so far it has been an impossible quest with zero answers.

Before I list everything I tried, note that DisplayPort actually works before the OS boots: I can go into the BIOS, and I can run in safe mode with the generic drivers, with no issues. HDMI also works fine, even inside the OS.

Here is everything I tried:
=====================
* Update to the latest drivers 560.94
* Update the UEFI Firmware that "fixes" DisplayPort (https://nvidia.custhelp.com/app/answers/detail/a_id/5411/~/nvidia-gpu-uefi-firmware-update-tool)
* Create a new Windows VM
* Create a desktop Ubuntu 24.04 VM
* Change cables
* Turn Off/On the machine and let it wait ^^
* Take the GPU to a friend's bare-metal machine with a normal Windows install and swap it with his 4080


Nothing I did made DisplayPort work after the OS boots, with one exception: a few dummy DisplayPort plugs that I had lying around from a remote gaming project. They are recognized instantly after the OS boots, without problems.
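
Since the dummy plugs are detected but the real monitor is not, it may be worth comparing what the kernel itself reports for each connector in both cases. The following is only a minimal sketch, assuming the GPU's connectors are exposed under /sys/class/drm (with the proprietary driver this generally requires nvidia-drm to be loaded with modesetting enabled, which is an assumption about this setup). The function name list_connectors and its optional directory argument are made up for illustration.

```shell
#!/bin/sh
# list_connectors [DRM_ROOT]
# Print one line per DRM connector: name, hotplug status, and EDID size.
# DRM_ROOT defaults to /sys/class/drm; the argument exists only so the
# function can be pointed at another directory (hypothetical helper).
list_connectors() {
    root="${1:-/sys/class/drm}"
    for dir in "$root"/card*-*; do
        # Skip the unexpanded glob when no connector directories exist.
        [ -d "$dir" ] || continue
        name=$(basename "$dir")
        # "connected" / "disconnected" / "unknown" as reported by the kernel.
        status=$(cat "$dir/status" 2>/dev/null || echo unknown)
        # An EDID of 0 bytes means the kernel did not read one from the sink.
        edid_bytes=$(cat "$dir/edid" 2>/dev/null | wc -c | tr -d ' ')
        printf '%s\t%s\tedid_bytes=%s\n' "$name" "$status" "$edid_bytes"
    done
}
```

Running list_connectors once with a dummy plug attached and once with the monitor attached should show whether the monitor fails at detection (status stays disconnected) or later (connected, but no usable EDID).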


Here are some command outputs and the Nvidia bug report (note: this is with HDMI connected to the PiKVM and DisplayPort connected to my monitor, on a fresh Ubuntu VM with the latest proprietary drivers available):

Code:
root@a:/home/a# mokutil --sb-state
SecureBoot disabled

root@a:/home/a# uname -a
Linux a 6.8.0-41-generic #41-Ubuntu SMP PREEMPT_DYNAMIC Fri Aug  2 20:41:06 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux


root@a:/home/a# inxi -SGxxxz
System:
  Kernel: 6.8.0-41-generic arch: x86_64 bits: 64 compiler: gcc v: 13.2.0
    clocksource: kvm-clock
  Desktop: GNOME v: 46.0 tk: GTK v: 3.24.41 wm: gnome-shell
    tools: gsd-screensaver-proxy dm: GDM3 v: 46.0 Distro: Ubuntu 24.04 LTS
    (Noble Numbat)
Graphics:
  Device-1: vendor: Red Hat driver: bochs-drm v: N/A ports: active: Virtual-1
    empty: none bus-ID: 00:01.0 chip-ID: 1234:1111 class-ID: 0300
  Device-2: NVIDIA AD102 [GeForce RTX 4090] driver: nouveau v: kernel
    arch: Lovelace pcie: speed: 2.5 GT/s lanes: 16 ports: active: none
    off: HDMI-A-1 empty: DP-1,DP-2,DP-3 bus-ID: 01:00.0 chip-ID: 10de:2684
    class-ID: 0300
  Display: server: X.Org v: 21.1.11 with: Xwayland v: 23.2.6
    compositor: gnome-shell driver: X: loaded: modesetting unloaded: fbdev,vesa
    dri: swrast gpu: bochs-drm,nouveau display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x286mm (20.00x11.26")
    s-diag: 583mm (22.95")
  Monitor-1: Virtual-1 res: 1920x1080 hz: 60 dpi: 150
    size: 325x203mm (12.8x7.99") diag: 383mm (15.09") modes: N/A
  API: EGL v: 1.5 hw: drv: nvidia nouveau platforms: device: 0 drv: nouveau
    device: 1 drv: swrast gbm: drv: kms_swrast surfaceless: drv: nouveau x11:
    drv: swrast inactive: wayland
  API: OpenGL v: 4.5 compat-v: 4.3 vendor: mesa v: 24.0.5-1ubuntu1
    glx-v: 1.4 direct-render: yes renderer: llvmpipe (LLVM 17.0.6 256 bits)
    device-ID: ffffffff:ffffffff


a@a:~$ lspci | grep -i vga
01:00.0 VGA compatible controller: NVIDIA Corporation AD102 [GeForce RTX 4090] (rev a1)


root@a:/home/a# lshw -c display
  *-display               
       description: VGA compatible controller
       product: AD102 [GeForce RTX 4090]
       vendor: NVIDIA Corporation
       physical id: 0
       bus info: pci@0000:01:00.0
       logical name: /dev/fb0
       version: a1
       width: 64 bits
       clock: 33MHz
       capabilities: pm msi pciexpress vga_controller bus_master cap_list rom fb
       configuration: depth=32 driver=nvidia latency=0 mode=1280x800 visual=truecolor xres=1280 yres=800
       resources: iomemory:38000-37fff iomemory:38080-3807f irq:76 memory:80000000-80ffffff memory:380000000000-3807ffffffff memory:380800000000-380801ffffff ioport:8000(size=128) memory:81080000-810fffff
  *-graphics
       product: simpledrmdrmfb
       physical id: 1
       logical name: /dev/fb0
       capabilities: fb
       configuration: depth=32 resolution=1280,800
      
root@a:/home/a# lsmod | grep nvidia
nvidia_uvm           1806336  0
nvidia_drm             94208  9
nvidia_modeset       1314816  10 nvidia_drm
nvidia              56823808  443 nvidia_uvm,nvidia_modeset
video                  73728  1 nvidia_modeset

root@a:/home/a# dmesg | grep -iE "error|fail|nvidia|amd"
[    0.000000] Linux version 6.8.0-41-generic (buildd@lcy02-amd64-100) (x86_64-linux-gnu-gcc-13 (Ubuntu 13.2.0-23ubuntu4) 13.2.0, GNU ld (GNU Binutils for Ubuntu) 2.42) #41-Ubuntu SMP PREEMPT_DYNAMIC Fri Aug  2 20:41:06 UTC 2024 (Ubuntu 6.8.0-41.41-generic 6.8.12)
[    0.000000]   AMD AuthenticAMD
[    0.003270] RAMDISK: [mem 0x740f1000-0x77f97fff]
[    0.106830] smpboot: CPU0: AMD Ryzen 9 5950X 16-Core Processor (family: 0x19, model: 0x21, stepping: 0x2)
[    0.107023] Performance Events: Fam17h+ core perfctr, AMD PMU driver.
[    1.530161] ACPI: _OSC evaluation for CPUs failed, trying _PDC
[   20.300001] shpchp 0000:05:01.0: pci_hp_register failed with error -16
[   20.300015] shpchp 0000:05:01.0: Slot initialization failed
[   20.301863] shpchp 0000:05:02.0: pci_hp_register failed with error -16
[   20.301876] shpchp 0000:05:02.0: Slot initialization failed
[   20.304057] shpchp 0000:05:03.0: pci_hp_register failed with error -16
[   20.304070] shpchp 0000:05:03.0: Slot initialization failed
[   20.306267] shpchp 0000:05:04.0: pci_hp_register failed with error -16
[   20.306281] shpchp 0000:05:04.0: Slot initialization failed
[   20.357932] amd_pstate: the _CPC object is not present in SBIOS or ACPI disabled
[   20.404832] RAS: Correctable Errors collector initialized.
[   22.239194] nvidia: loading out-of-tree module taints kernel.
[   22.239202] nvidia: module license 'NVIDIA' taints kernel.
[   22.239207] nvidia: module license taints kernel.
[   22.263763] kvm_amd: TSC scaling supported
[   22.263768] kvm_amd: Nested Virtualization enabled
[   22.263769] kvm_amd: Nested Paging enabled
[   22.263777] kvm_amd: Virtual VMLOAD VMSAVE supported
[   22.263778] kvm_amd: Virtual GIF supported
[   22.263779] kvm_amd: LBR virtualization supported
[   22.264813] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:1c.0/0000:01:00.1/sound/card1/input7
[   22.264915] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:1c.0/0000:01:00.1/sound/card1/input8
[   22.265013] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:1c.0/0000:01:00.1/sound/card1/input9
[   22.265116] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:1c.0/0000:01:00.1/sound/card1/input10
[   22.321525] nvidia-nvlink: Nvlink Core is being initialized, major device number 239
[   22.323814] nvidia 0000:01:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
[   22.375452] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  535.183.01  Sun May 12 19:39:15 UTC 2024
[   22.388423] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  535.183.01  Sun May 12 19:31:08 UTC 2024
[   22.398457] [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
[   28.298519] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 1
[   28.403355] nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint.
[   28.980714] nvidia-uvm: Loaded the UVM driver, major device number 237.
[   35.265452] Modules linked in: snd_seq_dummy snd_hrtimer nvidia_uvm(PO) qrtr binfmt_misc nls_iso8859_1 intel_rapl_msr intel_rapl_common nvidia_drm(PO) nvidia_modeset(PO) kvm_amd nvidia(PO) ccp snd_hda_codec_hdmi kvm snd_hda_intel irqbypass snd_intel_dspcfg crct10dif_pclmul snd_intel_sdw_acpi polyval_clmulni polyval_generic snd_hda_codec snd_seq_midi ghash_clmulni_intel snd_seq_midi_event snd_rawmidi snd_hda_core sha256_ssse3 snd_seq snd_hwdep sha1_ssse3 aesni_intel snd_pcm crypto_simd snd_seq_device video wmi snd_timer cryptd snd i2c_i801 soundcore lpc_ich vmgenid i2c_smbus joydev input_leds mac_hid serio_raw msr parport_pc ppdev lp parport efi_pstore nfnetlink dmi_sysfs qemu_fw_cfg ip_tables x_tables autofs4 uas usb_storage hid_generic usbhid hid crc32_pclmul ahci psmouse xhci_pci libahci xhci_pci_renesas

root@a:/home/a# xrandr -q
Screen 0: minimum 8 x 8, current 1280 x 720, maximum 32767 x 32767
HDMI-0 connected primary 1280x720+0+0 (normal left inverted right x axis y axis) 527mm x 296mm
   1280x720      60.00*+  59.94    50.00    29.97    25.00    23.98
   1920x1080     50.00    29.97    25.00    23.98
   1680x1050     59.95
   1600x900      60.00
   1440x900      59.89
   1400x1050     59.98
   1280x1024     75.02    60.02
   1280x960      60.00
   1152x864      75.00
   1024x768      70.07    60.00
   800x600       75.00    72.19    60.32    56.25
   640x480       75.00    72.81    59.94    59.93
DP-0 disconnected (normal left inverted right x axis y axis)
DP-1 disconnected (normal left inverted right x axis y axis)
DP-2 disconnected (normal left inverted right x axis y axis)
DP-3 disconnected (normal left inverted right x axis y axis)
DP-4 disconnected (normal left inverted right x axis y axis)
DP-5 disconnected (normal left inverted right x axis y axis)
None-1-1 connected (normal left inverted right x axis y axis)
   1280x800      60.00 +

Nvidia bug report log
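
One detail in the outputs above: inxi names nouveau as the driver for the 4090, while lshw and lsmod point to nvidia. A quick way to see which kernel driver is actually bound to the card is to read the driver symlink in sysfs. This is a small sketch; the PCI address 0000:01:00.0 comes from the lshw output above, while the function name bound_driver and its optional second argument are made up for illustration.

```shell
#!/bin/sh
# bound_driver [PCI_ADDR] [PCI_ROOT]
# Print the kernel driver bound to a PCI device, or "none" if unbound.
# PCI_ROOT defaults to /sys/bus/pci/devices (hypothetical helper).
bound_driver() {
    addr="${1:-0000:01:00.0}"
    root="${2:-/sys/bus/pci/devices}"
    link="$root/$addr/driver"
    if [ -L "$link" ]; then
        # The symlink target ends in the driver name, e.g. .../drivers/nvidia
        basename "$(readlink "$link")"
    else
        echo "none"
    fi
}
```

Calling bound_driver with no arguments checks the address from the lshw output; lspci -k -s 01:00.0 reports the same information as "Kernel driver in use".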

As a final note, the UEFI firmware update did apply successfully, but it changed nothing. This issue also happened once before, one week earlier. That same day, after shutting things down and powering them back up, plugging and unplugging cables, I somehow got it to work again. I then went on with my week, shut down the server on the weekend, and that was it: DisplayPort has not worked since.


If anyone has any clues I'd love to test things out, thanks!
 
Can the driver disable a cable based on something? It seems strange that it works in safe mode.
 
