vGPU NVIDIA TITAN Xp for Windows Clients

Jun 12, 2020
26
7
23
Hello,

currently I am trying to integrate two nvida titanium xp gpus on proxmox.
But I am running in to the following issue by addin the GPU incl. profile to a VM:


Error by adding the GPU to a second System :
Code:
kvm: -device vfio-pci,sysfsdev=/sys/bus/mdev/devices/00000000-0000-0000-0000-000000000100,id=hostpci0,bus=pci.0,addr=0x10: vfio 00000000-0000-0000-0000-000000000100: error getting device from group 91: Input/output error
Verify all devices in group 91 are bound to vfio-<bus> or pci-stub and not already in use
TASK ERROR: start failed: QEMU exited with code 1




Output of dmesg:
Bash:
dmesg |grep vgpu
[    9.153501] nvidia-vgpu-vfio 00000000-0000-0000-0000-000000000100: Adding to iommu group 91
[    9.153505] nvidia-vgpu-vfio 00000000-0000-0000-0000-000000000100: MDEV: group_id = 91


What I did:

- Installed the pve binary-amd64 for current version
- Installed v15.0 nVidia vGPU Driver


nano /etc/default/grub

GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on iommu=pt"
- OR -
GRUB_CMDLINE_LINUX_DEFAULT="quiet amd_iommu=on iommu=pt"

Save file and close

update-grub

-Load VFIO modules at boot-

nano /etc/modules

Code:
echo 'vfio' >> /etc/modules
echo 'vfio_iommu_type1' >> /etc/modules
echo 'vfio_pci' >> /etc/modules
echo 'vfio_virqfd' >> /etc/modules

Save file and close

Code:
echo "options kvm ignore_msrs=1" > /etc/modprobe.d/kvm.conf
echo "blacklist nouveau" >> /etc/modprobe.d/blacklist.conf

Code:
update-initramfs -u



chmod -R +x vgpu_unlock
chmod +x NVIDIA------.run

./NVIDIA------.run --dkms

nano /usr/src/nvidia-510.85.03/nvidia/os-interface.c
#include "/root/vgpu_unlock/vgpu_unlock_hooks.c"

nano /usr/src/nvidia-450.80/nvidia/nvidia.Kbuild
ldflags-y += -T /root/vgpu_unlock/kern.ld

cd vgpu_unlock-rs
cargo build --release

mkdir /etc/systemd/system/nvidia-vgpud.service.d
mkdir /etc/systemd/system/nvidia-vgpu-mgr.service.d

nano /etc/systemd/system/nvidia-vgpud.service.d/vgpu_unlock.conf

[Service]
Environment=LD_PRELOAD=/root/vgpu_unlock-rs/target/release/libvgpu_unlock_rs.so

nano /etc/systemd/system/nvidia-vgpu-mgr.service.d/vgpu_unlock.conf

[Service]
Environment=LD_PRELOAD=/root/vgpu_unlock-rs/target/release/libvgpu_unlock_rs.so

- Created a custom profile for nvidia-48

Code:
[profile.nvidia-48]
num_displays = 1
display_width = 2560
display_height = 1440
max_pixels = 3686400
cuda_enabled = 1
frl_enabled = 60
framebuffer = 3937053354
pci_id = 0x1B3011A0
pci_device_id = 0x1B30



nano /etc/pve/qemu/[VM#].conf
Code:
args: -uuid 00000000-0000-0000-0000-000000000###

Added the GPU and to the vm

1677176275948.png

Any idea how this Issue can be solved?
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!