I have RTX8000 and T40 GPUs on a server, I want to try ProxMox and move away from ESXi.
I installed ProxMox, I installed vGPU KVM. I can see nvidia-smi vgpu output, when I do this:
ls -la /sys/bus/pci/devices/0000\:02\:00.0/mdev_supported_types/
I get devices list list nvidia-264, nvidia-265 etc
Not UUIDs. So I pass this to my VM using this:
qm set 100 -hostpci0 02:00.0,mdev=nvidia-264
but when I try to start the VM I get this:
mdev instance '00000000-0000-0000-0000-000000000100' already existed, using it.
kvm: -device vfio-pci,sysfsdev=/sys/bus/pci/devices/0000:02:00.0/00000000-0000-0000-0000-000000000100,id=hostpci0,bus=pci.0,addr=0x10: vfio 00000000-0000-0000-0000-000000000100: error getting device from group 138: Connection timed out
Verify all devices in group 138 are bound to vfio-<bus> or pci-stub and not already in use
TASK ERROR: start failed: QEMU exited with code 1
Can anyone help? What am I missing here? Thanks in advance!
I installed ProxMox, I installed vGPU KVM. I can see nvidia-smi vgpu output, when I do this:
ls -la /sys/bus/pci/devices/0000\:02\:00.0/mdev_supported_types/
I get devices list list nvidia-264, nvidia-265 etc
Not UUIDs. So I pass this to my VM using this:
qm set 100 -hostpci0 02:00.0,mdev=nvidia-264
but when I try to start the VM I get this:
mdev instance '00000000-0000-0000-0000-000000000100' already existed, using it.
kvm: -device vfio-pci,sysfsdev=/sys/bus/pci/devices/0000:02:00.0/00000000-0000-0000-0000-000000000100,id=hostpci0,bus=pci.0,addr=0x10: vfio 00000000-0000-0000-0000-000000000100: error getting device from group 138: Connection timed out
Verify all devices in group 138 are bound to vfio-<bus> or pci-stub and not already in use
TASK ERROR: start failed: QEMU exited with code 1
Can anyone help? What am I missing here? Thanks in advance!