Hi, am desperate for any advice!
Have an install of pve that went up this week - 6.2.
I've followed the guides to setup gpu passthrough and have got a VM booting with my gpu passed through.
Hardware is:
intel server board s3420gp, xeon x3430, asus p106-100 gpu - is a second hand mining card which essentially is a gtx1060 6gb without display outputs (uefi compatible). The idea is that I will use this for cuda accelerated tensorflow/pytorch deep learning in an ubuntu 20.04 lts guest - i.e. I dont need this to run as a display card.
The host binds to VFIO correctly and passes through to the guest. Nouveau installs correctly automatically - but is unfortunately useless as it doesn't support cuda.
My guest dmesg after I've installed the 440 nvidia drivers from the ubuntu repository shows:
NVRM (a pci address) RmInitAdapter failed!
NVRM (a pci address) rm_init_adapter failed, device minor number 0
my nvidia-smi command returns:
no devices found
I'm thinking that this could be the linux equivalent of the code43 error in windows - but I'm very open to suggestions from people who know more than me.
my grub addons are:
GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on iommu=pt rd.driver.pre=vfio-pci pcie_acs_override=downstream,multifunction nofb nomodeset video=vesafbff,efifbff vfio_iommu_type1.allow_unsafe_interrupts=1"
not sure how much of this I need - the last line was crucial as the previous methods (documented in the guides on the proxmox website) do not function to allow unsafe interupts
my vm conf is:
args: -cpu 'host,hv_time,kvm=off,hv_vendor_id=1234567890ab' -machine type=q35,kernel_irqchip=on
bios: ovmf
bootdisk: scsi0
cores: 2
cpu: host,hidden=1,flags=+pcid
efidisk0: tensor:vm-100-disk-1,size=1M
hostpci0: 02:00,pcie=1,romfile=patch_vbios.bin
ide2: none,media=cdrom
machine: q35
memory: 6144
name: tensor
net0: virtio=BA:C7:1D:92:04:5C,bridge=vmbr0,firewall=1
numa: 0
ostype: l26
scsi0: tensor:vm-100-disk-0,size=100G
scsihw: virtio-scsi-pci
smbios1: uuid=17494963-eb27-48a1-998e-d011db8580f7
sockets: 1
vmgenid: 45a59a1f-a9f2-4cf4-b1b0-b0fb1db2fbfb
I have tried a variety of different configurations including rombar off, changing the vendor, I downloaded a bios from techpowerup which has been modded.
if anyone has any ideas - then please let me know!
Have an install of pve that went up this week - 6.2.
I've followed the guides to setup gpu passthrough and have got a VM booting with my gpu passed through.
Hardware is:
intel server board s3420gp, xeon x3430, asus p106-100 gpu - is a second hand mining card which essentially is a gtx1060 6gb without display outputs (uefi compatible). The idea is that I will use this for cuda accelerated tensorflow/pytorch deep learning in an ubuntu 20.04 lts guest - i.e. I dont need this to run as a display card.
The host binds to VFIO correctly and passes through to the guest. Nouveau installs correctly automatically - but is unfortunately useless as it doesn't support cuda.
My guest dmesg after I've installed the 440 nvidia drivers from the ubuntu repository shows:
NVRM (a pci address) RmInitAdapter failed!
NVRM (a pci address) rm_init_adapter failed, device minor number 0
my nvidia-smi command returns:
no devices found
I'm thinking that this could be the linux equivalent of the code43 error in windows - but I'm very open to suggestions from people who know more than me.
my grub addons are:
GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on iommu=pt rd.driver.pre=vfio-pci pcie_acs_override=downstream,multifunction nofb nomodeset video=vesafbff,efifbff vfio_iommu_type1.allow_unsafe_interrupts=1"
not sure how much of this I need - the last line was crucial as the previous methods (documented in the guides on the proxmox website) do not function to allow unsafe interupts
my vm conf is:
args: -cpu 'host,hv_time,kvm=off,hv_vendor_id=1234567890ab' -machine type=q35,kernel_irqchip=on
bios: ovmf
bootdisk: scsi0
cores: 2
cpu: host,hidden=1,flags=+pcid
efidisk0: tensor:vm-100-disk-1,size=1M
hostpci0: 02:00,pcie=1,romfile=patch_vbios.bin
ide2: none,media=cdrom
machine: q35
memory: 6144
name: tensor
net0: virtio=BA:C7:1D:92:04:5C,bridge=vmbr0,firewall=1
numa: 0
ostype: l26
scsi0: tensor:vm-100-disk-0,size=100G
scsihw: virtio-scsi-pci
smbios1: uuid=17494963-eb27-48a1-998e-d011db8580f7
sockets: 1
vmgenid: 45a59a1f-a9f2-4cf4-b1b0-b0fb1db2fbfb
I have tried a variety of different configurations including rombar off, changing the vendor, I downloaded a bios from techpowerup which has been modded.
if anyone has any ideas - then please let me know!
Last edited: