Hi,
we are struggeling with vgpus for vllm / pytorch inside a VM.
Setup:
We operate a node with 4x Nvidia L40S in a Proxmox Cluster (8.4.1). Driver on the host is 580.65.05 and vGPUs are setup correctly and work reliable and flawless for VDIs.
For LLM Inference we have mapped 4 vGPUs into a...