Hello! I am actively working on this for a project at my employment. There are a lot of misunderstandings in this threat. First the A100 and A30 support MIG (Multi-Instance Graphics) which is not GRID. MIG partitions the A100 and A30s in to smaller contexts. GRID uses vGPU technology that create PCI devices that can passthru to VMs. I'm not certain that MIG utilizes SR-IOV in any capacity. The instances of the graphics cards are exposed via /dev. To be honest, I don't believe GRID uses SR-IOV either, it uses software mediated devices, e.g. not hardware virtualization like the Intel XE or AMDPRO GPUs do.
I believe that MIG will not work with virtual machines, but will require containers, as mentioned in this thread. This is due to the devices not being exposed as hardware, like with GRID. GRID is a licensed feature and comes at a price.
I don't wish to encourage license circumventing software, but if you are looking for vGPU to work with Proxmox, as purely an educational endeavour, vgpu-unlock can unlock software mediated devices on consumer grade hardware. The last time I was researching this, for the project I'm currently on, I could not get an RTX3080 to work, but I was able to see a drop down list of all supported mediated devices within the Proxmox user interface. I could not get the cards to be recognized; however in the discourse of getting the A100s to be passed into my VMs I discovered that VM UEFI bios for Proxmox is bugged and I could not get the devices. I would like to circle back to this and discover whether I can get vGPU with mdevctl devices to function with SeaBIOS vs OVMF, which is what seems to be bugged. For the record, the bug is that the nvidia kmod will not recognize the card being passed in, regardless if it's a full device or a virtual GPU.
I hope to report back for anyone who is stuck on this project, or at least complete the missing components to this thread; however this will likely be a couple month's project. The answer may still be to use containers only, as outlined in the aforementioned article.
*I should clarify that the Ampere architecture does support SR-IOV; presumably also is the reason vgpu-unlock doesn't yet support Ampere devices. I was working with 1080s as well as 3080s for this.