Well, I feel like I am running in circles myself.
I think I am close now, since I no longer get any BAR: ... mem notifications.
I will share my config for you to compare, try, and evaluate. This config works well, but the VM gets stuck with a black screen. I assume I will have to try out video=vesafb:off,efifb:off and video=efifb:off,vesafb:off as well.
iommu=pt amd_iommu=forced_isolation noats nobar root=ZFS=rpool/ROOT/pve-1 boot=zfs video=efifb:off,vesafb:off hugepagesz=2M hugepages=1375 default_hugepagesz=1G hugepagesz=1G hugepages=8 text
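Since a typo in the bootloader config can silently drop these options, it may help to confirm after a reboot that both framebuffer switches actually reached the kernel. A rough sketch, not a definitive check: the helper name has_fb_off is my own invention, and it only looks for efifb:off and vesafb:off inside video= arguments of whatever command line string you hand it.

```shell
#!/bin/sh
# has_fb_off (hypothetical helper): succeeds only if the kernel command line
# passed as $1 has efifb:off and vesafb:off in its video= arguments.
has_fb_off() {
    # Split the command line into one argument per line, keep only video=...
    video_args=$(printf '%s\n' "$1" | tr ' ' '\n' | grep '^video=')
    # Both switches must be present for the check to pass.
    printf '%s\n' "$video_args" | grep -q 'efifb:off' &&
    printf '%s\n' "$video_args" | grep -q 'vesafb:off'
}

# Usage on the running system:
#   has_fb_off "$(cat /proc/cmdline)" && echo "both framebuffers are off"
```

If the check fails even though the bootloader config looks right, the usual suspect is that the bootloader was not refreshed after editing.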
/etc/modprobe.d/passthrough.conf (where nnnn stands in for the real IDs I have configured)
blacklist amdgpu
options vfio-pci ids=nnnn:nnnn,nnnn:nnnn disable_vga=1
options kvm ignore_msrs=1
/etc/modules
vfio
vfio_iommu_type1
vfio_pci
vfio_virqfd
aufs
overlay
The problem is not getting PCI-E passthrough to work; it is keeping it stable.
Initially all is well, but after a while I get the dreaded vfio_ecap_init: hiding ecap 0xffff....
Eventually, after hundreds or thousands of vfio_ecap... messages, I get Refused to change power state, currently in D3.
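To see how quickly things go downhill, the two messages above can be counted in the kernel log. This is only a sketch: count_vfio_noise is a made-up helper name, and it simply matches the message text quoted above on whatever log lines arrive on stdin.

```shell
#!/bin/sh
# count_vfio_noise (hypothetical helper): reads kernel log text on stdin and
# prints how many lines mention vfio_ecap_init and how many report the
# "Refused to change power state" error.
count_vfio_noise() {
    awk '
        /vfio_ecap_init/                { ecap++ }
        /Refused to change power state/ { d3++ }
        END { printf "ecap=%d d3=%d\n", ecap, d3 }
    '
}

# Usage on the running system:
#   dmesg | count_vfio_noise
```

Running it a few times while the VM is up should show whether the ecap count keeps climbing before the D3 message first appears.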
NOTE: after the final reboot I now appear to have working PCI-E GPU passthrough with video=vesafb:off,efifb:off and the BIOS set to UEFI only. Some suggest disabling CSM if your BIOS has that option.
dmesg | grep BAR
now shows
.... BAR 0: assigned to efifb
which I assume is a good thing, since I am no longer flooded with other BAR ... notifications.