Dear all,
I have quite some machines running with GPU passtrought but all of a sudden one system does not longer come up, and I don't get a clue what changed.
This is my usually working grub entry for a system with two identical 1080Ti cards:
All with the usual vifo modules etc. As I said, everything was working. On this particular machine I hade to update to 5.19 because of this error:
BAR 0: can't reserve [mem 0xe0000000-0xefffffff 64bit pref]"
I needed also the script from here
https://forum.proxmox.com/threads/problem-with-gpu-passthrough.55918/page-3#post-484395I
But now, the system got stuck at the very beginning:
Then I tried to remove all different parts from the grub entry. One by one. If I removed
the system booted at least until here:
If I remove everything else, the system boots again. My problem is that I do not get any error messages. I tried also different kernel versions:
All without success. Finally I setup a completely new pc with a similar GPU, this time a HP820 workstation. After I upgraded to 5.19.17 and updating to my working grub entry from above....
Again, no boot at all, everything got stuck here:
My other cluster nodes with 5.15.74 work exactly with this grub line. What is going on here?
I have quite some machines running with GPU passtrought but all of a sudden one system does not longer come up, and I don't get a clue what changed.
This is my usually working grub entry for a system with two identical 1080Ti cards:
Code:
#HPZ820
GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on pcie_acs_override=downstream,multifunction initcall_blacklist=sysfb_init video=vesa:off vfio-pci.ids=10de:1b81,10de:10f0 pcie_aspm=off vfio_iommu_type1.allow_unsafe_interrupts=1 kvm.ignore_msrs=1 modprobe.blacklist=radeon,nouveau,nvidia,nvidiafb,nvidia-gpu"
All with the usual vifo modules etc. As I said, everything was working. On this particular machine I hade to update to 5.19 because of this error:
BAR 0: can't reserve [mem 0xe0000000-0xefffffff 64bit pref]"
I needed also the script from here
Code:
echo 1 > /sys/bus/pci/devices/0000\:09\:00.0/remove
echo 1 > /sys/bus/pci/rescan
https://forum.proxmox.com/threads/problem-with-gpu-passthrough.55918/page-3#post-484395I
But now, the system got stuck at the very beginning:
Code:
Loading inital ramdisk ...
Then I tried to remove all different parts from the grub entry. One by one. If I removed
Code:
initcall_blacklist=sysfb_init
If I remove everything else, the system boots again. My problem is that I do not get any error messages. I tried also different kernel versions:
All without success. Finally I setup a completely new pc with a similar GPU, this time a HP820 workstation. After I upgraded to 5.19.17 and updating to my working grub entry from above....
Again, no boot at all, everything got stuck here:
Code:
Loading inital ramdisk ...
My other cluster nodes with 5.15.74 work exactly with this grub line. What is going on here?