Recent content by Lefuneste

  1. L

    Proxmox 9.0 Beta - kernel issues with vfio-pci on Mellanox 100G.

    I confirm a drastic change in behavior on the way VF passthrough is handled by 6.14.8-2-pve and 6.16.0-50-pve kernels when calling VF passthrough. I have made many tests changing kernel parameters but all comes down to the fact that everything runs fine with intremap=no_x2apic_optout parameter...
  2. L

    Proxmox 9.0 Beta - kernel issues with vfio-pci on Mellanox 100G.

    So the issue seems to be related to the kernel parameter intremap=no_x2apic_optout which cannot be transposed to 6.14.8-2-pve and 6.16.0-50-pve without causing the passthrough to crash the NIC when calling the VF at start of VM. This kernel parameter is required with my motherboard so it is...
  3. L

    Proxmox 9.0 Beta - kernel issues with vfio-pci on Mellanox 100G.

    Unfortunately yes. The link points to latest kernel 6.16.0-50-pve. Exact same behavior than with 6.14.8-2-pve Here is my cmdline : root=ZFS=rpool/ROOT/pve-1 boot=zfs intel_iommu=on iommu=pt intremap=no_x2apic_optout pci_pt_e820_access=on quiet net.naming-scheme=v252 On this specific server I...
  4. L

    Proxmox 9.0 Beta - kernel issues with vfio-pci on Mellanox 100G.

    Ok Not sure if this is related but I am having a similar and serious issue since I updated to PMX 9. I have drilled down the issue to the kernel version. The issue is definitely present with kernels 6.14.8-2-pve and 6.16.0-50-pve behaving exactly the same. Interestingly kernel 6.8.12-13-pve...
  5. L

    Kernel 6.8.12-12-pve Update results in VFIO PCI passthrough issues

    Yes exact same issue here once I updated to kernel 6.8.12-12. There definitely is some regression with this kernel version. Several VM on multiple servers give error message when mounting vfio devices at VM start kvm: -device...
  6. L

    [SOLVED] communication issue between SRIOV VM VF and CT on PF bridge

    Hi there I believe that your script is not working anymore. I am not expert in bash but trying to understand the cinematic of your code I believe that there has been some change in the following path : /sys/class/net/${bridge}/brif/ This path does not exist anymore on current (7.4-3) version...
  7. L

    help(?!) building OFED drivers

    Same issue here. Trying to update these old drivers (mlx4_core in my case) proves to be extremely difficult or even impossible on Proxmox 7.2-11 (kernel 5.15.60-2-pve). I have tried all sorts of tricks and tutorials, but I always have a dependency issue at minimum. The side effects of using...
  8. L

    Can't install Chelsio Unified Wire drivers on Proxmox 7

    For the benefit of others, I have managed an almost successful installation of latest Chelsio Unified Wire drivers on Proxmox 7. My config is : -HW : Chelsio T440-CR -Proxmox 7.2-4 -Kernel : 5.15.35-2 -Chelsio Unified Driver Version (INSTALLED FROM SOURCE) : 3.16.0.1 Installation script result...
  9. L

    [SOLVED] Problem with GPU Passthrough

    You can forcefully unload the BOOTFB framebuffer by using Jart proposed method. This should work fine with 5.15 kernel version. This is what I am using on 2 servers while the bug is still an issue with 5.15 kernels. Also see my post above about diagnosing if the BOOTFB lock is indeed your issue...
  10. L

    [TUTORIAL] Hardware watchdog at a per-VM level

    Thank you VERY MUCH. I just come to the stage where HA migration of VM is not stable enough within Proxmox to ensure my critical VM (docker VM and Home Assistant VM) always stay operational. This is perfect. It is working wonderfully. THIS SHOULD BE NATIVE IN PROXMOX !!!!
  11. L

    [SOLVED] Problem with GPU Passthrough

    Thank you for your post. I have just spent 2 days and 2 nights trying to eradicate this nasty BOOTFB memory lock brought by the broken 5.15 kernels. Obviously this awful regression happens (as usual) when I am in the process of installing and configuring a new GPU with pass-through, making the...
  12. L

    pveceph install fails

    Hurrah I solved my own mess by doing something NOT recommended... I basically followed the dependencies complains and forced the version of each package. Then pveceph runs untill the end and completes the process with the "ceph" package install. Not tested yet if it is running properly but at...
  13. L

    pveceph install fails

    I have a 3 node Proxmox 7.1.12, with a ceph RBD storage for HA VM. Following a violent brown out this morning due to local roadworks, despite heavy duty UPS my ceph cluster went down the drain. I could not restart the OSD, and could not recreate them. I tried to clean up the ceph cluster as I...
  14. L

    Kernel 5.13.19-4 crashes when launching VM with HBA passthrough

    I confirm that the new Kernel version seems to correct the issue. Amazing work guys. Your dedication is very much appreciated !
  15. L

    Kernel 5.13.19-4 crashes when launching VM with HBA passthrough

    Hi there Just bumped into this issue. I have been doing some diagnostics. With Kernel version Linux 5.13.19-4-pve when starting a TrueNas VM with PCIe Passthrough of a Dell HBA330, I get the following dmesg. Other VM continue to run but I cannot reboot the Host. It hangs indefinitely after...