hello community,
since several month I have the following issue and can't seem to solve it.
during coldboot I get an error that the ahci controller cannot be reset. this results in the ssd on that controller (SSD on an PCI adapter, which holds data and containers, its not root, root is on another SSD that mounts without issue) not being mounted so proxmox gets stuck during the boot.
I tried to force scripts into initramfs to reset the controller manually during boot, without success.
Adding GRUB_CMDLINE_LINUX="pci=nomsi" to GRUB also did not help
the funny thing is, after the boot gets stuck, if I CTRL ALT DEL force reboot, the system recognises the drive and boots without problem. but my goal is to be able to cold boot the server remotely, without any input
screenshot attached, I am willing to try solutions, google and chatGPT could not solve it
best regards
EDIT:
also tried to implement a reboot during init with an script IF the error message appears, it prints that the error message appears, but instead of reboot I get kernel panic which just halts everything (need to manually power of)
Maybe if someone could assist me to force reboot during into via a script, it would already solve the problem via that work around
since several month I have the following issue and can't seem to solve it.
during coldboot I get an error that the ahci controller cannot be reset. this results in the ssd on that controller (SSD on an PCI adapter, which holds data and containers, its not root, root is on another SSD that mounts without issue) not being mounted so proxmox gets stuck during the boot.
I tried to force scripts into initramfs to reset the controller manually during boot, without success.
Bash:
#!/bin/bash
# Unload AHCI module
rmmod ahci
# Wait for a moment to ensure the module is unloaded
sleep 5
# Reload AHCI module
modprobe ahci
# Continue with the boot process
exec /sbin/init
the funny thing is, after the boot gets stuck, if I CTRL ALT DEL force reboot, the system recognises the drive and boots without problem. but my goal is to be able to cold boot the server remotely, without any input
screenshot attached, I am willing to try solutions, google and chatGPT could not solve it
best regards
EDIT:
also tried to implement a reboot during init with an script IF the error message appears, it prints that the error message appears, but instead of reboot I get kernel panic which just halts everything (need to manually power of)
Maybe if someone could assist me to force reboot during into via a script, it would already solve the problem via that work around
Bash:
#!/bin/bash
# Check if the error message is present in the system logs
if dmesg | grep -q "AHCI 0000:01:00.0: controller reset failed (0xffffffff)"; then
# If the error is found, force a reboot
echo "AHCI controller reset error detected. Forcing a reboot..."
/sbin/reboot
fi
Attachments
Last edited: