Issues after upgrading to 6.17.4-1-pve

daviddanko

New Member
Apr 29, 2024
Today I upgraded the kernel to 6.17.4-1 from 6.17.2-2, and after rebooting, my server didn't come back up. Checking the logs, it seemed to be because of my media disk (the rows starting with ata5):

[screenshot: boot log with repeating ata5 errors]
When I went back to the previous kernel, everything seemed fine, but my TrueNAS instance did not boot up.

When I unplugged my media disk and plugged it back in, these errors kept repeating:

Code:
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: cmd ca/00:10:10:12:40/00:00:00:00:00/e0 tag 22 dma 8192 out
                                               res 51/04:10:10:12:40/00:00:00:00:00/e0 Emask 0x1 (device error)
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: status: { DRDY ERR }
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: error: { ABRT }
Dec 19 22:49:09 homeserver-01 kernel: ahci 10000:e0:17.0: port does not support device sleep
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: supports DRM functions and may not be fully accessible
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: failed to enable AA (error_mask=0x1)
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: supports DRM functions and may not be fully accessible
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: failed to enable AA (error_mask=0x1)
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: configured for UDMA/133 (device error ignored)
Dec 19 22:49:09 homeserver-01 kernel: ahci 10000:e0:17.0: port does not support device sleep
Dec 19 22:49:09 homeserver-01 kernel: ata5: EH complete
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: irq_stat 0x40000001
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: failed command: WRITE DMA EXT
Dec 19 22:49:09 homeserver-01 kernel: ata5.00: cmd 35/00:10:10:84:e0/00:00:e8:00:00/e0 tag 23 dma 8192 out
                                               res 51/04:10:10:84:e0/00:00:e8:00:00/e0 Emask 0x1 (device error)
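
A quick way to watch these repeat live is to follow the kernel log while re-plugging the disk; the grep pattern is just an assumption that ata5 stays the relevant port:

Code:
# follow new kernel messages and filter for the affected port
journalctl -k -f | grep -i ata5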


To me it seemed like my disk had just died. So I disabled my TrueNAS VM, detached the media disk in the VM options, and removed the disk's entry from fstab in Proxmox, but the host still doesn't boot with the latest kernel. It boots fine with 6.17.2-2, however. When I plug the disk in, the Proxmox syslogs show this:

Code:
Dec 19 23:18:27 homeserver-01 kernel: sd 4:0:0:0: [sda] tag#21 CDB: Read(10) 28 00 00 00 00 00 00 01 00 00
Dec 19 23:18:27 homeserver-01 kernel: blk_print_req_error: 2 callbacks suppressed
Dec 19 23:18:27 homeserver-01 kernel: I/O error, dev sda, sector 0 op 0x0:(READ) flags 0x0 phys_seg 32 prio class 2
Dec 19 23:18:27 homeserver-01 kernel: sd 4:0:0:0: [sda] tag#14 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s
Dec 19 23:18:27 homeserver-01 kernel: sd 4:0:0:0: [sda] tag#14 CDB: Read(10) 28 00 00 00 00 00 00 01 00 00
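
To double-check that nothing still references the disk, something like the following covers the usual spots; the VM ID 100 and the grep patterns are placeholders, not my exact setup:

Code:
# fstab entries that still mention the disk (by device, UUID or label)
grep -iE 'sda|media' /etc/fstab
# the VM config should no longer carry the detached disk
qm config 100 | grep -iE 'sata|scsi|ide'
# stable identifiers, in case something referenced the disk by-id
ls -l /dev/disk/by-id/ | grep -i sda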

Perhaps it's worth mentioning that, just before this, I also upgraded TrueNAS to 25.10.1.

Currently, even though there should be no trace of that disk, the newest kernel still doesn't boot:

[two screenshots: boot hanging on 6.17.4-1-pve]
So my questions are: did my media disk die? If so, and the new kernel was hanging because of the disk, why does the old kernel boot with the faulty disk attached? And why doesn't the new kernel boot even though I removed, I believe, every reference to that disk?
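
In the meantime, pinning the known-good kernel keeps the machine bootable across reboots; a sketch, assuming the package version string matches:

Code:
# list installed kernels and the currently selected one
proxmox-boot-tool kernel list
# pin the kernel that still boots
proxmox-boot-tool kernel pin 6.17.2-2-pve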
 
What SATA controller / HBA are you using?
I had to disable the ROMbar (and upgrade the firmware) for my controller to boot TrueNAS after upgrading from kernel 6.8.14.
 

Code:
root@homeserver-01:~# lspci -nnk | egrep -A3 -i 'sata|raid|sas|storage'
0000:00:0e.0 RAID bus controller [0104]: Intel Corporation Volume Management Device NVMe RAID Controller [8086:467f]
        Subsystem: Dell Device [1028:0be5]
        Kernel driver in use: vmd
        Kernel modules: vmd, ahci
--
10000:e0:17.0 SATA controller [0106]: Intel Corporation Alder Lake-S PCH SATA Controller [AHCI Mode] [8086:7ae2] (rev 11)
        Subsystem: Dell Device [1028:0be5]
        Kernel driver in use: ahci
        Kernel modules: ahci
So I believe I am not affected by that ROMbar issue, right?
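
Mapping which controller the media disk actually sits behind should be visible from the block-device listing; a sketch:

Code:
# HCTL and transport per block device
lsblk -o NAME,HCTL,TRAN,MODEL
# the by-path symlinks encode the PCI address of each disk's controller
ls -l /dev/disk/by-path/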
 
I can't say, as I have no experience with those controllers and no idea what devices you have connected to each. I would try disabling the ROMbar one device at a time (see the sketch below), then look into firmware updates, then roll back the kernel to the earlier version, and if it still fails, investigate further from that point. There is no simple answer to these sorts of cases; it's going to be diagnostics by trial and error.
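
If the controller is passed through to the TrueNAS VM, the ROM BAR can be switched off per PCI entry; a sketch with a placeholder VM ID and PCI address:

Code:
# rombar=0 disables loading the option ROM for that passthrough entry
qm set 100 -hostpci0 0000:00:17.0,rombar=0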

It's fairly easy to check whether the drive itself is faulty: just connect it to any other machine.
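
Before physically moving it, a SMART readout usually gives a quick verdict; assuming smartmontools is installed and the disk still enumerates as /dev/sda:

Code:
# overall health status, attributes and the device error log
smartctl -a /dev/sda
# optionally run a short self-test and re-read the results a few minutes later
smartctl -t short /dev/sda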