Trouble with kernel 6.17.4-2-pve

sobaka

Member
Aug 23, 2023
Proxmox cluster here (current version 9.1.4), running for some months. After the latest update, booting kernel 6.17.4-2-pve hung at "loading initial ramdisk". After some time the console showed "watchdog detected hard lockup on cpu 12", and that was the end.

I rebooted twice more, with the same results. Then I selected the previous kernel 6.17.2-2-pve and the machine came up and ran normally.

We have a problem.

Dell XPS 8960
Intel(R) Core(TM) i7-14700 (20 cores, 28 threads)
64 GB RAM
 
Please provide enough information to start debugging:
  • pveversion -v
  • pve-efiboot-tool kernel list
  • lsmod
  • lshw -short (if not present do: [B]apt install -y lshw[/B])
And of course the journal logs from those failed starts. Use [CODE] tags to supply said information.
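For anyone who wants to gather all of the above in one go, here is a minimal sketch. The function name `collect_debug_info` and the default output path are made up for illustration; the commands are the ones listed above plus `journalctl -k -b -1` for kernel messages from the previous boot. If the hang happens before the journal is writable, that last section will probably be empty.

```shell
#!/bin/sh
# Sketch only: collect the requested diagnostics into one text file.
# collect_debug_info and /tmp/pve-debug.txt are illustrative names.
collect_debug_info() {
    out="${1:-/tmp/pve-debug.txt}"
    : > "$out"
    # Each command is skipped with a note if the tool is not installed.
    for cmd in "pveversion -v" "pve-efiboot-tool kernel list" "lsmod" "lshw -short"; do
        tool="${cmd%% *}"
        printf '== %s ==\n' "$cmd" >> "$out"
        if command -v "$tool" >/dev/null 2>&1; then
            $cmd >> "$out" 2>&1 || true
        else
            printf '(%s not installed)\n' "$tool" >> "$out"
        fi
    done
    # Kernel messages from the previous boot, if the journal kept any.
    printf '== journalctl -k -b -1 ==\n' >> "$out"
    if command -v journalctl >/dev/null 2>&1; then
        journalctl -k -b -1 --no-pager >> "$out" 2>&1 || true
    else
        printf '(journalctl not installed)\n' >> "$out"
    fi
    # Print the path of the file that was written.
    printf '%s\n' "$out"
}
```

Run it as root with `collect_debug_info /root/pve-debug.txt` and paste the file contents inside [CODE] tags.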
 
1.
Code:
root@kharkov:~# pveversion -v
proxmox-ve: 9.1.0 (running kernel: 6.17.2-2-pve)
pve-manager: 9.1.4 (running version: 9.1.4/5ac30304265fbd8e)
proxmox-kernel-helper: 9.0.4
proxmox-kernel-6.17.4-2-pve-signed: 6.17.4-2
proxmox-kernel-6.17: 6.17.4-2
proxmox-kernel-6.17.2-2-pve-signed: 6.17.2-2
proxmox-kernel-6.14.11-5-pve-signed: 6.14.11-5
proxmox-kernel-6.14: 6.14.11-5
proxmox-kernel-6.8.12-13-pve-signed: 6.8.12-13
proxmox-kernel-6.8: 6.8.12-13
proxmox-kernel-6.8.4-2-pve-signed: 6.8.4-2
ceph: 19.2.3-pve2
ceph-fuse: 19.2.3-pve2
corosync: 3.1.9-pve2
criu: 4.1.1-1
frr-pythontools: 10.4.1-1+pve1
ifupdown2: 3.3.0-1+pmx11
ksm-control-daemon: 1.5-1
libjs-extjs: 7.0.0-5
libproxmox-acme-perl: 1.7.0
libproxmox-backup-qemu0: 2.0.1
libproxmox-rs-perl: 0.4.1
libpve-access-control: 9.0.5
libpve-apiclient-perl: 3.4.2
libpve-cluster-api-perl: 9.0.7
libpve-cluster-perl: 9.0.7
libpve-common-perl: 9.1.4
libpve-guest-common-perl: 6.0.2
libpve-http-server-perl: 6.0.5
libpve-network-perl: 1.2.4
libpve-rs-perl: 0.11.4
libpve-storage-perl: 9.1.0
libspice-server1: 0.15.2-1+b1
lvm2: 2.03.31-2+pmx1
lxc-pve: 6.0.5-3
lxcfs: 6.0.4-pve1
novnc-pve: 1.6.0-3
proxmox-backup-client: 4.1.1-1
proxmox-backup-file-restore: 4.1.1-1
proxmox-backup-restore-image: 1.0.0
proxmox-firewall: 1.2.1
proxmox-kernel-helper: 9.0.4
proxmox-mail-forward: 1.0.2
proxmox-mini-journalreader: 1.6
proxmox-offline-mirror-helper: 0.7.3
proxmox-widget-toolkit: 5.1.5
pve-cluster: 9.0.7
pve-container: 6.0.18
pve-docs: 9.1.2
pve-edk2-firmware: 4.2025.05-2
pve-esxi-import-tools: 1.0.1
pve-firewall: 6.0.4
pve-firmware: 3.17-2
pve-ha-manager: 5.1.0
pve-i18n: 3.6.6
pve-qemu-kvm: 10.1.2-5
pve-xtermjs: 5.5.0-3
qemu-server: 9.1.3
smartmontools: 7.4-pve1
spiceterm: 3.4.1
swtpm: 0.8.0+pve3
vncterm: 1.9.1
zfsutils-linux: 2.3.4-pve1

2.
Code:
root@kharkov:~# pve-efiboot-tool kernel list
Manually selected kernels:
None.

Automatically selected kernels:
6.14.11-5-pve
6.17.2-2-pve
6.17.4-2-pve

3.
Code:
root@kharkov:~# lsmod
Module                  Size  Used by
veth                   40960  0
ebtable_filter         12288  0
ebtables               45056  1 ebtable_filter
ip_set                 61440  0
ip6table_raw           12288  0
iptable_raw            12288  0
ip6table_filter        12288  0
ip6_tables             32768  2 ip6table_filter,ip6table_raw
iptable_filter         12288  0
ip6_udp_tunnel         16384  0
udp_tunnel             32768  0
nf_tables             360448  0
bonding               245760  0
tls                   147456  1 bonding
softdog                12288  2
sunrpc                786432  1
nfnetlink_log          24576  1
binfmt_misc            24576  1
xe                   3567616  0
gpu_sched              65536  1 xe
drm_gpuvm              49152  1 xe
drm_gpusvm_helper      36864  1 xe
snd_hda_codec_intelhdmi    24576  1
drm_ttm_helper         16384  1 xe
snd_hda_codec_alc882    20480  1
drm_exec               12288  2 drm_gpuvm,xe
snd_hda_codec_realtek_lib    65536  1 snd_hda_codec_alc882
drm_suballoc_helper    16384  1 xe
snd_hda_codec_generic   102400  2 snd_hda_codec_realtek_lib,snd_hda_codec_alc882
snd_hda_intel          61440  0
snd_sof_pci_intel_tgl    12288  0
snd_sof_pci_intel_cnl    20480  1 snd_sof_pci_intel_tgl
snd_sof_intel_hda_generic    36864  2 snd_sof_pci_intel_tgl,snd_sof_pci_intel_cnl
soundwire_intel        81920  1 snd_sof_intel_hda_generic
snd_sof_intel_hda_sdw_bpt    20480  1 soundwire_intel
snd_sof_intel_hda_common   180224  4 snd_sof_intel_hda_sdw_bpt,snd_sof_intel_hda_generic,snd_sof_pci_intel_tgl,snd_sof_pci_intel_cnl
snd_soc_hdac_hda       20480  1 snd_sof_intel_hda_common
snd_sof_intel_hda_mlink    45056  4 snd_sof_intel_hda_sdw_bpt,soundwire_intel,snd_sof_intel_hda_common,snd_sof_intel_hda_generic
snd_sof_intel_hda      24576  2 snd_sof_intel_hda_common,snd_sof_intel_hda_generic
snd_hda_codec_hdmi     57344  1 snd_hda_codec_intelhdmi
soundwire_cadence      49152  1 soundwire_intel
snd_sof_pci            24576  3 snd_sof_intel_hda_generic,snd_sof_pci_intel_tgl,snd_sof_pci_intel_cnl
snd_sof_xtensa_dsp     12288  1 snd_sof_intel_hda_generic
snd_sof               380928  6 snd_sof_intel_hda_sdw_bpt,snd_sof_pci,snd_sof_intel_hda_common,snd_sof_intel_hda_generic,snd_sof_intel_hda,snd_sof_pci_intel_cnl
snd_sof_utils          16384  1 snd_sof
snd_soc_acpi_intel_match   139264  3 snd_sof_intel_hda_generic,snd_sof_pci_intel_tgl,snd_sof_pci_intel_cnl
snd_soc_acpi_intel_sdca_quirks    12288  1 snd_soc_acpi_intel_match
soundwire_generic_allocation    20480  1 soundwire_intel
snd_soc_acpi           16384  2 snd_soc_acpi_intel_match,snd_sof_intel_hda_generic
soundwire_bus        1175552  3 soundwire_intel,soundwire_generic_allocation,soundwire_cadence
snd_soc_sdca           81920  2 snd_soc_acpi_intel_sdca_quirks,soundwire_bus
crc8                   12288  1 soundwire_cadence
intel_uncore_frequency    12288  0
snd_soc_avs           208896  0
intel_uncore_frequency_common    16384  1 intel_uncore_frequency
iwlmvm                753664  0
snd_soc_hda_codec      24576  1 snd_soc_avs
snd_hda_ext_core       32768  7 snd_sof_intel_hda_sdw_bpt,snd_soc_avs,snd_soc_hda_codec,snd_sof_intel_hda_common,snd_soc_hdac_hda,snd_sof_intel_hda_mlink,snd_sof_intel_hda
snd_hda_codec         196608  10 snd_hda_codec_generic,snd_soc_avs,snd_hda_codec_hdmi,snd_soc_hda_codec,snd_hda_intel,snd_hda_codec_realtek_lib,snd_soc_hdac_hda,snd_hda_codec_alc882,snd_sof_intel_hda,snd_hda_codec_intelhdmi
snd_hda_core          131072  12 snd_hda_codec_generic,snd_soc_avs,snd_hda_codec_hdmi,snd_soc_hda_codec,snd_hda_intel,snd_hda_ext_core,snd_hda_codec,snd_sof_intel_hda_common,snd_hda_codec_realtek_lib,snd_soc_hdac_hda,snd_sof_intel_hda,snd_hda_codec_intelhdmi
x86_pkg_temp_thermal    16384  0
intel_powerclamp       24576  0
coretemp               20480  0
snd_intel_dspcfg       45056  5 snd_soc_avs,snd_hda_intel,snd_sof,snd_sof_intel_hda_common,snd_sof_intel_hda_generic
mac80211             1646592  1 iwlmvm
snd_intel_sdw_acpi     16384  2 snd_intel_dspcfg,snd_sof_intel_hda_generic
snd_hwdep              24576  1 snd_hda_codec
snd_soc_core          376832  7 snd_soc_avs,snd_soc_hda_codec,soundwire_intel,snd_sof,snd_soc_sdca,snd_sof_intel_hda_common,snd_soc_hdac_hda
libarc4                12288  1 mac80211
snd_compress           32768  2 snd_soc_avs,snd_soc_core
kvm_intel             532480  18
i915                 4485120  2
ac97_bus               12288  1 snd_soc_core
snd_pcm_dmaengine      20480  1 snd_soc_core
processor_thermal_device_pci    16384  0
kvm                  1376256  9 kvm_intel
btusb                  77824  0
snd_pcm               188416  14 snd_soc_avs,snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec,soundwire_intel,snd_sof,snd_soc_sdca,snd_sof_intel_hda_common,snd_compress,snd_sof_intel_hda_generic,snd_soc_core,snd_sof_utils,snd_hda_core,snd_pcm_dmaengine
processor_thermal_device    20480  1 processor_thermal_device_pci
irqbypass              16384  1 kvm
processor_thermal_wt_hint    16384  2 processor_thermal_device_pci,processor_thermal_device
platform_temperature_control    20480  1 processor_thermal_device
processor_thermal_soc_slider    16384  1 processor_thermal_device
intel_pmc_core        135168  0
iwlwifi               585728  1 iwlmvm
snd_timer              53248  1 snd_pcm
processor_thermal_rfim    40960  1 processor_thermal_device
snd                   135168  12 snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hwdep,snd_hda_intel,snd_hda_codec,snd_sof,snd_soc_sdca,snd_timer,snd_hda_codec_realtek_lib,snd_compress,snd_soc_core,snd_pcm
drm_buddy              28672  2 xe,i915
ttm                   118784  3 drm_ttm_helper,xe,i915
btrtl                  32768  1 btusb
processor_thermal_rapl    16384  1 processor_thermal_device
pmt_telemetry          16384  1 intel_pmc_core
intel_rapl_msr         20480  0
polyval_clmulni        12288  0
pmt_discovery          16384  1 pmt_telemetry
intel_rapl_common      49152  2 intel_rapl_msr,processor_thermal_rapl
processor_thermal_wt_req    12288  1 processor_thermal_device
ghash_clmulni_intel    12288  0
btintel                69632  1 btusb
mei_hdcp               28672  0
sch_fq_codel           24576  17
aesni_intel            94208  0
btbcm                  24576  1 btusb
soundcore              16384  1 snd
pmt_class              20480  2 pmt_telemetry,pmt_discovery
intel_pmc_ssram_telemetry    16384  1 intel_pmc_core
mei_pxp                16384  0
dell_wmi               24576  0
processor_thermal_power_floor    12288  2 processor_thermal_device_pci,processor_thermal_device
drm_display_helper    274432  2 xe,i915
btmtk                  28672  1 btusb
cmdlinepart            16384  0
processor_thermal_mbox    12288  4 processor_thermal_power_floor,processor_thermal_wt_req,processor_thermal_rfim,processor_thermal_wt_hint
mei_me                 53248  2
platform_profile       20480  1 processor_thermal_soc_slider
bluetooth             995328  6 btrtl,btmtk,btintel,btbcm,btusb
mei                   172032  5 mei_hdcp,mei_pxp,mei_me
spi_nor               163840  0
cec                    94208  3 drm_display_helper,xe,i915
dell_smbios            36864  1 dell_wmi
rapl                   20480  0
intel_vsec             24576  3 intel_pmc_ssram_telemetry,pmt_telemetry,xe
dell_smm_hwmon         28672  0
mtd                    98304  3 spi_nor,cmdlinepart
dcdbas                 20480  1 dell_smbios
wmi_bmof               12288  0
acpi_pad              184320  0
rc_core                73728  2 cec
cfg80211             1343488  3 iwlmvm,iwlwifi,mac80211
intel_cstate           20480  0
dell_wmi_descriptor    20480  2 dell_wmi,dell_smbios
int3403_thermal        16384  0
int3400_thermal        24576  0
pcspkr                 12288  0
input_leds             12288  0
int340x_thermal_zone    16384  2 int3403_thermal,processor_thermal_device
acpi_thermal_rel       24576  1 int3400_thermal
mac_hid                12288  0
sparse_keymap          12288  1 dell_wmi
acpi_tad               20480  0
zfs                  6344704  6
spl                   151552  1 zfs
vhost_net              32768  7
vhost                  69632  1 vhost_net
vhost_iotlb            16384  1 vhost
tap                    28672  1 vhost_net
nvme_fabrics           36864  0
efi_pstore             12288  0
nfnetlink              20480  5 nf_tables,ip_set,nfnetlink_log
dmi_sysfs              20480  0
ip_tables              32768  2 iptable_filter,iptable_raw
x_tables               57344  7 ebtables,ip6table_filter,ip6table_raw,iptable_filter,ip6_tables,iptable_raw,ip_tables
autofs4                57344  2
btrfs                2023424  0
blake2b_generic        24576  0
xor                    24576  1 btrfs
raid6_pq              122880  1 btrfs
hid_generic            12288  0
usbkbd                 12288  0
usbhid                 69632  0
hid                   258048  3 usbhid,snd_soc_sdca,hid_generic
dm_thin_pool           94208  8
dm_persistent_data    114688  1 dm_thin_pool
dm_bio_prison          24576  1 dm_thin_pool
dm_bufio               53248  1 dm_persistent_data
nvme                   61440  2
nvme_core             229376  4 nvme,nvme_fabrics
ahci                   49152  5
nvme_keyring           20480  2 nvme_core,nvme_fabrics
nvme_auth              28672  1 nvme_core
libahci                53248  1 ahci
rtsx_pci_sdmmc         36864  0
igb                   303104  0
xhci_pci               24576  0
i2c_i801               36864  0
r8169                 135168  0
i2c_mux                12288  1 i2c_i801
intel_lpss_pci         28672  0
spi_intel_pci          12288  0
i2c_algo_bit           16384  3 igb,xe,i915
xhci_hcd              389120  1 xhci_pci
i2c_smbus              20480  1 i2c_i801
realtek                49152  1
intel_lpss             12288  1 intel_lpss_pci
spi_intel              32768  1 spi_intel_pci
rtsx_pci              139264  1 rtsx_pci_sdmmc
dca                    20480  1 igb
vmd                    24576  0
idma64                 20480  0
video                  77824  3 dell_wmi,xe,i915
wmi                    28672  6 video,dell_wmi,wmi_bmof,dell_smm_hwmon,dell_smbios,dell_wmi_descriptor
pinctrl_alderlake      32768  0

1/2
 
Code:
root@kharkov:~# lshw -short

H/W path           Device        Class          Description
===========================================================
                                 system         XPS 8960 (0BC0)
/0                               bus            09M47G
/0/0                             memory         64KiB BIOS
/0/9                             memory         64GiB System Memory
/0/9/0                           memory         32GiB DIMM Synchronous 5600 MHz (0.2 ns)
/0/9/1                           memory         32GiB DIMM Synchronous 5600 MHz (0.2 ns)
/0/16                            memory         384KiB L1 cache
/0/1                             memory         256KiB L1 cache
/0/18                            memory         16MiB L2 cache
/0/19                            memory         33MiB L3 cache
/0/1a                            memory         384KiB L1 cache
/0/2                             memory         768KiB L1 cache
/0/1c                            memory         12MiB L2 cache
/0/3                             processor      Intel(R) Core(TM) i7-14700
/0/100                           bridge         Raptor Lake-S 8+12 - Host Bridge/DRAM Controller
/0/100/2           /dev/fb0      display        Raptor Lake-S GT1 [UHD Graphics 770]
/0/100/2/0         input10       input          DP-1
/0/100/4                         generic        Raptor Lake Dynamic Platform and Thermal Framework Processor
/0/100/8                         generic        GNA Scoring Accelerator module
/0/100/e                         storage        Volume Management Device NVMe RAID Controller Intel Corporat
/0/100/14                        bus            Alder Lake-S PCH USB 3.2 Gen 2x2 XHCI Controller
/0/100/14/0        usb1          bus            xHCI Host Controller
/0/100/14/0/7                    bus            USB2.1 Hub
/0/100/14/0/7/2                  bus            USB2.0 Hub
/0/100/14/0/7/2/1  input3        input          HID 04d9:a088 Mouse
/0/100/14/0/e                    communication  AX211 Bluetooth
/0/100/14/1        usb2          bus            xHCI Host Controller
/0/100/14/1/6                    bus            USB3.1 Hub
/0/100/14.2                      memory         RAM memory
/0/100/14.3        wlo1          network        Alder Lake-S PCH CNVi WiFi
/0/100/15                        bus            Alder Lake-S PCH Serial IO I2C Controller #0
/0/100/16                        communication  Alder Lake-S PCH HECI Controller #1
/0/100/17                        generic        RST VMD Managed Controller
/0/100/1b                        bridge         Intel Corporation
/0/100/1b/0        mmc0          bus            RTS525A PCI Express Card Reader
/0/100/1c                        bridge         Alder Lake-S PCH PCI Express Root Port #1
/0/100/1c/0        enp2s0f0      network        82575EB Gigabit Network Connection
/0/100/1c/0.1      enp2s0f1      network        82575EB Gigabit Network Connection
/0/100/1c.4                      bridge         Alder Lake-S PCH PCI Express Root Port #5
/0/100/1c.4/0      enp3s0        network        Killer E3000 2.5GbE Controller
/0/100/1d                        generic        RST VMD Managed Controller
/0/100/1f                        bridge         Z690 Chipset LPC/eSPI Controller
/0/100/1f/0                      system         PnP device PNP0c02
/0/100/1f/1                      system         PnP device PNP0c02
/0/100/1f/2                      system         PnP device PNP0c02
/0/100/1f/3                      system         PnP device PNP0c02
/0/100/1f/4                      system         PnP device PNP0c02
/0/100/1f.3        card0         multimedia     Alder Lake-S HD Audio Controller
/0/100/1f.3/0      input11       input          HDA Intel PCH Mic
/0/100/1f.3/1      input12       input          HDA Intel PCH Line
/0/100/1f.3/2      input13       input          HDA Intel PCH Line Out Front
/0/100/1f.3/3      input14       input          HDA Intel PCH Line Out Surround
/0/100/1f.3/4      input15       input          HDA Intel PCH Line Out CLFE
/0/100/1f.3/5      input16       input          HDA Intel PCH Line Out Side
/0/100/1f.3/6      input17       input          HDA Intel PCH Front Headphone
/0/100/1f.3/7      input18       input          HDA Intel PCH HDMI/DP,pcm=3
/0/100/1f.3/8      input19       input          HDA Intel PCH HDMI/DP,pcm=7
/0/100/1f.3/9      input20       input          HDA Intel PCH HDMI/DP,pcm=8
/0/100/1f.3/a      input21       input          HDA Intel PCH HDMI/DP,pcm=9
/0/100/1f.4                      bus            Alder Lake-S PCH SMBus Controller
/0/100/1f.5                      bus            Alder Lake-S PCH SPI Controller
/0/4                             storage        Alder Lake-S PCH SATA Controller [AHCI Mode]
/0/5                             generic        RST VMD Managed Controller
/0/1b.4                          bridge         Alder Lake-S PCH PCI Express Root Port #21
/0/1b.4/0                        storage        WD PC SN810 / Black SN850 NVMe SSD
/0/101                           bridge         Alder Lake-S PCH PCI Express Root Port #9
/0/101/0                         storage        P3 NVMe PCIe SSD (DRAM-less)
/0/6               scsi0         storage        
/0/6/0             /dev/sda      disk           1TB ST1000NM0018-2F2
/0/6/0/1           /dev/sda1     volume         1006KiB BIOS Boot partition
/0/6/0/2                         volume         1023MiB Windows FAT volume
/0/6/0/3           /dev/sda3     volume         930GiB LVM Physical Volume
/0/6/1             /dev/sdb      disk           1TB ST1000NM0018-2F2
/1                 /dev/nvme0    storage        CT2000P3SSD8
/1/0               hwmon3        disk           NVMe disk
/1/2               /dev/ng0n1    disk           NVMe disk
/1/1               /dev/nvme0n1  volume         1863GiB NVMe disk
/2                 /dev/nvme1    storage        PC SN810 NVMe WDC 1024GB
/2/0               hwmon2        disk           NVMe disk
/2/2               /dev/ng1n1    disk           NVMe disk
/2/1               /dev/nvme1n1  volume         953GiB NVMe disk
/3                 input0        input          Sleep Button
/4                 input1        input          Power Button
/5                 input2        input          Power Button
/6                 input7        input          PC Speaker
/7                 input8        input          Dell WMI hotkeys
/8                 input9        input          Video Bus


No persistent log entries exist for the aborted boot attempts.

2/2
 

Same issue reported by several others. All (including me) use a Dell system...

Pinning the kernel to an older version fixed it for me. Instructions are in this other thread.
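The linked instructions are not reproduced here, so the following is only a sketch of how pinning is usually done with `proxmox-boot-tool kernel pin`. The helper `pick_fallback_kernel` is a made-up name: it reads the output of the `kernel list` command and prints the newest listed kernel other than the one that fails to boot. The version strings in the comments come from the first post; adjust them to your own system.

```shell
#!/bin/sh
# Sketch only: choose a fallback kernel from `kernel list` output.
# pick_fallback_kernel is an illustrative helper, not a Proxmox tool.
pick_fallback_kernel() {
    bad="$1"
    # Keep only lines that look like PVE kernel versions, drop the bad one,
    # then take the highest remaining version (sort -V is GNU coreutils).
    grep -E '^[0-9]+\.[0-9]+\.[0-9]+-[0-9]+-pve$' \
        | grep -vxF "$bad" \
        | sort -V \
        | tail -n 1
}

# Typical use, run as root on the affected host:
#   good="$(proxmox-boot-tool kernel list | pick_fallback_kernel 6.17.4-2-pve)"
#   proxmox-boot-tool kernel pin "$good"
# Once a fixed kernel is released, remove the pin again:
#   proxmox-boot-tool kernel unpin
```

With the kernel list from the first post, the helper would pick 6.17.2-2-pve, which is the kernel the OP reported as booting normally.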
 
I've got an issue where my system has crashed/rebooted since the upgrade yesterday. HP G3 Mini. Debugs attached. Not sure if related to the OP issue. I have 2 of these machines and the other server has not had any issues in the same time frame.
 
