Hi everyone,
I have a problem with a rather new build where the Proxmox node randomly crashes due to kernel panics with soft lockup errors (see attached logs).
CPU: Intel(R) Core(TM) i9-13900K
MOBO: Gigabyte Z790 UD
RAM: 128 GiB of DDR5 memory
STORAGE:
- ZFS in RAID1 for OS based on 2x 256GiB Samsung NMVE SSDs
- ZFS in RAID1 for VM storage on 2x 4TiB Samsung EVO 970
- ZFS in RAID1 for VM BACKUP storage on 2x 4TiB Samsung EVO 970
NICs: Intel card with 10gbps dual port based on the Intel X520-DA2 chip
I've also enabled ZFS caching (11GiB) by adding this line to `/etc/modprobe.d/zfs.conf`:
Here is the output of my pveversion:
Any idea how I can prevent these system crashes from happening?
Thanks in advance,
Bogdan M.
I have a problem with a rather new build where the Proxmox node randomly crashes due to kernel panics with soft lockup errors (see attached logs).
CPU: Intel(R) Core(TM) i9-13900K
MOBO: Gigabyte Z790 UD
RAM: 128 GiB of DDR5 memory
STORAGE:
- ZFS in RAID1 for OS based on 2x 256GiB Samsung NMVE SSDs
- ZFS in RAID1 for VM storage on 2x 4TiB Samsung EVO 970
- ZFS in RAID1 for VM BACKUP storage on 2x 4TiB Samsung EVO 970
NICs: Intel card with 10gbps dual port based on the Intel X520-DA2 chip
I've also enabled ZFS caching (11GiB) by adding this line to `/etc/modprobe.d/zfs.conf`:
options zfs zfs_arc_max=11811160064
Here is the output of my pveversion:
Code:
proxmox-ve: 8.0.1 (running kernel: 6.2.16-3-pve)
pve-manager: 8.0.3 (running version: 8.0.3/bbf3993334bfa916)
pve-kernel-6.2: 8.0.2
pve-kernel-6.2.16-3-pve: 6.2.16-3
ceph-fuse: 17.2.6-pve1+3
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown2: 3.2.0-1+pmx2
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-3
libknet1: 1.25-pve1
libproxmox-acme-perl: 1.4.6
libproxmox-backup-qemu0: 1.4.0
libproxmox-rs-perl: 0.3.0
libpve-access-control: 8.0.3
libpve-apiclient-perl: 3.3.1
libpve-common-perl: 8.0.5
libpve-guest-common-perl: 5.0.3
libpve-http-server-perl: 5.0.3
libpve-rs-perl: 0.8.3
libpve-storage-perl: 8.0.1
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 5.0.2-4
lxcfs: 5.0.3-pve3
novnc-pve: 1.4.0-2
proxmox-backup-client: 2.99.0-1
proxmox-backup-file-restore: 2.99.0-1
proxmox-kernel-helper: 8.0.2
proxmox-mail-forward: 0.1.1-1
proxmox-mini-journalreader: 1.4.0
proxmox-widget-toolkit: 4.0.5
pve-cluster: 8.0.1
pve-container: 5.0.3
pve-docs: 8.0.3
pve-edk2-firmware: 3.20230228-4
pve-firewall: 5.0.2
pve-firmware: 3.7-1
pve-ha-manager: 4.0.2
pve-i18n: 3.0.4
pve-qemu-kvm: 8.0.2-3
pve-xtermjs: 4.16.0-3
qemu-server: 8.0.6
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.1.12-pve1
Any idea how I can prevent these system crashes from happening?
Thanks in advance,
Bogdan M.