Hello
I have 2 identical Hetzner root servers (128GB RAM) in a cluster and one of them (the master) is crashing every 2-3 days, the other one is fine for 9 days. Both have a handful of Linux VMs, nothing special just webservers.
journalctl doesn't show something special.
# uname -a
Linux n1 6.8.4-3-pve #1 SMP PREEMPT_DYNAMIC PMX 6.8.4-3 (2024-05-02T11:55Z) x86_64 GNU/Linux
root@n1:/var/crash# cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-6.8.4-3-pve root=ZFS=/ROOT/pve-1 ro root=ZFS=rpool/ROOT/pve-1 boot=zfs quiet crashkernel=384M-:128M
I tried to enable kdump but not sure if I should follow https://www.cyberciti.biz/faq/how-to-on-enable-kernel-crash-dump-on-debian-linux/ or anything special for pve kernel?
thanks
Patrick
I have 2 identical Hetzner root servers (128GB RAM) in a cluster and one of them (the master) is crashing every 2-3 days, the other one is fine for 9 days. Both have a handful of Linux VMs, nothing special just webservers.
journalctl doesn't show something special.
# uname -a
Linux n1 6.8.4-3-pve #1 SMP PREEMPT_DYNAMIC PMX 6.8.4-3 (2024-05-02T11:55Z) x86_64 GNU/Linux
root@n1:/var/crash# cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-6.8.4-3-pve root=ZFS=/ROOT/pve-1 ro root=ZFS=rpool/ROOT/pve-1 boot=zfs quiet crashkernel=384M-:128M
I tried to enable kdump but not sure if I should follow https://www.cyberciti.biz/faq/how-to-on-enable-kernel-crash-dump-on-debian-linux/ or anything special for pve kernel?
thanks
Patrick