Hi,
I'm having a problem with my Proxmox server crashing intermittently (no CLI is shown, just the kernel log), anywhere from 90 minutes to 24 hours after a reboot.
I've checked the logs in /var/log and have not found anything that stands out in the lead-up to a crash. The first crash happened after the server had been stable and running for 6 days; no updates were applied and no configuration was changed during this time.
I am going to run a memtest tomorrow to check the RAM for faults, and I've already checked that the CPU temps look normal. Other than that, does anybody have any ideas as to what could be causing this (or any other logs/files to look at that could help find the cause)?
syslog at the moment of a crash (the white section with symbols appears to be corruption in the log, perhaps from the crash itself?):
kern.log (the only entries in this log before the crash are from when the scheduled daily backups ran at 5am. The server crashed at around 2:40pm later that day and stayed down until somebody could reboot it at around 5:15pm):
The package versions being run:
proxmox-ve: 6.1-2 (running kernel: 5.3.13-1-pve)
pve-manager: 6.1-5 (running version: 6.1-5/9bf06119)
pve-kernel-5.3: 6.1-1
pve-kernel-helper: 6.1-1
pve-kernel-5.0: 6.0-11
pve-kernel-5.3.13-1-pve: 5.3.13-1
pve-kernel-5.3.10-1-pve: 5.3.10-1
pve-kernel-5.0.21-5-pve: 5.0.21-10
pve-kernel-5.0.21-3-pve: 5.0.21-7
pve-kernel-5.0.21-2-pve: 5.0.21-7
pve-kernel-5.0.15-1-pve: 5.0.15-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.0.2-pve4
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.13-pve1
libpve-access-control: 6.0-5
libpve-apiclient-perl: 3.0-2
libpve-common-perl: 6.0-9
libpve-guest-common-perl: 3.0-3
libpve-http-server-perl: 3.0-3
libpve-storage-perl: 6.1-3
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve3
lxc-pve: 3.2.1-1
lxcfs: 3.0.3-pve60
novnc-pve: 1.1.0-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.1-1
pve-cluster: 6.1-2
pve-container: 3.0-15
pve-docs: 6.1-3
pve-edk2-firmware: 2.20191127-1
pve-firewall: 4.0-9
pve-firmware: 3.0-4
pve-ha-manager: 3.0-8
pve-i18n: 2.0-3
pve-qemu-kvm: 4.1.1-2
pve-xtermjs: 3.13.2-1
qemu-server: 6.1-4
smartmontools: 7.0-pve2
spiceterm: 3.1-1
vncterm: 1.6-1
zfsutils-linux: 0.8.2-pve2