random server restart

juroo17

New Member
May 31, 2023
3
0
1
Hi, I have a problem with random server restarts. It restarts really randomly and I don't see much in the log.

Sometimes the server works for 6 days, sometimes it restarts 5-6 times a day.

Here are my logs

Code:
proxmox-ve: 8.0.2 (running kernel: 6.2.16-18-pve)
pve-manager: 8.0.4 (running version: 8.0.4/d258a813cfa6b390)
pve-kernel-6.2: 8.0.5
proxmox-kernel-helper: 8.0.3
proxmox-kernel-6.2.16-18-pve: 6.2.16-18
proxmox-kernel-6.2: 6.2.16-18
proxmox-kernel-6.2.16-15-pve: 6.2.16-15
proxmox-kernel-6.2.16-14-pve: 6.2.16-14
proxmox-kernel-6.2.16-12-pve: 6.2.16-12
pve-kernel-6.2.16-3-pve: 6.2.16-3
ceph: 17.2.6-pve1+3
ceph-fuse: 17.2.6-pve1+3
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown2: 3.2.0-1+pmx5
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-4
libknet1: 1.28-pve1
libproxmox-acme-perl: 1.4.6
libproxmox-backup-qemu0: 1.4.0
libproxmox-rs-perl: 0.3.1
libpve-access-control: 8.0.5
libpve-apiclient-perl: 3.3.1
libpve-common-perl: 8.0.9
libpve-guest-common-perl: 5.0.5
libpve-http-server-perl: 5.0.4
libpve-rs-perl: 0.8.5
libpve-storage-perl: 8.0.2
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 5.0.2-4
lxcfs: 5.0.3-pve3
novnc-pve: 1.4.0-2
proxmox-backup-client: 3.0.4-1
proxmox-backup-file-restore: 3.0.4-1
proxmox-kernel-helper: 8.0.3
proxmox-mail-forward: 0.2.0
proxmox-mini-journalreader: 1.4.0
proxmox-widget-toolkit: 4.0.9
pve-cluster: 8.0.4
pve-container: 5.0.4
pve-docs: 8.0.5
pve-edk2-firmware: 3.20230228-4
pve-firewall: 5.0.3
pve-firmware: 3.8-3
pve-ha-manager: 4.0.2
pve-i18n: 3.0.7
pve-qemu-kvm: 8.0.2-7
pve-xtermjs: 4.16.0-3
qemu-server: 8.0.7
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.1.13-pve1
 

Attachments

In general, when having problems with system stability, it's a good idea to run memtest86+ overnight so you know if it is caused by bad RAM.

By the way...both sda and sdb got some bad sectors. You should check them with smartmontools.
 
Hello! In my case was a memory problem when trying to auto-update itself.
I had to disable the updates from the nodes and everything worked perfectly.
 
In general, when having problems with system stability, it's a good idea to run memtest86+ overnight so you know if it is caused by bad RAM.

By the way...both sda and sdb got some bad sectors. You should check them with smartmontools.
Hi, thaks for your advice. I run the memtest86+ for 9 hours and it seems like everything is ok
 

Attachments

  • memtest86+.jpg
    memtest86+.jpg
    147.9 KB · Views: 4
Hello! In my case was a memory problem when trying to auto-update itself.
I had to disable the updates from the nodes and everything worked perfectly.
Hi, i am not really sure how to do that. Can you explain me closely what you mean by that or where can i do that ?