Proxmox server randomly goes down

almere

New Member
Mar 27, 2018
1
1
3
31
Hello.

I have a PVE with local storage and basic VM's, nothing special. Randomly the host node "goes down" with all virtual machines. I'm not able to find anything in syslog / dmesg. With goes down I mean: no web interface, no reaction to network ping / SSH. Only hardware reboot helps.

Here is some more information:

Code:
pveversion -v
proxmox-ve: 5.1-42 (running kernel: 4.13.16-1-pve)
pve-manager: 5.1-46 (running version: 5.1-46/ae8241d4)
pve-kernel-4.13: 5.1-43
pve-kernel-4.13.16-1-pve: 4.13.16-43
pve-kernel-4.13.13-6-pve: 4.13.13-42
pve-kernel-4.10.17-2-pve: 4.10.17-20
corosync: 2.4.2-pve3
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: not correctly installed
libjs-extjs: 6.0.1-2
libpve-access-control: 5.0-8
libpve-common-perl: 5.0-28
libpve-guest-common-perl: 2.0-14
libpve-http-server-perl: 2.0-8
libpve-storage-perl: 5.0-17
libqb0: 1.0.1-1
lvm2: 2.02.168-pve6
lxc-pve: 2.1.1-3
lxcfs: 2.0.8-2
novnc-pve: 0.6-4
proxmox-widget-toolkit: 1.0-11
pve-cluster: 5.0-20
pve-container: 2.0-19
pve-docs: 5.1-16
pve-firewall: 3.0-5
pve-firmware: 2.0-4
pve-ha-manager: 2.0-5
pve-i18n: 1.0-4
pve-libspice-server1: 0.12.8-3
pve-qemu-kvm: 2.9.1-9
pve-xtermjs: 1.0-2
qemu-server: 5.0-22
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3

Latest records from syslog before freez / crash:
Code:
Mar 27 01:13:02 vps1 pvedaemon[12004]: <root@pam> successful auth for user 'root@pam'
Mar 27 01:13:03 vps1 pvedaemon[5681]: <root@pam> successful auth for user 'root@pam'
Mar 27 01:13:03 vps1 pvedaemon[6442]: <root@pam> successful auth for user 'root@pam'
Mar 27 01:13:03 vps1 pvedaemon[12004]: <root@pam> successful auth for user 'root@pam'


Any ideas about it? Where / what to check?

Thank's
 
  • Like
Reactions: chrone
I - and some others in this forum - also suffer from random reboots /restarts / crashes.
Maybe post if you are using ZFS, what CPU is in your host (meltdown..), RAM, etc.
 
Hello.

I have a PVE with local storage and basic VM's, nothing special. Randomly the host node "goes down" with all virtual machines. I'm not able to find anything in syslog / dmesg. With goes down I mean: no web interface, no reaction to network ping / SSH. Only hardware reboot helps.

Here is some more information:

Code:
ksm-control-daemon: not correctly installed


Any ideas about it? Where / what to check?

Thank's
Hi,
there are many posts like this - try the search function.
Often it's an issue of following: Heat/Ram/Bios/Power Supply

But one thing is strange - why is your ksm-control-daemon not correctly installed??

What happens is you install ksm again?
Code:
apt install ksm-control-daemon
Udo
 
  • Like
Reactions: chrone