proxmox 1.8 randomly freeze

lozair

Member
Nov 4, 2008
89
0
6
Hi,
We use here a 7 node proxmox cluster.
We use only kvm virtual machine on dell m610 servers.
We started this cluster two years ago and all worked perfectly.
Since two weeks, nodes freeze randomly.
The only solution was to reset the server....
Reading the forum i saw that was frequently due to bad memory hardware.
We have tested all ours servers with memtest and all is fine.
Searching in log files "/var/log/" there is any information about the problem source....

Anyone encountered this problem..
I don't know how to get more info in proxmox in order to resolve this issue.

Any advice will be greatly apprecated

regards
 
The output of the pveversion :

# pveversion --verbose
pve-manager: 1.8-18 (pve-manager/1.8/6070)
running kernel: 2.6.32-4-pve
proxmox-ve-2.6.32: 1.8-33
pve-kernel-2.6.32-4-pve: 2.6.32-33
qemu-server: 1.1-30
pve-firmware: 1.0-11
libpve-storage-perl: 1.0-17
vncterm: 0.9-2
vzctl: 3.0.28-1pve1
vzdump: 1.2-14
vzprocps: 2.0.11-2
vzquota: 3.0.11-1
pve-qemu-kvm: 0.14.1-1
ksm-control-daemon: 1.0-6
 
you are using a quite old packages, update to latest stable with 'aptitude update && aptitude full-upgrade' and lets see if you got any changes.
 
I wil update our servers.
I always ask the same question :)
Can i update from 1.8 to 1.9 each node and migrate kvm vm runnning on 1.8 to 1.9 in order to migrate smoothly.
We have now 70 vm in production...
Thks for your help
 
unfortunately no, you can't :-(

all nodes in a cluster must have same pve version, so you can't migrate smoothly...
you can backup on 1.8 and restore on 1.9, or "just upgrade all nodes (tm)" which should be better tried on a test cluster before...

Marco
 
there are several ways, depends on your setup and guest systems. most times it just works - update all nodes and reboot to get the new kernel. or do it node by node.

and yes, a testsystem is a good idea.