can not start vz containers after upgrade

cschnee

New Member
Sep 18, 2009
11
0
1
Hello,
I have 10 nodes in 2 clusters, the larger group of them has been running "aptitude update; aptitude upgrade" yesterday and I received a few pve related updates on all those machines.

Since then I can't start or reboot any openvz containers, the only hint I get is that when starting them from cmd-line with --verbose I get:

vzctl --verbose start 107
Starting container ...
Container is unmounted
Container is mounted
Adding IP address(es): xxx.xxx.xxx.9
vps-net_add WARNING: Device list is empty
vps-net_add WARNING: Device list is empty
Running container script: /etc/vz/dists/scripts/debian-add_ip.sh
Setting CPU units: 1000
Setting CPUs: 1
Configuring meminfo: 262144
Set hostname: nagios.tisnet.ch
Running container script: /etc/vz/dists/scripts/debian-set_hostname.sh
Running container script: /etc/vz/dists/scripts/set_dns.sh
File resolv.conf was modified
Setting quota ugidlimit: 0
Running container script: /etc/vz/dists/scripts/set_ugid_quota.sh
Container start in progress...

The "Device list empty" above triggered my interest and I found that the containers are reachable from the node itself but not from network. So I added manually the hardware address of the node with arp -s on another node and then I could ping the container from the node where I added the mac manually.

I verified that on the cluster that hasn't updated yet everything (create, restart, migrate) works perfect.

I did try with 2.6.18-2, 2.16.18-3 and 2.6.24-10 : no difference.

Does anybody else experience similar problems ?


EDIT: If I use /usr/lib/vzctl/scripts/vps-functions from the not-yet-upgraded install I get a few errors when starting the container but things start working again, this file seems borked in the update that is currently available.
 
Last edited:
I assume you working with the test repository (pvetest)?

if yes, downgrade the vzctl package.

Code:
wget ftp://download.proxmox.com/debian/dists/lenny/pve/binary-amd64/vzctl_3.0.23-1pve8_amd64.deb

and

dpkg -i vzctl_3.0.23-1pve8_amd64.deb
 
tom,

Thank you! It seems that did the trick, after downgrading vzctl I was able to restart containers. Didn't realize I was using the test repo, but you are completely right.

I am unsure about why I added the test repo to all nodes, but I guess now I can't just remove the line in sources.lst and hope things will be fine then, right ?
Is there any chance for me to revert to stable without reinstalling the nodes ?

Cheers,
Christoph
 
change pvetest to pve in the sources list.

and post the output of 'pveversion -v' so we can see which packages needs to be downgraded (like vzctl).
 
I have commented the pvetest repo in sources.lst already. After downgrading vzctl I've got the following pveversion -v output:

Code:
pve-manager: 1.5-9 (pve-manager/1.5/4728)
running kernel: 2.6.24-10-pve
proxmox-ve-2.6.18: 1.5-6
pve-kernel-2.6.18-3-pve: 2.6.18-6
pve-kernel-2.6.24-10-pve: 2.6.24-21
pve-kernel-2.6.18-2-pve: 2.6.18-5
qemu-server: 1.1-14
pve-firmware: 1.0-4
libpve-storage-perl: 1.0-11
vncterm: 0.9-2
vzctl: 3.0.23-1pve8
vzdump: 1.2-5
vzprocps: 2.0.11-1dso2
vzquota: 3.0.11-1
pve-qemu-kvm-2.6.18: 0.9.1-6
Cheers,
Christoph
 
there is still one major issue.

you are running a 2.6.24 kernel (running kernel: 2.6.24-10-pve) but you installed the pve-qemu-kvm package from the 2.6.18 branch.#
see also http://pve.proxmox.com/wiki/Proxmox_VE_Kernel

if you want to run the 2.6.24 kernel branch, run:
Code:
apt-get install proxmox-ve-2.6.24
 
Thanks, this is only from my testing while in trouble with vzctl and I had no chance to reboot the node since then.
While I was unable to start new containers I sorta panicked and installed the 2.6.24 kernel to see if it helps - at the same time I found the problem with vzctl. Actually this node should run 2.6.18-x because it has only openvz containers.
Therefor I guess the route for me should be to reboot back into 2.6.18 and then I would "aptitude purge pve-kernel-2.6.24-10-pve" ?

I was in a brave mood this morning and tried to "aptitude update; aptitude upgrade" on the mentioned node to see if any packages would get downgraded - however none was proposed to downgrade. Am I right supposing that once the stable repo gets updates that have a higher release than the testing I have installed right now I will be switched back to stable more or less automatically ?

I am still wondering why I decided to add the pvetest repo to this productions nodes - but well... ;-P
Thanks for helping me out of this unfortunate situation.
 
Thanks again, I was able to downgrade all 2.6.18 nodes so far, I was also able to remove 2.6.18-3 on all nodes that received it (the pure vz container nodes). On the 2.6.24 nodes replacing vzctl was enough as it seems to get back on track (stable branch).

I highly appreciated your kind and quick help!



Cheers,
Christoph
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!