Online migration fails with local CT using autofs inside

x3w

Online migration fails for a CT on local storage that uses NFS and autofs inside.
If I stop autofs inside the CT, online migration works OK (with independent NFS mounts inside).
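The workaround described above (stop autofs, then migrate) could be scripted roughly as follows. This is only a sketch under assumptions that are not in the post: the CT is assumed to ship a SysV init script at /etc/init.d/autofs, the migration is assumed to be started with `pvectl migrate`, and the function name and the `RUN` variable are made up for illustration. `RUN` defaults to `echo`, so the commands are only printed.

```shell
#!/bin/sh
# Sketch only: stop autofs in the CT, migrate online, restart autofs.
# Assumptions: /etc/init.d/autofs exists inside the CT, and the migration
# is triggered with `pvectl migrate`. Names here are hypothetical.
# RUN defaults to "echo" (dry run); call with RUN= (empty) to really execute.
RUN="${RUN-echo}"

migrate_ct_without_autofs() {
    ctid="$1"     # container ID, e.g. 111
    target="$2"   # target node name, e.g. pmx185

    # 1. Stop autofs inside the container (run on the source node).
    $RUN vzctl exec "$ctid" /etc/init.d/autofs stop || return 1

    # 2. Online migration; if it fails, bring autofs back up locally.
    $RUN pvectl migrate "$ctid" "$target" -online || {
        $RUN vzctl exec "$ctid" /etc/init.d/autofs start
        return 1
    }

    # 3. The CT now runs on the target node, so restart autofs there.
    $RUN ssh "root@$target" vzctl exec "$ctid" /etc/init.d/autofs start
}
```

Calling `migrate_ct_without_autofs 111 pmx185` prints the three commands. Note that autofs-managed mounts are unavailable inside the CT for the duration of the migration, so this only hides the checkpointing problem rather than fixing it.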

Please help me find a solution (this setup goes into production soon)!
Many thanks


The log:
Code:
Oct 25 00:23:12 starting migration of CT 111 to node 'pmx185' (192.168.1.185)
Oct 25 00:23:12 container is running - using online migration
Oct 25 00:23:12 starting rsync phase 1
Oct 25 00:23:12 # /usr/bin/rsync -aHAX --delete --numeric-ids --sparse /var/lib/vz/private/111 root@192.168.1.185:/var/lib/vz/private
Oct 25 00:23:25 start live migration - suspending container
Oct 25 00:23:25 # vzctl --skiplock chkpnt 111 --suspend
Oct 25 00:23:25 Setting up checkpoint...
Oct 25 00:23:25         suspend...
Oct 25 00:23:25 Can not suspend container: No such process
Oct 25 00:23:25 Error: autofs pipe is not attached (-3)
Oct 25 00:23:25 ERROR: Failed to suspend container: Checkpointing failed
Oct 25 00:23:25 aborting phase 1 - cleanup resources
Oct 25 00:23:25 removing copied files on target node
Oct 25 00:23:26 start final cleanup
Oct 25 00:23:26 ERROR: migration aborted (duration 00:00:14): Failed to suspend container: Checkpointing failed
TASK ERROR: migration aborted



lsof | grep nfs (from CT0):
Code:
automount 589180     root  mem       REG              253,2             4940492 /var/lib/vz/root/111/usr/lib/autofs/mount_nfs.so (path dev=0,26)
automount 589255     root  mem       REG              253,2             4940492 /var/lib/vz/root/111/usr/lib/autofs/mount_nfs.so (path dev=0,26)
automount 589255     root    3u     FIFO               0,26      0t0    5013917 /var/lib/vz/root/111/var/run/autofs.fifo-nfs-data1
automount 589255     root    6r      DIR               0,37        0   10450767 /var/lib/vz/root/111/nfs/data1
automount 589331     root  mem       REG              253,2             4940492 /var/lib/vz/root/111/usr/lib/autofs/mount_nfs.so (path dev=0,26)
automount 589331     root    3u     FIFO               0,26      0t0    5013918 /var/lib/vz/root/111/var/run/autofs.fifo-nfs-data2
automount 589331     root    6r      DIR               0,38        0   10450827 /var/lib/vz/root/111/nfs/data2
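To pick out just the automount FIFOs of one container from output like the above, a small filter along these lines might help. It is a sketch, not a Proxmox or OpenVZ tool: the function name `automount_fifos` is hypothetical, and it assumes the default `lsof` column layout shown here (command in column 1, PID in column 2, type in column 5, path last) and the /var/lib/vz/root/<CTID>/ prefix seen in the listing.

```shell
#!/bin/sh
# Sketch only: read `lsof` output on stdin and print "PID FIFO-path" for
# every automount FIFO located under the given container's root directory.
# Assumes the default lsof columns: COMMAND PID USER FD TYPE ... NAME.
automount_fifos() {
    ctid="$1"   # container ID, e.g. 111
    awk -v root="/var/lib/vz/root/$ctid/" '
        $1 == "automount" && $5 == "FIFO" && index($NF, root) == 1 {
            print $2, $NF
        }'
}
```

Usage from CT0 would be `lsof | automount_fifos 111`.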


cat /etc/modules (on CT0):
Code:
# /etc/modules: kernel modules to load at boot time.
#
# This file contains the names of kernel modules that should be loaded
# at boot time, one per line. Lines beginning with "#" are ignored.
# Parameters can be specified after the module name.

autofs4

My pveversion -v:
Code:
pve-manager: 2.1-14 (pve-manager/2.1/f32f3f46)
running kernel: 2.6.32-12-pve
proxmox-ve-2.6.32: 2.1-68
pve-kernel-2.6.32-11-pve: 2.6.32-66
pve-kernel-2.6.32-12-pve: 2.6.32-68
lvm2: 2.02.95-1pve2
clvm: 2.02.95-1pve2
corosync-pve: 1.4.3-1
openais-pve: 1.1.4-2
libqb: 0.10.1-2
redhat-cluster-pve: 3.1.92-3
resource-agents-pve: 3.9.2-3
fence-agents-pve: 3.1.8-1
pve-cluster: 1.0-27
qemu-server: 2.0-49
pve-firmware: 1.0-18
libpve-common-perl: 1.0-30
libpve-access-control: 1.0-24
libpve-storage-perl: 2.0-31
vncterm: 1.0-3
vzctl: 3.0.30-2pve5
vzprocps: 2.0.11-2
vzquota: 3.0.12-3
pve-qemu-kvm: 1.1-8
ksm-control-daemon: 1.1-1
 
No idea if it helps, but you are running quite an outdated version; upgrade to the latest 2.2.
 
Is this a Proxmox problem or rather an OpenVZ problem?
It would be very useful to have autofs working well in a CT (NFS works like a charm, so why not autofs?).

The problem still exists in PVE 3.1.
Also, there is a problem when stopping a CT that uses autofs inside (and runs on the node's local filesystem): I had to reboot the node.
That makes autofs in a CT very difficult to use in production :)

Thanks for any help

# pveversion -v
proxmox-ve-2.6.32: 3.1-109 (running kernel: 2.6.32-23-pve)
pve-manager: 3.1-3 (running version: 3.1-3/dc0e9b0e)
pve-kernel-2.6.32-23-pve: 2.6.32-109
lvm2: 2.02.98-pve4
clvm: 2.02.98-pve4
corosync-pve: 1.4.5-1
openais-pve: 1.1.4-3
libqb0: 0.11.1-2
redhat-cluster-pve: 3.2.0-2
resource-agents-pve: 3.9.2-4
fence-agents-pve: 4.0.0-1
pve-cluster: 3.0-7
qemu-server: 3.1-1
pve-firmware: 1.0-23
libpve-common-perl: 3.0-6
libpve-access-control: 3.0-6
libpve-storage-perl: 3.0-10
pve-libspice-server1: 0.12.4-1
vncterm: 1.1-4
vzctl: 4.0-1pve3
vzprocps: 2.0.11-2
vzquota: 3.1-2
pve-qemu-kvm: 1.4-17
ksm-control-daemon: 1.1-1
glusterfs-client: 3.4.0-2

+ the autofs4 module
 
