I'm starting a new thread for the discussion started in http://proxmox.com/forum/showthread.php?p=16824#post16824.
Yesterday, an update of the 2.6.18 PVE kernel from pvetest brought down our entire cluster within minutes; all the hosts were locked with a kernel panic apparently related to NFSv4. A screenshot of the panic can be seen here: http://proxmox.com/forum/attachment.php?attachmentid=158&d=1263407893.
The panic is very easy to reproduce: it happens a few seconds after starting a container that uses our NFSv4 setup.
Our mount and umount scripts for the container are listed below, in that order.
vzmount is a very useful mount wrapper that permits using NFSv4 in the containers and also allows suspension/restoration. It can be found on the OpenVZ forum: http://forum.openvz.org/index.php?t=msg&goto=8357&&srch=vzmount#msg_8357.
Code:
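#!/bin/sh
# Mount script: load the global OpenVZ settings and this container's config.
. /etc/vz/vz.conf
. "${VE_CONFFILE}"
# Try up to ten times to get the NFSv4 share mounted on the host.
for attempt in 0 1 2 3 4 5 6 7 8 9; do
    if mountpoint /export/home2 > /dev/null; then
        # The host mount is up: expose it inside the container and finish.
        vzmount /export/home2 "${VE_ROOT}/export/home2"
        exit 0
    else
        # Not mounted yet: try to mount it, waiting a minute after a failure.
        mount /export/home2 || sleep 60
    fi
done
# Every attempt failed: notify the administrator and report the error.
mail -s "vzmount: mount failed @ $(hostname)" email@address << EOF
Mounting ${VE_ROOT}/export/home2 on $(hostname) failed.
EOF
exit 1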
Code:
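#!/bin/sh
# Umount script: detach the share from the container, then from the host.
. /etc/vz/vz.conf
. "${VE_CONFFILE}"
# Errors are ignored so that this script always exits successfully.
vzumount "${VE_ROOT}/export/home2" || true
umount /export/home2 2>/dev/null || true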
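
For anyone who wants to reuse the retry logic from the mount script, here is a rough sketch of how it could be factored into a small function. The name retry and its arguments are my own invention for illustration and are not part of vzmount or OpenVZ. It also differs slightly from the loop above: it does not sleep after the last failed attempt.

```shell
#!/bin/sh
# retry MAX DELAY COMMAND [ARGS...]
# Hypothetical helper (an assumption, not taken from the scripts above):
# runs COMMAND up to MAX times and sleeps DELAY seconds between failures.
# Plain POSIX sh has no local variables, so max, delay and n are global.
retry() {
    max=$1
    delay=$2
    shift 2
    n=0
    while [ "$n" -lt "$max" ]; do
        # Stop as soon as the command succeeds.
        if "$@"; then
            return 0
        fi
        n=$((n + 1))
        # Wait before the next attempt, but not after the final failure.
        if [ "$n" -lt "$max" ]; then
            sleep "$delay"
        fi
    done
    return 1
}
```

With this in place, the host-side part of the mount script could probably be reduced to something like: mountpoint /export/home2 > /dev/null || retry 10 60 mount /export/home2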