Cannot boot after proxmox 4.2 install

Von Paul

New Member
Apr 29, 2016
12
1
3
49
Hello;

After installing proxmox via
apt-get install proxmox-ve ntp ssh postfix ksm-control-daemon open-iscsi systemd-sysv

the debian 8.6 OS does not boot after initial reboot immediately following the proxmox install. This has occurred 2 times on the same server. The server is a new supermicro board with two 3TB drives configured with RAID 1 and 1 LVGroup having 3 logical volumes.

The only thing I have done so far is to modify fstab, removing the UID for MD0 /boot and replacing it with /dev/md0 as presented below. As well, I ran update-grup after installing proxmox (after the first reboot).

#Below is some of the information presented with the failed boot (first reboot after installing proxmox):

----------------------------------------------------------------------------------------------------
Volume Group PVE not found
logical volume swap not found
logical volume root not found

Common Problems:
-Boot arg (cat /proc/cmdline)
-check root delay
-check root (did the system wait for the right device)
-Missing modules: (cat /proc/modules; ls /dev

Alert! /dev/mapper/pve-root does not exist
Mod probe: module ehci-orion not found in modules.dep
----------------------------------------------------------------------------------------------------

#Below is the Disk Configuration and other useful information

----------------------------------------------------------------------------------------------------
root@pmx01:~# cat /etc/debian_version
8.6


pvroot@pmx01:~# pveversion
pve-manager/4.2-18/158720b9 (running kernel: 4.4.16-1-pve)


root@pmx01:~# lvs
LV VG Attr LSize Pool Origin Data% Meta% Move Log Cpy%Sync Convert
data pve -wi-ao---- 2.62t
root pve -wi-ao---- 93.13g
swap pve -wi-ao---- 17.69g


root@pmx01:~# lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
sda 8:0 0 2.7T 0 disk
├─sda1 8:1 0 9M 0 part
├─sda2 8:2 0 286M 0 part
│ └─md0 9:0 0 285.7M 0 raid1 /boot
└─sda3 8:3 0 2.7T 0 part
└─md1 9:1 0 2.7T 0 raid1
├─pve-root 251:0 0 93.1G 0 lvm /
├─pve-swap 251:1 0 17.7G 0 lvm [SWAP]
└─pve-data 251:2 0 2.6T 0 lvm /var/lib/vz
sdb 8:16 0 2.7T 0 disk
├─sdb1 8:17 0 9M 0 part
├─sdb2 8:18 0 286M 0 part
│ └─md0 9:0 0 285.7M 0 raid1 /boot
└─sdb3 8:19 0 2.7T 0 part
└─md1 9:1 0 2.7T 0 raid1
├─pve-root 251:0 0 93.1G 0 lvm /
├─pve-swap 251:1 0 17.7G 0 lvm [SWAP]
└─pve-data 251:2 0 2.6T 0 lvm /var/lib/vz
sr0 11:0 1 3.7G 0 rom


root@pmx01:~# df -h
Filesystem Size Used Avail Use% Mounted on
udev 10M 0 10M 0% /dev
tmpfs 3.2G 8.9M 3.2G 1% /run
/dev/dm-0 92G 1.6G 86G 2% /
tmpfs 7.9G 25M 7.9G 1% /dev/shm
tmpfs 5.0M 0 5.0M 0% /run/lock
tmpfs 7.9G 0 7.9G 0% /sys/fs/cgroup
/dev/md0 277M 64M 199M 25% /boot
/dev/mapper/pve-data 2.6T 74M 2.5T 1% /var/lib/vz
/dev/fuse 30M 16K 30M 1% /etc/pve


root@pmx01:~# cat /etc/fstab
# /etc/fstab: static file system information.
#
# Use 'blkid' to print the universally unique identifier for a
# device; this may be used with UUID= as a more robust way to name devices
# that works even if disks are added and removed. See fstab(5).
#
# <file system> <mount point> <type> <options> <dump> <pass>
/dev/mapper/pve-root / ext4 errors=remount-ro 0 1
# /boot was on /dev/md0 during installation
/dev/md0 /boot ext2 defaults 0 2
/dev/mapper/pve-data /var/lib/vz ext4 defaults 0 2
/dev/mapper/pve-swap none swap sw 0 0
/dev/sr0 /media/cdrom0 udf,iso9660 user,noauto 0 0

----------------------------------------------------------------------------------------------------

The issue did not occur with several shutdown/restarts prior to installing proxmox.

After a hard shutdown or manual reboot > power on, the OS will boot. However, the issue will randomly reoccur with other shutdowns or reboots.

Any guidance on this issue would be much appreciated.

Cheers!
 
Some more information:

I can reproduce the issue when going into BIOS > make a change > save > issue occurs. Otherwise it appears to be random. If I perform the same previously mentioned steps, then select the older image...
root@pmx01:~# cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-3.16.0-4-amd64 root=/dev/mapper/pve-root ro quiet

....the issue does not occur.

The issue is reproducible when using the below boot image:
root@pmx01:~# cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-4.4.16-1-pve root=/dev/mapper/pve-root ro quiet


I have tried all kinds of stuff with no success other than selecting the older boot_image. Including modifying grub timeouts.
 
Hi,

have you update to the latest bios update from Supermicro?
 
Tested installing from an earlier release of proxmox 4.2 using the baremetal ISO. The issue was still present...

I just updated the supermicro BIOS from 3.5a to 3.5b and installed proxmox with the ISO again. This resulted in the same issue. Next I tried installing with the baremetal iso of version 3.4. I was able to restart the server 17 times without the issue occurring. Something seems to be wrong with version 4.2 and this specific hardware configuration. I don't know about v4.1.

hardware info:

MB: supermicro h8sgl-f
CPU: AMD opteron 6320
HDD: seagate 3TB 7200 RPM Barracuda
RAM: DDR3 1333 4GB Memory unbuffered ECC
 
Hi,

Thanks for posting with some help!

I did try that and unfortunately it didn't help. I can't recall how high I set the delay, but I tried something pretty high (~40) and then maybe even higher. Of course the bootup took a while, but on any given boot, the logical volume would not be found :(. Oh well, I think we will stay away from that hardware configuration from here on out.
 
If you really want to go with lvm, add with noauto option in /etc/fstab

/dev/mapper/pve-root / ext4 defaults,noauto 1 2


Afterwards, chmod +x /etc/rc.d/rc.local and add the following lines to /etc/rc.local

vgchange -a y
mount /dev/mapper/pve-root
 
Okay, nice suggestions. Wish I still had the server to test this out with. However, I will save this info, and if I see the issue again, this will be a good starting point.

Thanks!