VM won't start after a few days

Robstarusa

Renowned Member
Feb 19, 2009
Hi all,

I've had a Windows 10 VM with a Quadro 2000 card passed through, working perfectly on and off for 2 weeks. Now, after rebooting it a few times, the VM will no longer start. Running "qm start <vmid>" from the CLI hangs for a minute or two and then gives me:

can't deactivate LV '/dev/nvme/vm-1010-disk-1': Logical volume nvme/vm-1010-disk-1 in use.

After it fails to start, an lvdisplay shows:

root@vm2:~# lvdisplay /dev/nvme/vm-1010-disk-1
--- Logical volume ---
LV Path /dev/nvme/vm-1010-disk-1
LV Name vm-1010-disk-1
VG Name nvme
LV UUID M2sJLM-Zxyq-OoUv-zozj-6ecD-YoFN-bsz3Re
LV Write Access read/write
LV Creation host, time vm2, 2017-02-11 17:59:32 -0600
LV Status available
# open 0
LV Size 120.00 GiB
Current LE 30720
Segments 1
Allocation inherit
Read ahead sectors auto
- currently set to 256
Block device 251:18

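The error says the volume is in use, yet the lvdisplay output above reports "# open 0". A minimal sketch for pulling that counter out of the output so it can be rechecked right after each failed start; the helper name lv_open_count is hypothetical and not part of LVM or Proxmox:

```shell
#!/bin/sh
# Hypothetical helper: read lvdisplay output on stdin and print the
# value of its "# open" line (the number of current openers of the LV).
lv_open_count() {
    awk '$1 == "#" && $2 == "open" { print $3 }'
}

# Intended use on the host (not run here):
#   lvdisplay /dev/nvme/vm-1010-disk-1 | lv_open_count
```

If this still prints 0 immediately after the failed "qm start", the "in use" message probably refers to a short-lived opener during startup rather than something holding the volume permanently, though that is an assumption and not something the output proves.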
Strangely enough, if I comment out the hostpci0: and hostpci1: directives for my VGA card and its HDMI audio device (part of the same card), the VM starts, but without the passed-through VGA card.

System info:
root@vm2:~# pveversion -v
proxmox-ve: 4.4-80 (running kernel: 4.4.40-1-pve)
pve-manager: 4.4-12 (running version: 4.4-12/e71b7a74)
pve-kernel-4.4.35-2-pve: 4.4.35-79
pve-kernel-4.4.40-1-pve: 4.4.40-80
lvm2: 2.02.116-pve3
corosync-pve: 2.4.2-1
libqb0: 1.0-1
pve-cluster: 4.0-48
qemu-server: 4.0-109
pve-firmware: 1.1-10
libpve-common-perl: 4.0-92
libpve-access-control: 4.0-23
libpve-storage-perl: 4.0-73
pve-libspice-server1: 0.12.8-1
vncterm: 1.3-1
pve-docs: 4.4-3
pve-qemu-kvm: 2.7.1-3
pve-container: 1.0-94
pve-firewall: 2.0-33
pve-ha-manager: 1.0-40
ksm-control-daemon: 1.2-1
glusterfs-client: 3.5.2-2+deb8u3
lxc-pve: 2.0.7-3
lxcfs: 2.0.6-pve1
criu: 1.6.0-1
novnc-pve: 0.5-8
smartmontools: 6.5+svn4324-1~pve80
zfsutils: 0.6.5.9-pve15~bpo80
openvswitch-switch: 2.6.0-2

root@vm2:~# uname -a
Linux vm2 4.4.40-1-pve #1 SMP Wed Feb 8 16:13:20 CET 2017 x86_64 GNU/Linux

root@vm2:~# cat /proc/cpuinfo | grep -i model | sort -u
model : 45
model name : Intel(R) Xeon(R) CPU E5-2690 0 @ 2.90GHz

root@vm2:~# free -m
             total       used       free     shared    buffers     cached
Mem:        161213     160308        905         62        272     128685
-/+ buffers/cache:      31349     129863
Swap:         7551          0       7551

dmidecode info:
System Information
Manufacturer: LENOVO
Product Name: 056849U
Version: ThinkStation S30


Any idea what I should be looking at to solve this, short of a reboot? Could KVM be giving me a misleading error message because of some issue with the passed-through VGA card? I'm not sure why this would suddenly stop working.
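Since the start only fails with the hostpci lines enabled, the kernel log around a failed start may be worth a look. A minimal sketch of a filter for that, assuming the relevant messages mention either "vfio" or the card's PCI address; the function name gpu_log_lines is hypothetical, and whether anything useful shows up is not known from this post:

```shell
#!/bin/sh
# Hypothetical helper: keep only log lines from stdin that mention vfio
# or the given PCI address fragment (case-insensitive, fixed strings).
gpu_log_lines() {
    grep -iF -e vfio -e "$1"
}

# Intended use on the host (not run here):
#   dmesg | gpu_log_lines 05:00
```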
 
VGA info:

root@vm2:~# lspci -s 05:00.0
05:00.0 VGA compatible controller: NVIDIA Corporation GF106GL [Quadro 2000] (rev a1)
root@vm2:~# lspci -s 05:00.1
05:00.1 Audio device: NVIDIA Corporation GF106 High Definition Audio Controller (rev a1)


VM config (with the VGA passthrough lines commented out):
root@vm2:~# more /etc/pve/qemu-server/1010.conf
bootdisk: virtio0
cores: 4
#hostpci0: 05:00.0,pcie=1
#hostpci1: 05:00.1,pcie=1
ide2: none,media=cdrom
machine: q35
memory: 8192
name: jarjar
net0: virtio=F2:0C:C9:56:EA:16,bridge=vmbr0
numa: 0
ostype: win10
scsihw: virtio-scsi-pci
smbios1: uuid=5794a987-01ad-4dff-8cfe-f69e18df3ce0
sockets: 1
usb0: host=041e:3010
usb1: host=046d:c31c
virtio0: nvme:vm-1010-disk-1,size=120G
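To see which kernel driver currently holds each function of the card, the PCI addresses can be read back from the config, commented out or not, and passed to "lspci -k". A minimal sketch, assuming the hostpci lines keep the format shown above; the function name hostpci_addrs is hypothetical:

```shell
#!/bin/sh
# Hypothetical helper: read a VM config on stdin and print the PCI address
# of every hostpciN entry, whether or not the line is commented out.
hostpci_addrs() {
    sed -n 's/^#\{0,1\}hostpci[0-9][0-9]*: *\([^,]*\).*/\1/p'
}

# Intended use on the host (not run here):
#   hostpci_addrs < /etc/pve/qemu-server/1010.conf |
#   while read -r addr; do lspci -k -s "$addr"; done
```

"lspci -k" lists the kernel driver handling each device, which would show whether the card is still bound to a host driver before the VM is started.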
 
