kernel bug with pci passthrough

sweet-t

New Member
Oct 22, 2012
19
2
1
I am attempting to pass my pci usb controller into my vm. It is a nvidia chipset card supporting reset and I have it plugged into my pci-x slot which I have had success passing through other pci devices in this slot. The iommu groups are fine for this slot.

When I attempt to start my VM it fails:
Code:
Status stopped: unable to read tail (got 0 bytes)

When I review my PVE syslog it appears to deregister the device (02:00) followed by a a kernel BUG.

Im not sure who to report this to or who is best to help me.

Code:
Dec 24 02:00:26 pve pvedaemon[5053]: start VM 106: UPID:pve:000013BD:0000DAF6:5A3F508A:qmstart:106:root@pam:
Dec 24 02:00:28 pve kernel: vfio-pci 0000:04:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Dec 24 02:00:28 pve kernel: vfio-pci 0000:04:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Dec 24 02:00:28 pve kernel: vfio-pci 0000:04:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Dec 24 02:00:28 pve systemd[1]: sound.target: Unit not needed anymore. Stopping.
Dec 24 02:00:28 pve systemd[1]: Stopped target Sound Card.
Dec 24 02:00:28 pve kernel: ehci-pci 0000:02:00.3: remove, state 4
Dec 24 02:00:28 pve kernel: usb usb3: USB disconnect, device number 1
Dec 24 02:00:28 pve kernel: ehci-pci 0000:02:00.3: USB bus 3 deregistered
Dec 24 02:00:28 pve kernel: ohci-pci 0000:02:00.1: remove, state 4
Dec 24 02:00:28 pve kernel: usb usb5: USB disconnect, device number 1
Dec 24 02:00:28 pve kernel: ohci-pci 0000:02:00.1: USB bus 5 deregistered
Dec 24 02:00:28 pve kernel: ohci-pci 0000:02:00.2: remove, state 4
Dec 24 02:00:28 pve kernel: usb usb6: USB disconnect, device number 1
Dec 24 02:00:28 pve kernel: ohci-pci 0000:02:00.2: USB bus 6 deregistered
Dec 24 02:00:28 pve kernel: ohci-pci 0000:02:00.0: remove, state 4
Dec 24 02:00:28 pve kernel: usb usb4: USB disconnect, device number 1
Dec 24 02:00:28 pve kernel: ------------[ cut here ]------------
Dec 24 02:00:28 pve kernel: kernel BUG at drivers/iommu/intel-iommu.c:3765!
Dec 24 02:00:28 pve kernel: invalid opcode: 0000 [#1] SMP
Dec 24 02:00:28 pve kernel: Modules linked in: veth ip_set ip6table_filter ip6_tables iptable_filter softdog nfnetlink_log nfnetlink dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio zfs(PO) intel_powerclamp coretemp kvm_intel kvm zunicode(PO) zavl(PO) crct10dif_pclmul icp(PO) crc32_pclmul ghash_clmulni_intel pcbc snd_hda_codec_analog snd_hda_codec_generic aesni_intel aes_x86_64 snd_hda_codec_hdmi snd_hda_intel zcommon(PO) znvpair(PO) snd_hda_codec dell_wmi snd_hda_core dell_smbios spl(O) crypto_simd snd_hwdep mxm_wmi sparse_keymap gpio_ich snd_pcm glue_helper video cryptd snd_timer dcdbas input_leds snd dell_smm_hwmon serio_raw soundcore pcspkr ppdev wmi_bmof intel_cstate mac_hid i7core_edac lpc_ich shpchp parport_pc parport wmi ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
Dec 24 02:00:28 pve kernel:  vhost_net sunrpc vhost tap ip_tables x_tables autofs4 xfs libcrc32c btrfs xor raid6_pq vfio_pci irqbypass vfio_virqfd vfio_iommu_type1 vfio pci_stub usbmouse hid_generic usbkbd usbhid hid mptsas mptscsih mptbase tg3 ahci ptp i2c_i801 libahci pps_core scsi_transport_sas
Dec 24 02:00:28 pve kernel: CPU: 1 PID: 5053 Comm: task UPID:pve:0 Tainted: P           O    4.13.8-1-pve #1
Dec 24 02:00:28 pve kernel: Hardware name: Dell Inc. Precision WorkStation T7500  /06FW8P, BIOS A16 05/28/2013
Dec 24 02:00:28 pve kernel: task: ffff98599902c200 task.stack: ffffba83da834000
Dec 24 02:00:28 pve kernel: RIP: 0010:intel_unmap+0x1f1/0x200
Dec 24 02:00:28 pve kernel: RSP: 0018:ffffba83da837cf0 EFLAGS: 00010246
Dec 24 02:00:28 pve kernel: RAX: 0000000000000000 RBX: 00000000ffffc000 RCX: ffffffff9ef98b10
Dec 24 02:00:28 pve kernel: RDX: 0000000000000000 RSI: 0000000000000008 RDI: ffff9859c65f90a0
Dec 24 02:00:28 pve kernel: RBP: ffffba83da837d28 R08: 0000000000000000 R09: 0000000180200009
Dec 24 02:00:28 pve kernel: R10: ffffba83da837cf8 R11: 0000000000000e00 R12: 00000000ffffffff
Dec 24 02:00:28 pve kernel: R13: 0000000000001000 R14: ffffffff9efdad08 R15: ffff9859c65f90a0
Dec 24 02:00:28 pve kernel: FS:  00007f54fa7952c0(0000) GS:ffff9859cae40000(0000) knlGS:0000000000000000
Dec 24 02:00:28 pve kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 24 02:00:28 pve kernel: CR2: 00007f54fa5df408 CR3: 0000000be37a6000 CR4: 00000000000006e0
Dec 24 02:00:28 pve kernel: Call Trace:
Dec 24 02:00:28 pve kernel:  ? kfree+0x14a/0x160
Dec 24 02:00:28 pve kernel:  intel_free_coherent+0x5e/0x80
Dec 24 02:00:28 pve kernel:  ohci_stop+0x13d/0x190
Dec 24 02:00:28 pve kernel:  usb_remove_hcd+0xff/0x250
Dec 24 02:00:28 pve kernel:  usb_hcd_pci_remove+0x6f/0x140
Dec 24 02:00:28 pve kernel:  pci_device_remove+0x39/0xb0
Dec 24 02:00:28 pve kernel:  device_release_driver_internal+0x155/0x210
Dec 24 02:00:28 pve kernel:  device_release_driver+0x12/0x20
Dec 24 02:00:28 pve kernel:  unbind_store+0x10d/0x160
Dec 24 02:00:28 pve kernel:  drv_attr_store+0x25/0x30
Dec 24 02:00:28 pve kernel:  sysfs_kf_write+0x37/0x40
Dec 24 02:00:28 pve kernel:  kernfs_fop_write+0x120/0x1b0
Dec 24 02:00:28 pve kernel:  __vfs_write+0x18/0x40
Dec 24 02:00:28 pve kernel:  vfs_write+0xb5/0x1a0
Dec 24 02:00:28 pve kernel:  SyS_write+0x55/0xc0
Dec 24 02:00:28 pve kernel:  ? SyS_fcntl+0x80/0xb0
Dec 24 02:00:28 pve kernel:  entry_SYSCALL_64_fastpath+0x1e/0xa9
Dec 24 02:00:28 pve kernel: RIP: 0033:0x7f54f9e6b190
Dec 24 02:00:28 pve kernel: RSP: 002b:00007ffc793900b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
Dec 24 02:00:28 pve kernel: RAX: ffffffffffffffda RBX: 000000000000000e RCX: 00007f54f9e6b190
Dec 24 02:00:28 pve kernel: RDX: 000000000000000c RSI: 0000561ea9a2d820 RDI: 000000000000000e
Dec 24 02:00:28 pve kernel: RBP: 0000561ea410c010 R08: 0000000000000000 R09: 0000561ea450fbf0
Dec 24 02:00:28 pve kernel: R10: 0000561ea42b3e80 R11: 0000000000000246 R12: 0000561ea9a05e60
Dec 24 02:00:28 pve kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
Dec 24 02:00:28 pve kernel: Code: ff 49 8d bc 24 18 03 00 00 4c 89 ea 4c 89 f6 e8 e6 07 ff ff 48 89 df e8 1e ae ff ff 48 83 c4 10 5b 41 5c 41 5d 41 5e 41 5f 5d c3 <0f> 0b 49 8b 57 10 eb 8a 0f 1f 80 00 00 00 00 66 66 66 66 90 55 
Dec 24 02:00:28 pve kernel: RIP: intel_unmap+0x1f1/0x200 RSP: ffffba83da837cf0
Dec 24 02:00:28 pve kernel: ---[ end trace 8143b932e8aa06f7 ]---
Dec 24 02:00:28 pve pvedaemon[2144]: <root@pam> end task UPID:pve:000013BD:0000DAF6:5A3F508A:qmstart:106:root@pam: unable to read tail (got 0 bytes)

Code:
proxmox-ve: 5.1-27 (running kernel: 4.13.8-1-pve) 
pve-manager: 5.1-36 (running version: 5.1-36/131401db) 
pve-kernel-4.10.15-1-pve: 4.10.15-15 
pve-kernel-4.13.8-1-pve: 4.13.8-27 
libpve-http-server-perl: 2.0-6 
lvm2: 2.02.168-pve6 
corosync: 2.4.2-pve3 
libqb0: 1.0.1-1 
pve-cluster: 5.0-15 
qemu-server: 5.0-17 
pve-firmware: 2.0-3 
libpve-common-perl: 5.0-20 
libpve-guest-common-perl: 2.0-13 
libpve-access-control: 5.0-7 
libpve-storage-perl: 5.0-16 
pve-libspice-server1: 0.12.8-3 
vncterm: 1.5-2 
pve-docs: 5.1-12 
pve-qemu-kvm: 2.9.1-1 
pve-container: 2.0-17 
pve-firewall: 3.0-3 
pve-ha-manager: 2.0-3 
ksm-control-daemon: 1.2-2 
glusterfs-client: 3.8.8-1 
lxc-pve: 2.1.0-2 
lxcfs: 2.0.7-pve4 criu: 2.11.1-1~bpo90 
novnc-pve: 0.6-4 
smartmontools: 6.5+svn4324-1 
zfsutils-linux: 0.7.3-pve1~bpo9
 
Did u get any progress? I've got the same error but on the process of installing Win10 to Guest on PVE 5
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!