Greatings,
I upgraded one of my servers today and the kernel upgrade from 5.3.13-3 to 5.3.18-1 breaks PCI passthrough for my VM. The VM startup gets stuck with no response, strace on the process shows no activity.
The error message which seems to be responsible for that is the following:
After I reverted to Kernel 5.3.13-3 everything is working as expected again. Any idea what could cause that?
I upgraded one of my servers today and the kernel upgrade from 5.3.13-3 to 5.3.18-1 breaks PCI passthrough for my VM. The VM startup gets stuck with no response, strace on the process shows no activity.
The error message which seems to be responsible for that is the following:
Code:
Feb 10 20:39:50 pve01 pvedaemon[3667]: start VM 204: UPID:pve01:00000E53:000023DA:5E41B186:qmstart:204:root@pam:
Feb 10 20:39:50 pve01 kernel: [ 91.928584] general protection fault: 0000 [#1] SMP PTI
Feb 10 20:39:50 pve01 kernel: [ 91.928607] CPU: 1 PID: 3667 Comm: task UPID:pve01 Tainted: P O 5.3.18-1-pve #1
Feb 10 20:39:50 pve01 kernel: [ 91.928627] Hardware name: ASUSTeK COMPUTER INC. P10S-C Series/P10S-C Series, BIOS 4402 03/07/2018
Feb 10 20:39:50 pve01 kernel: [ 91.928651] RIP: 0010:remove_files.isra.1+0x24/0x70
Feb 10 20:39:50 pve01 kernel: [ 91.928664] Code: 00 00 00 0f 1f 00 0f 1f 44 00 00 55 48 89 e5 41 55 49 89 d5 41 54 49 89 fc 53 48 85 f6 74 24 48 8b 06 48 89 f3 48 85 c0 74 19 <48> 8b 30 31 d2 4c 89 e7 48 83 c3 08 e8 db d3 ff ff 48 8b 03 48 85
Feb 10 20:39:50 pve01 kernel: [ 91.928705] RSP: 0018:ffffa0db0dc53c18 EFLAGS: 00010206
Feb 10 20:39:50 pve01 kernel: [ 91.928719] RAX: 5efbb6e90f91173f RBX: ffff8edf542f7a80 RCX: 0000000000000000
Feb 10 20:39:50 pve01 kernel: [ 91.928736] RDX: ffff8edf5d2c9488 RSI: ffff8edf542f7a80 RDI: ffff8edf543886e8
Feb 10 20:39:50 pve01 kernel: [ 91.928753] RBP: ffffa0db0dc53c30 R08: 0000000000000000 R09: ffff8edf543882c0
Feb 10 20:39:50 pve01 kernel: [ 91.928770] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8edf543886e8
Feb 10 20:39:50 pve01 kernel: [ 91.928787] R13: ffff8edf5d2c9488 R14: ffff8edf5d18a0b0 R15: fffffffffffffff2
Feb 10 20:39:50 pve01 kernel: [ 91.928805] FS: 00007f8c7a9a51c0(0000) GS:ffff8edf5fb00000(0000) knlGS:0000000000000000
Feb 10 20:39:50 pve01 kernel: [ 91.928824] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 10 20:39:50 pve01 kernel: [ 91.928838] CR2: 00007f8c6e3c4db8 CR3: 00000003d8a38004 CR4: 00000000003606e0
Feb 10 20:39:50 pve01 kernel: [ 91.928855] Call Trace:
Feb 10 20:39:50 pve01 kernel: [ 91.928866] sysfs_remove_group+0x44/0x90
Feb 10 20:39:50 pve01 kernel: [ 91.928878] sysfs_remove_groups+0x2e/0x50
Feb 10 20:39:50 pve01 kernel: [ 91.928890] device_remove_attrs+0x3e/0x70
Feb 10 20:39:50 pve01 kernel: [ 91.928901] device_del+0x160/0x370
Feb 10 20:39:50 pve01 kernel: [ 91.928911] cdev_device_del+0x1a/0x40
Feb 10 20:39:50 pve01 kernel: [ 91.928922] posix_clock_unregister+0x26/0x50
Feb 10 20:39:50 pve01 kernel: [ 91.928934] ptp_clock_unregister+0x6f/0x80
Feb 10 20:39:50 pve01 kernel: [ 91.928951] igb_ptp_stop+0x26/0x50 [igb]
Feb 10 20:39:50 pve01 kernel: [ 91.928964] igb_remove+0x4b/0x130 [igb]
Feb 10 20:39:50 pve01 kernel: [ 91.928975] pci_device_remove+0x3e/0xc0
Feb 10 20:39:50 pve01 kernel: [ 91.928986] device_release_driver_internal+0xe0/0x1b0
Feb 10 20:39:50 pve01 kernel: [ 91.928999] device_driver_detach+0x14/0x20
Feb 10 20:39:50 pve01 kernel: [ 91.929011] unbind_store+0xf9/0x130
Feb 10 20:39:50 pve01 kernel: [ 91.929021] drv_attr_store+0x27/0x40
Feb 10 20:39:50 pve01 kernel: [ 91.929032] sysfs_kf_write+0x3b/0x40
Feb 10 20:39:50 pve01 kernel: [ 91.929043] kernfs_fop_write+0xda/0x1c0
Feb 10 20:39:50 pve01 kernel: [ 91.929053] __vfs_write+0x1b/0x40
Feb 10 20:39:50 pve01 kernel: [ 91.929063] vfs_write+0xab/0x1b0
Feb 10 20:39:50 pve01 kernel: [ 91.929073] ksys_write+0x61/0xe0
Feb 10 20:39:50 pve01 kernel: [ 91.929082] __x64_sys_write+0x1a/0x20
Feb 10 20:39:50 pve01 kernel: [ 91.929093] do_syscall_64+0x5a/0x130
Feb 10 20:39:50 pve01 kernel: [ 91.929554] entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb 10 20:39:50 pve01 kernel: [ 91.930019] RIP: 0033:0x7f8c7abb2471
Feb 10 20:39:50 pve01 kernel: [ 91.930477] Code: 00 00 75 05 48 83 c4 58 c3 e8 0b 4d ff ff 66 2e 0f 1f 84 00 00 00 00 00 90 8b 05 da ef 00 00 85 c0 75 16 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 57 c3 66 0f 1f 44 00 00 41 54 49 89 d4 55 48
Feb 10 20:39:50 pve01 kernel: [ 91.931438] RSP: 002b:00007ffcb70c1d18 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
Feb 10 20:39:50 pve01 kernel: [ 91.931926] RAX: ffffffffffffffda RBX: 00005590e1c9d260 RCX: 00007f8c7abb2471
Feb 10 20:39:50 pve01 kernel: [ 91.932418] RDX: 000000000000000c RSI: 00005590e87d35f0 RDI: 000000000000000b
Feb 10 20:39:50 pve01 kernel: [ 91.932915] RBP: 00005590e87d35f0 R08: 0000000000000000 R09: aaaaaaaaaaaaaaab
Feb 10 20:39:50 pve01 kernel: [ 91.933419] R10: 00005590e87b5df8 R11: 0000000000000246 R12: 000000000000000c
Feb 10 20:39:50 pve01 kernel: [ 91.933914] R13: 00005590e1c9d260 R14: 000000000000000b R15: 00005590e87d2df0
Feb 10 20:39:50 pve01 kernel: [ 91.934405] Modules linked in: ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables sctp xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 aufs iptable_filter bpfilter overlay softdog nfnetlink_log nfnetlink intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp kvm_intel kvm crct10dif_pclmul crc32_pclmul ghash_clmulni_intel ast aesni_intel aes_x86_64 crypto_simd cryptd drm_vram_helper glue_helper intel_cstate ttm intel_rapl_perf drm_kms_helper pcspkr drm fb_sys_fops syscopyarea sysfillrect sysimgblt mei_me mei intel_pch_thermal mac_hid vhost_net vhost tap ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi vfio_pci vfio_virqfd irqbypass vfio_iommu_type1 vfio nct6775 hwmon_vid coretemp sunrpc ip_tables x_tables autofs4 zfs(PO) zunicode(PO) zlua(PO) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) btrfs xor
Feb 10 20:39:50 pve01 kernel: [ 91.934430] zstd_compress raid6_pq libcrc32c i2c_i801 igb ahci i2c_algo_bit dca libahci video
Feb 10 20:39:50 pve01 kernel: [ 91.939179] ---[ end trace f32eff54a40d7579 ]---
Feb 10 20:39:51 pve01 pvedaemon[2088]: <root@pam> end task UPID:pve01:00000E53:000023DA:5E41B186:qmstart:204:root@pam: unable to read tail (got 0 bytes)
Feb 10 20:39:51 pve01 kernel: [ 93.231548] RIP: 0010:remove_files.isra.1+0x24/0x70
Feb 10 20:39:51 pve01 kernel: [ 93.231550] Code: 00 00 00 0f 1f 00 0f 1f 44 00 00 55 48 89 e5 41 55 49 89 d5 41 54 49 89 fc 53 48 85 f6 74 24 48 8b 06 48 89 f3 48 85 c0 74 19 <48> 8b 30 31 d2 4c 89 e7 48 83 c3 08 e8 db d3 ff ff 48 8b 03 48 85
Feb 10 20:39:51 pve01 kernel: [ 93.231551] RSP: 0018:ffffa0db0dc53c18 EFLAGS: 00010206
Feb 10 20:39:51 pve01 kernel: [ 93.231552] RAX: 5efbb6e90f91173f RBX: ffff8edf542f7a80 RCX: 0000000000000000
Feb 10 20:39:51 pve01 kernel: [ 93.231552] RDX: ffff8edf5d2c9488 RSI: ffff8edf542f7a80 RDI: ffff8edf543886e8
Feb 10 20:39:51 pve01 kernel: [ 93.231553] RBP: ffffa0db0dc53c30 R08: 0000000000000000 R09: ffff8edf543882c0
Feb 10 20:39:51 pve01 kernel: [ 93.231553] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8edf543886e8
Feb 10 20:39:51 pve01 kernel: [ 93.231554] R13: ffff8edf5d2c9488 R14: ffff8edf5d18a0b0 R15: fffffffffffffff2
Feb 10 20:39:51 pve01 kernel: [ 93.231554] FS: 00007f8c7a9a51c0(0000) GS:ffff8edf5fb00000(0000) knlGS:0000000000000000
Feb 10 20:39:51 pve01 kernel: [ 93.231555] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 10 20:39:51 pve01 kernel: [ 93.231555] CR2: 00007f8c6e3c4db8 CR3: 00000003d8a38004 CR4: 00000000003606e0
After I reverted to Kernel 5.3.13-3 everything is working as expected again. Any idea what could cause that?