When I boot with my recent kernel (5.3.10) I cannot start my VM which gets a RTX 2080 SUPER passed through (I'll attach the full log with error below). Another VM which gets a GT 1030 passed through still works normal.
However, when I select the previous kernel (5.0.x) from the boot menu everything works fine. So this is what I did since the last kernel update.
But now there is a new kernel update available (5.3.13-1) and I'm worried to install it and have a few questions:
1) It looks like Proxmox only keeps the previous kernel available. So I fear I could end up with 5.3.10 and 5.3.13 but 5.0.x gone. That would be bad if the bug would still exist in 5.3.13. How can I ensure that Proxmox will keep the 5.0.x kernel including its boot menu entry?
2) If I would try the update and mess things up is there an easy way to get the old kernel back? Note that I'm on ZFS so maybe simply a
3) In case I have to stick with an old kernel, where do I set its entry in the bootloader as default? Currently I select it manually but that's a bit annoying if it would be required longer...
I'm guessing most can be answered by the Wiki but I couldn't really find the correct entries. So any links or tips are very welcome!
However, when I select the previous kernel (5.0.x) from the boot menu everything works fine. So this is what I did since the last kernel update.
But now there is a new kernel update available (5.3.13-1) and I'm worried to install it and have a few questions:
1) It looks like Proxmox only keeps the previous kernel available. So I fear I could end up with 5.3.10 and 5.3.13 but 5.0.x gone. That would be bad if the bug would still exist in 5.3.13. How can I ensure that Proxmox will keep the 5.0.x kernel including its boot menu entry?
2) If I would try the update and mess things up is there an easy way to get the old kernel back? Note that I'm on ZFS so maybe simply a
zfs snap rpool/ROOT/pve-1@beforeUpdate
and a rollback + restart in case things go south would work? But somehow I doubt it since the bootloader would have to get updated as well or?3) In case I have to stick with an old kernel, where do I set its entry in the bootloader as default? Currently I select it manually but that's a bit annoying if it would be required longer...
I'm guessing most can be answered by the Wiki but I couldn't really find the correct entries. So any links or tips are very welcome!
Code:
Dec 10 20:27:54 pve3 kernel: BUG: unable to handle page fault for address: ffffa38900111000
Dec 10 20:27:54 pve3 kernel: #PF: supervisor read access in kernel mode
Dec 10 20:27:54 pve3 kernel: #PF: error_code(0x0000) - not-present page
Dec 10 20:27:54 pve3 kernel: PGD ff8554067 P4D ff8554067 PUD ff8555067 PMD ff8556067 PTE 0
Dec 10 20:27:54 pve3 kernel: Oops: 0000 [#1] SMP NOPTI
Dec 10 20:27:54 pve3 kernel: CPU: 12 PID: 1313 Comm: kworker/12:2 Tainted: P O 5.3.10-1-pve #1
Dec 10 20:27:54 pve3 kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS PRO/X570 AORUS PRO, BIOS F10 11/15/2019
Dec 10 20:27:54 pve3 kernel: Workqueue: events ccg_pm_workaround_work [ucsi_ccg]
Dec 10 20:27:54 pve3 kernel: RIP: 0010:gpu_i2c_check_status.isra.5+0x52/0xa0 [i2c_nvidia_gpu]
Dec 10 20:27:54 pve3 kernel: Code: 25 00 00 00 60 3d 00 00 00 60 75 24 be 58 02 00 00 bf f4 01 00 00 e8 cd 16 9e ca 48 8b 05 c6 1c 52 cb 4c 39 e0 79 09 49 8b 06 <8b> 18 85 db 78 ce 48 8b 05 b1 1c 52 cb 49 39 c4 0f 88 b6 02 00 00
Dec 10 20:27:54 pve3 kernel: RSP: 0018:ffffa389013ebcc0 EFLAGS: 00010293
Dec 10 20:27:54 pve3 kernel: RAX: ffffa38900111000 RBX: 00000000ffffffff RCX: 0000000000000000
Dec 10 20:27:54 pve3 kernel: RDX: ffff95903eb1db40 RSI: 0000000000000000 RDI: ffffa389013ebbf8
Dec 10 20:27:54 pve3 kernel: RBP: ffffa389013ebce0 R08: 0000000000000000 R09: 0000000000000003
Dec 10 20:27:54 pve3 kernel: R10: 000000000000000e R11: ffff95903eb294c4 R12: 00000001000115fd
Dec 10 20:27:54 pve3 kernel: R13: ffff95902a98f018 R14: ffff95902a98f020 R15: 0000000100011503
Dec 10 20:27:54 pve3 kernel: FS: 0000000000000000(0000) GS:ffff95903eb00000(0000) knlGS:0000000000000000
Dec 10 20:27:54 pve3 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 10 20:27:54 pve3 kernel: CR2: ffffa38900111000 CR3: 0000000f86804000 CR4: 0000000000340ee0
Dec 10 20:27:54 pve3 kernel: Call Trace:
Dec 10 20:27:54 pve3 kernel: gpu_i2c_master_xfer+0xe8/0x22d [i2c_nvidia_gpu]
Dec 10 20:27:54 pve3 kernel: __i2c_transfer+0x180/0x4d0
Dec 10 20:27:54 pve3 kernel: i2c_transfer+0x88/0x100
Dec 10 20:27:54 pve3 kernel: ccg_read+0x11e/0x170 [ucsi_ccg]
Dec 10 20:27:54 pve3 kernel: ? __switch_to_asm+0x40/0x70
Dec 10 20:27:54 pve3 kernel: ? __switch_to_asm+0x40/0x70
Dec 10 20:27:54 pve3 kernel: ucsi_ccg_sync+0x56/0xb0 [ucsi_ccg]
Dec 10 20:27:54 pve3 kernel: ucsi_notify+0x26/0x120 [typec_ucsi]
Dec 10 20:27:54 pve3 kernel: ccg_pm_workaround_work+0x15/0x20 [ucsi_ccg]
Dec 10 20:27:54 pve3 kernel: process_one_work+0x20f/0x3d0
Dec 10 20:27:54 pve3 kernel: worker_thread+0x34/0x400
Dec 10 20:27:54 pve3 kernel: kthread+0x120/0x140
Dec 10 20:27:54 pve3 kernel: ? process_one_work+0x3d0/0x3d0
Dec 10 20:27:54 pve3 kernel: ? __kthread_parkme+0x70/0x70
Dec 10 20:27:54 pve3 kernel: ret_from_fork+0x22/0x40
Dec 10 20:27:54 pve3 kernel: Modules linked in: md4 cmac nls_utf8 cifs libarc4 fscache ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter bonding edac_mce_amd kvm_amd kvm softdog crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio aesni_intel aes_x86_64 crypto_simd cryptd glue_helper uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common wmi_bmof mxm_wmi pcspkr snd_hda_intel snd_usb_audio snd_hda_codec videodev snd_usbmidi_lib k10temp snd_hda_core snd_rawmidi snd_seq_device snd_hwdep mc snd_pcm ucsi_ccg vhost_net typec_ucsi nfnetlink_log snd_timer vhost tap typec ccp snd nfnetlink ib_iser soundcore rdma_cm iw_cm ib_cm ib_core iscsi_tcp joydev libiscsi_tcp input_leds libiscsi scsi_transport_iscsi mac_hid sunrpc vfio_pci vfio_virqfd irqbypass vfio_iommu_type1 vfio ip_tables x_tables autofs4 zfs(PO) zunicode(PO) zlua(PO) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) btrfs xor
Dec 10 20:27:54 pve3 kernel: zstd_compress raid6_pq libcrc32c hid_logitech_hidpp hid_logitech_dj usbmouse hid_generic usbkbd usbhid hid i2c_piix4 i2c_nvidia_gpu ahci libahci igb i2c_algo_bit ixgbe xfrm_algo dca mdio wmi
Dec 10 20:27:54 pve3 kernel: CR2: ffffa38900111000
Dec 10 20:27:54 pve3 kernel: ---[ end trace 00cfd3a4f5498663 ]---
Dec 10 20:27:54 pve3 kernel: RIP: 0010:gpu_i2c_check_status.isra.5+0x52/0xa0 [i2c_nvidia_gpu]
Dec 10 20:27:54 pve3 kernel: Code: 25 00 00 00 60 3d 00 00 00 60 75 24 be 58 02 00 00 bf f4 01 00 00 e8 cd 16 9e ca 48 8b 05 c6 1c 52 cb 4c 39 e0 79 09 49 8b 06 <8b> 18 85 db 78 ce 48 8b 05 b1 1c 52 cb 49 39 c4 0f 88 b6 02 00 00
Dec 10 20:27:54 pve3 kernel: RSP: 0018:ffffa389013ebcc0 EFLAGS: 00010293
Dec 10 20:27:54 pve3 kernel: RAX: ffffa38900111000 RBX: 00000000ffffffff RCX: 0000000000000000
Dec 10 20:27:54 pve3 kernel: RDX: ffff95903eb1db40 RSI: 0000000000000000 RDI: ffffa389013ebbf8
Dec 10 20:27:54 pve3 kernel: RBP: ffffa389013ebce0 R08: 0000000000000000 R09: 0000000000000003
Dec 10 20:27:54 pve3 kernel: R10: 000000000000000e R11: ffff95903eb294c4 R12: 00000001000115fd
Dec 10 20:27:54 pve3 kernel: R13: ffff95902a98f018 R14: ffff95902a98f020 R15: 0000000100011503
Dec 10 20:27:54 pve3 kernel: FS: 0000000000000000(0000) GS:ffff95903eb00000(0000) knlGS:0000000000000000
Dec 10 20:27:54 pve3 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 10 20:27:54 pve3 kernel: CR2: ffffa38900111000 CR3: 0000000f86804000 CR4: 0000000000340ee0
Dec 10 20:27:54 pve3 systemd[1]: Started 102.scope.
Dec 10 20:27:55 pve3 systemd-udevd[12067]: Using default interface naming scheme 'v240'.
Dec 10 20:27:55 pve3 systemd-udevd[12067]: link_config: autonegotiation is unset or enabled, the speed and duplex are not writable.
Dec 10 20:27:55 pve3 systemd-udevd[12067]: Could not generate persistent MAC address for tap102i0: No such file or directory
Dec 10 20:27:55 pve3 kernel: device tap102i0 entered promiscuous mode
Dec 10 20:27:55 pve3 kernel: vmbr0: port 3(tap102i0) entered blocking state
Dec 10 20:27:55 pve3 kernel: vmbr0: port 3(tap102i0) entered disabled state
Dec 10 20:27:55 pve3 kernel: vmbr0: port 3(tap102i0) entered blocking state
Dec 10 20:27:55 pve3 kernel: vmbr0: port 3(tap102i0) entered forwarding state
Dec 10 20:27:57 pve3 kernel: vfio-pci 0000:0b:00.0: enabling device (0000 -> 0003)
Dec 10 20:27:57 pve3 kernel: vfio-pci 0000:0b:00.0: vfio_ecap_init: hiding ecap 0x1e@0x258
Dec 10 20:27:57 pve3 kernel: vfio-pci 0000:0b:00.0: vfio_ecap_init: hiding ecap 0x19@0x900
Last edited: