Having a weird issue where my server juts randomly hard locks and loses all connectivity.
Looks like a Kernel bug but I really don't know where to look. Did a google search and didn't find much of anything.
Virtual Environment 8.2.7
Any help would be appreciated. Lmk if you need anymore info.
Looks like a Kernel bug but I really don't know where to look. Did a google search and didn't find much of anything.
Virtual Environment 8.2.7
Oct 01 07:58:04 jf kernel: ------------[ cut here ]------------
Oct 01 07:58:04 jf kernel: WARNING: CPU: 1 PID: 95602 at kernel/fork.c:920 __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: Modules linked in: veth rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache netfs vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter sctp ip6_udp_tunnel udp_tunnel nf_tables bonding tls softdog sunrpc nfnetlink_log nfnetlink binfmt_misc intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_codec_hdmi kvm nouveau ipmi_ssif irqbypass crct10dif_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel snd_hda_intel aesni_intel snd_intel_dspcfg snd_intel_sdw_acpi mxm_wmi drm_ttm_helper snd_hda_codec crypto_simd ttm cryptd snd_hda_core drm_display_helper rapl snd_hwdep cec snd_pcm mgag200 rc_core snd_timer ucsi_ccg drm_shmem_helper intel_cstate pcspkr snd drm_kms_helper typec_ucsi mei_me video soundcore typec mei ioatdma acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler acpi_pad joydev input_leds mac_hid zfs(PO) spl(O) vhost_net vhost vhost_iotlb
Oct 01 07:58:04 jf kernel: tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq hid_logitech_hidpp hid_logitech_dj dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c hid_generic usbmouse usbkbd usbhid hid isci xhci_pci xhci_pci_renesas crc32_pclmul igb ahci i2c_i801 ehci_pci libsas i2c_algo_bit libahci xhci_hcd i2c_smbus lpc_ich ehci_hcd dca i2c_nvidia_gpu scsi_transport_sas i2c_ccgx_ucsi wmi
Oct 01 07:58:04 jf kernel: CPU: 1 PID: 95602 Comm: iou-wrk-95342 Tainted: P O 6.5.11-4-pve #1
Oct 01 07:58:04 jf kernel: Hardware name: Supermicro X9DRi-LN4+/X9DR3-LN4+/X9DRi-LN4+/X9DR3-LN4+, BIOS 3.4 11/20/2019
Oct 01 07:58:04 jf kernel: RIP: 0010:__mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: Code: 93 9d e8 c5 1f 0a 00 e9 29 ff ff ff be 03 00 00 00 48 89 d7 e8 a3 66 6b 00 e9 48 ff ff ff e8 69 19 13 00 e9 3e ff ff ff 0f 0b <0f> 0b e9 83 fe ff ff 0f 0b e9 92 fe ff ff 48 89 df e8 1c af 32 00
Oct 01 07:58:04 jf kernel: RSP: 0018:ffffc14e0d21bcc0 EFLAGS: 00010246
Oct 01 07:58:04 jf kernel: RAX: ffffa081598c1980 RBX: ffffa08152c69080 RCX: 0000000000000000
Oct 01 07:58:04 jf kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffa08152c69080
Oct 01 07:58:04 jf kernel: RBP: ffffc14e0d21bce8 R08: 0000000000000000 R09: 0000000000000000
Oct 01 07:58:04 jf kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffa0812f873100
Oct 01 07:58:04 jf kernel: R13: ffffa07dc02f1980 R14: ffffa08152c69080 R15: 0000000000000000
Oct 01 07:58:04 jf kernel: FS: 00007f2968ff96c0(0000) GS:ffffa0812f840000(0000) knlGS:0000000000000000
Oct 01 07:58:04 jf kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 01 07:58:04 jf kernel: CR2: 0000029475cfda7e CR3: 000000076c352001 CR4: 00000000001726e0
Oct 01 07:58:04 jf kernel: Call Trace:
Oct 01 07:58:04 jf kernel: <TASK>
Oct 01 07:58:04 jf kernel: ? show_regs+0x6d/0x80
Oct 01 07:58:04 jf kernel: ? __warn+0x89/0x160
Oct 01 07:58:04 jf kernel: ? __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: ? report_bug+0x17e/0x1b0
Oct 01 07:58:04 jf kernel: ? handle_bug+0x46/0x90
Oct 01 07:58:04 jf kernel: ? exc_invalid_op+0x18/0x80
Oct 01 07:58:04 jf kernel: ? asm_exc_invalid_op+0x1b/0x20
Oct 01 07:58:04 jf kernel: ? __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: finish_task_switch.isra.0+0x1a9/0x2c0
Oct 01 07:58:04 jf kernel: __schedule+0x405/0x1450
Oct 01 07:58:04 jf kernel: ? sysvec_apic_timer_interrupt+0xa6/0xd0
Oct 01 07:58:04 jf kernel: schedule+0x63/0x110
Oct 01 07:58:04 jf kernel: schedule_timeout+0x95/0x170
Oct 01 07:58:04 jf kernel: ? __pfx_process_timeout+0x10/0x10
Oct 01 07:58:04 jf kernel: io_wq_worker+0x1e9/0x3c0
Oct 01 07:58:04 jf kernel: ? raw_spin_rq_unlock+0x10/0x40
Oct 01 07:58:04 jf kernel: ? finish_task_switch.isra.0+0x85/0x2c0
Oct 01 07:58:04 jf kernel: ? __pfx_io_wq_worker+0x10/0x10
Oct 01 07:58:04 jf kernel: ret_from_fork+0x47/0x70
Oct 01 07:58:04 jf kernel: ? __pfx_io_wq_worker+0x10/0x10
Oct 01 07:58:04 jf kernel: ret_from_fork_asm+0x1b/0x30
Oct 01 07:58:04 jf kernel: RIP: 0033:0x0
Oct 01 07:58:04 jf kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Oct 01 07:58:04 jf kernel: RSP: 002b:0000000000000000 EFLAGS: 00000246 ORIG_RAX: 00000000000001aa
Oct 01 07:58:04 jf kernel: RAX: 0000000000000000 RBX: 000055edc3144df0 RCX: 00007f2db7237b95
Oct 01 07:58:04 jf kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000011
Oct 01 07:58:04 jf kernel: RBP: 000055edc3144df8 R08: 0000000000000000 R09: 0000000000000008
Oct 01 07:58:04 jf kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 000055edc3144ee0
Oct 01 07:58:04 jf kernel: R13: 0000000002846000 R14: 0000000000000012 R15: 00007f2908ecf790
Oct 01 07:58:04 jf kernel: </TASK>
Oct 01 07:58:04 jf kernel: ---[ end trace 0000000000000000 ]---
Oct 01 07:58:04 jf kernel: ------------[ cut here ]------------
Oct 01 07:58:04 jf kernel: kernel BUG at mm/mmu_notifier.c:805!
Oct 01 07:58:04 jf kernel: WARNING: CPU: 1 PID: 95602 at kernel/fork.c:920 __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: Modules linked in: veth rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache netfs vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter sctp ip6_udp_tunnel udp_tunnel nf_tables bonding tls softdog sunrpc nfnetlink_log nfnetlink binfmt_misc intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_codec_hdmi kvm nouveau ipmi_ssif irqbypass crct10dif_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel snd_hda_intel aesni_intel snd_intel_dspcfg snd_intel_sdw_acpi mxm_wmi drm_ttm_helper snd_hda_codec crypto_simd ttm cryptd snd_hda_core drm_display_helper rapl snd_hwdep cec snd_pcm mgag200 rc_core snd_timer ucsi_ccg drm_shmem_helper intel_cstate pcspkr snd drm_kms_helper typec_ucsi mei_me video soundcore typec mei ioatdma acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler acpi_pad joydev input_leds mac_hid zfs(PO) spl(O) vhost_net vhost vhost_iotlb
Oct 01 07:58:04 jf kernel: tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq hid_logitech_hidpp hid_logitech_dj dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c hid_generic usbmouse usbkbd usbhid hid isci xhci_pci xhci_pci_renesas crc32_pclmul igb ahci i2c_i801 ehci_pci libsas i2c_algo_bit libahci xhci_hcd i2c_smbus lpc_ich ehci_hcd dca i2c_nvidia_gpu scsi_transport_sas i2c_ccgx_ucsi wmi
Oct 01 07:58:04 jf kernel: CPU: 1 PID: 95602 Comm: iou-wrk-95342 Tainted: P O 6.5.11-4-pve #1
Oct 01 07:58:04 jf kernel: Hardware name: Supermicro X9DRi-LN4+/X9DR3-LN4+/X9DRi-LN4+/X9DR3-LN4+, BIOS 3.4 11/20/2019
Oct 01 07:58:04 jf kernel: RIP: 0010:__mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: Code: 93 9d e8 c5 1f 0a 00 e9 29 ff ff ff be 03 00 00 00 48 89 d7 e8 a3 66 6b 00 e9 48 ff ff ff e8 69 19 13 00 e9 3e ff ff ff 0f 0b <0f> 0b e9 83 fe ff ff 0f 0b e9 92 fe ff ff 48 89 df e8 1c af 32 00
Oct 01 07:58:04 jf kernel: RSP: 0018:ffffc14e0d21bcc0 EFLAGS: 00010246
Oct 01 07:58:04 jf kernel: RAX: ffffa081598c1980 RBX: ffffa08152c69080 RCX: 0000000000000000
Oct 01 07:58:04 jf kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffa08152c69080
Oct 01 07:58:04 jf kernel: RBP: ffffc14e0d21bce8 R08: 0000000000000000 R09: 0000000000000000
Oct 01 07:58:04 jf kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffa0812f873100
Oct 01 07:58:04 jf kernel: R13: ffffa07dc02f1980 R14: ffffa08152c69080 R15: 0000000000000000
Oct 01 07:58:04 jf kernel: FS: 00007f2968ff96c0(0000) GS:ffffa0812f840000(0000) knlGS:0000000000000000
Oct 01 07:58:04 jf kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 01 07:58:04 jf kernel: CR2: 0000029475cfda7e CR3: 000000076c352001 CR4: 00000000001726e0
Oct 01 07:58:04 jf kernel: Call Trace:
Oct 01 07:58:04 jf kernel: <TASK>
Oct 01 07:58:04 jf kernel: ? show_regs+0x6d/0x80
Oct 01 07:58:04 jf kernel: ? __warn+0x89/0x160
Oct 01 07:58:04 jf kernel: ? __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: ? report_bug+0x17e/0x1b0
Oct 01 07:58:04 jf kernel: ? handle_bug+0x46/0x90
Oct 01 07:58:04 jf kernel: ? exc_invalid_op+0x18/0x80
Oct 01 07:58:04 jf kernel: ? asm_exc_invalid_op+0x1b/0x20
Oct 01 07:58:04 jf kernel: ? __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: finish_task_switch.isra.0+0x1a9/0x2c0
Oct 01 07:58:04 jf kernel: __schedule+0x405/0x1450
Oct 01 07:58:04 jf kernel: ? sysvec_apic_timer_interrupt+0xa6/0xd0
Oct 01 07:58:04 jf kernel: schedule+0x63/0x110
Oct 01 07:58:04 jf kernel: schedule_timeout+0x95/0x170
Oct 01 07:58:04 jf kernel: ? __pfx_process_timeout+0x10/0x10
Oct 01 07:58:04 jf kernel: io_wq_worker+0x1e9/0x3c0
Oct 01 07:58:04 jf kernel: ? raw_spin_rq_unlock+0x10/0x40
Oct 01 07:58:04 jf kernel: ? finish_task_switch.isra.0+0x85/0x2c0
Oct 01 07:58:04 jf kernel: ? __pfx_io_wq_worker+0x10/0x10
Oct 01 07:58:04 jf kernel: ret_from_fork+0x47/0x70
Oct 01 07:58:04 jf kernel: ? __pfx_io_wq_worker+0x10/0x10
Oct 01 07:58:04 jf kernel: ret_from_fork_asm+0x1b/0x30
Oct 01 07:58:04 jf kernel: RIP: 0033:0x0
Oct 01 07:58:04 jf kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Oct 01 07:58:04 jf kernel: RSP: 002b:0000000000000000 EFLAGS: 00000246 ORIG_RAX: 00000000000001aa
Oct 01 07:58:04 jf kernel: RAX: 0000000000000000 RBX: 000055edc3144df0 RCX: 00007f2db7237b95
Oct 01 07:58:04 jf kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000011
Oct 01 07:58:04 jf kernel: RBP: 000055edc3144df8 R08: 0000000000000000 R09: 0000000000000008
Oct 01 07:58:04 jf kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 000055edc3144ee0
Oct 01 07:58:04 jf kernel: R13: 0000000002846000 R14: 0000000000000012 R15: 00007f2908ecf790
Oct 01 07:58:04 jf kernel: </TASK>
Oct 01 07:58:04 jf kernel: ---[ end trace 0000000000000000 ]---
Oct 01 07:58:04 jf kernel: ------------[ cut here ]------------
Oct 01 07:58:04 jf kernel: kernel BUG at mm/mmu_notifier.c:805!
Any help would be appreciated. Lmk if you need anymore info.