kernel: kernel BUG at mm/mmu_notifier.c:805!

SC8198

New Member
Dec 1, 2023
16
2
3
Having a weird issue where my server juts randomly hard locks and loses all connectivity.

Looks like a Kernel bug but I really don't know where to look. Did a google search and didn't find much of anything.

Virtual Environment 8.2.7

Oct 01 07:58:04 jf kernel: ------------[ cut here ]------------
Oct 01 07:58:04 jf kernel: WARNING: CPU: 1 PID: 95602 at kernel/fork.c:920 __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: Modules linked in: veth rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache netfs vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter sctp ip6_udp_tunnel udp_tunnel nf_tables bonding tls softdog sunrpc nfnetlink_log nfnetlink binfmt_misc intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_codec_hdmi kvm nouveau ipmi_ssif irqbypass crct10dif_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel snd_hda_intel aesni_intel snd_intel_dspcfg snd_intel_sdw_acpi mxm_wmi drm_ttm_helper snd_hda_codec crypto_simd ttm cryptd snd_hda_core drm_display_helper rapl snd_hwdep cec snd_pcm mgag200 rc_core snd_timer ucsi_ccg drm_shmem_helper intel_cstate pcspkr snd drm_kms_helper typec_ucsi mei_me video soundcore typec mei ioatdma acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler acpi_pad joydev input_leds mac_hid zfs(PO) spl(O) vhost_net vhost vhost_iotlb
Oct 01 07:58:04 jf kernel: tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq hid_logitech_hidpp hid_logitech_dj dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c hid_generic usbmouse usbkbd usbhid hid isci xhci_pci xhci_pci_renesas crc32_pclmul igb ahci i2c_i801 ehci_pci libsas i2c_algo_bit libahci xhci_hcd i2c_smbus lpc_ich ehci_hcd dca i2c_nvidia_gpu scsi_transport_sas i2c_ccgx_ucsi wmi
Oct 01 07:58:04 jf kernel: CPU: 1 PID: 95602 Comm: iou-wrk-95342 Tainted: P O 6.5.11-4-pve #1
Oct 01 07:58:04 jf kernel: Hardware name: Supermicro X9DRi-LN4+/X9DR3-LN4+/X9DRi-LN4+/X9DR3-LN4+, BIOS 3.4 11/20/2019
Oct 01 07:58:04 jf kernel: RIP: 0010:__mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: Code: 93 9d e8 c5 1f 0a 00 e9 29 ff ff ff be 03 00 00 00 48 89 d7 e8 a3 66 6b 00 e9 48 ff ff ff e8 69 19 13 00 e9 3e ff ff ff 0f 0b <0f> 0b e9 83 fe ff ff 0f 0b e9 92 fe ff ff 48 89 df e8 1c af 32 00
Oct 01 07:58:04 jf kernel: RSP: 0018:ffffc14e0d21bcc0 EFLAGS: 00010246
Oct 01 07:58:04 jf kernel: RAX: ffffa081598c1980 RBX: ffffa08152c69080 RCX: 0000000000000000
Oct 01 07:58:04 jf kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffa08152c69080
Oct 01 07:58:04 jf kernel: RBP: ffffc14e0d21bce8 R08: 0000000000000000 R09: 0000000000000000
Oct 01 07:58:04 jf kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffa0812f873100
Oct 01 07:58:04 jf kernel: R13: ffffa07dc02f1980 R14: ffffa08152c69080 R15: 0000000000000000
Oct 01 07:58:04 jf kernel: FS: 00007f2968ff96c0(0000) GS:ffffa0812f840000(0000) knlGS:0000000000000000
Oct 01 07:58:04 jf kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 01 07:58:04 jf kernel: CR2: 0000029475cfda7e CR3: 000000076c352001 CR4: 00000000001726e0
Oct 01 07:58:04 jf kernel: Call Trace:
Oct 01 07:58:04 jf kernel: <TASK>
Oct 01 07:58:04 jf kernel: ? show_regs+0x6d/0x80
Oct 01 07:58:04 jf kernel: ? __warn+0x89/0x160
Oct 01 07:58:04 jf kernel: ? __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: ? report_bug+0x17e/0x1b0
Oct 01 07:58:04 jf kernel: ? handle_bug+0x46/0x90
Oct 01 07:58:04 jf kernel: ? exc_invalid_op+0x18/0x80
Oct 01 07:58:04 jf kernel: ? asm_exc_invalid_op+0x1b/0x20
Oct 01 07:58:04 jf kernel: ? __mmdrop+0x1ae/0x1d0
Oct 01 07:58:04 jf kernel: finish_task_switch.isra.0+0x1a9/0x2c0
Oct 01 07:58:04 jf kernel: __schedule+0x405/0x1450
Oct 01 07:58:04 jf kernel: ? sysvec_apic_timer_interrupt+0xa6/0xd0
Oct 01 07:58:04 jf kernel: schedule+0x63/0x110
Oct 01 07:58:04 jf kernel: schedule_timeout+0x95/0x170
Oct 01 07:58:04 jf kernel: ? __pfx_process_timeout+0x10/0x10
Oct 01 07:58:04 jf kernel: io_wq_worker+0x1e9/0x3c0
Oct 01 07:58:04 jf kernel: ? raw_spin_rq_unlock+0x10/0x40
Oct 01 07:58:04 jf kernel: ? finish_task_switch.isra.0+0x85/0x2c0
Oct 01 07:58:04 jf kernel: ? __pfx_io_wq_worker+0x10/0x10
Oct 01 07:58:04 jf kernel: ret_from_fork+0x47/0x70
Oct 01 07:58:04 jf kernel: ? __pfx_io_wq_worker+0x10/0x10
Oct 01 07:58:04 jf kernel: ret_from_fork_asm+0x1b/0x30
Oct 01 07:58:04 jf kernel: RIP: 0033:0x0
Oct 01 07:58:04 jf kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Oct 01 07:58:04 jf kernel: RSP: 002b:0000000000000000 EFLAGS: 00000246 ORIG_RAX: 00000000000001aa
Oct 01 07:58:04 jf kernel: RAX: 0000000000000000 RBX: 000055edc3144df0 RCX: 00007f2db7237b95
Oct 01 07:58:04 jf kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000011
Oct 01 07:58:04 jf kernel: RBP: 000055edc3144df8 R08: 0000000000000000 R09: 0000000000000008
Oct 01 07:58:04 jf kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 000055edc3144ee0
Oct 01 07:58:04 jf kernel: R13: 0000000002846000 R14: 0000000000000012 R15: 00007f2908ecf790
Oct 01 07:58:04 jf kernel: </TASK>
Oct 01 07:58:04 jf kernel: ---[ end trace 0000000000000000 ]---
Oct 01 07:58:04 jf kernel: ------------[ cut here ]------------
Oct 01 07:58:04 jf kernel: kernel BUG at mm/mmu_notifier.c:805!

Any help would be appreciated. Lmk if you need anymore info.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!