Hi,
For the last several weeks my machine will freeze about once every 5 to 10 days. I've never been able to figure out why. Since last night it has crashed five times, and I would appreciate some help. The last crash was so bad I had to remove the power cord - I was only getting a blank screen through the Supermicro BMC.
I thought it may be a nic card issue, so I switched to a standard intel card - did not help.
Machine specs:
Here is what is in my proxmox /var/lib/systemd/pstore/ directory:
I see the following in my journal all the time, though, it does not seem to be important since it would crash hours or days after this:
I'd appreciate any help!
For the last several weeks my machine will freeze about once every 5 to 10 days. I've never been able to figure out why. Since last night it has crashed five times, and I would appreciate some help. The last crash was so bad I had to remove the power cord - I was only getting a blank screen through the Supermicro BMC.
I thought it may be a nic card issue, so I switched to a standard intel card - did not help.
Machine specs:
- Current Proxmox Virtual Environment 8.2.7 (all software up to date)
- Supermicro Super Server/X13SAE-F motherboard, BIOS 3.3b 08/26/2024 (up to date)
- 12th Gen Intel(R) Core(TM) i7-12700K
Here is what is in my proxmox /var/lib/systemd/pstore/ directory:
Code:
<4>[ 39.528159] ------------[ cut here ]------------
<4>[ 39.528295] general protection fault, maybe for address 0x1: 0000 [#1] PREEMPT SMP NOPTI
<4>[ 39.528297] CPU: 7 PID: 400 Comm: spl_kmem_cache Tainted: P W OE 6.8.12-2-pve #1
<4>[ 39.528298] Hardware name: Supermicro Super Server/X13SAE-F, BIOS 3.3b 08/26/2024
<4>[ 39.528298] RIP: 0010:fbcon_scroll+0x75/0x1c0
<4>[ 39.528303] Code: 25 bf 8b 90 d8 03 00 00 85 d2 74 23 b8 01 00 00 00 48 83 c4 08 5b 41 5c 41 5d 41 5e 41 5f 5d 31 d2 31 c9 31 f6 31 ff 45 31 c0 <c3> cc cc cc cc 80 bb ec 01 00 00 00 75 d4 48 8b 80 e0 03 00 00 8b
<4>[ 39.528303] RSP: 0018:ffffb6abc12cf780 EFLAGS: 00010046
<4>[ 39.528305] RAX: 0000000000000001 RBX: 0000000000000001 RCX: 0000000000000000
<4>[ 39.528305] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
<4>[ 39.528306] RBP: ffffb6abc12cf7c0 R08: 0000000000000000 R09: 0000000000000000
<4>[ 39.528306] R10: 0000000000000000 R11: 0000000000000000 R12: ffff9da84029e000
<4>[ 39.528306] R13: 0000000000000000 R14: 0000000000000030 R15: 0000000000000000
<4>[ 39.528307] FS: 0000000000000000(0000) GS:ffff9db77f580000(0000) knlGS:0000000000000000
<4>[ 39.528308] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 39.528308] CR2: 00007f5e402e9c1c CR3: 0000000fbee36001 CR4: 0000000000f72ef0
<4>[ 39.528309] PKRU: 55555554
<4>[ 39.528309] Call Trace:
<4>[ 39.528310] <TASK>
<4>[ 39.528311] ? show_regs+0x6d/0x80
<4>[ 39.528314] ? die_addr+0x37/0xa0
<4>[ 39.528315] ? exc_general_protection+0x1db/0x480
<4>[ 39.528318] ? asm_exc_general_protection+0x27/0x30
<4>[ 39.528321] ? fbcon_scroll+0x75/0x1c0
<4>[ 39.528323] ? console_flush_all+0x17f/0x390
<4>[ 39.528326] ? console_unlock+0x56/0x130
<4>[ 39.528327] ? vprintk_emit+0xd6/0x330
<4>[ 39.528329] ? vmap_small_pages_range_noflush+0x345/0x620
<4>[ 39.528330] ? vprintk_default+0x1d/0x30
<4>[ 39.528332] ? vprintk+0x42/0x80
<4>[ 39.528333] ? _printk+0x60/0x90
<4>[ 39.528334] ? report_bug+0x156/0x1b0
<4>[ 39.528336] ? handle_bug+0x46/0x90
<4>[ 39.528337] ? exc_invalid_op+0x18/0x80
<4>[ 39.528338] ? asm_exc_invalid_op+0x1b/0x20
<4>[ 39.528340] ? vmap_small_pages_range_noflush+0x345/0x620
<4>[ 39.528341] ? __vmap_pages_range_noflush+0x11a/0x150
<4>[ 39.528342] ? alloc_pages_bulk_array_mempolicy+0xbd/0x240
<4>[ 39.528344] ? __vmalloc_node_range+0x4a1/0x8f0
<4>[ 39.528347] ? spl_cache_grow_work+0x8a/0x250 [spl]
<4>[ 39.528356] ? __vmalloc_node+0x4e/0x80
<4>[ 39.528357] ? spl_cache_grow_work+0x8a/0x250 [spl]
<4>[ 39.528363] ? __vmalloc+0x1e/0x30
<4>[ 39.528364] ? spl_cache_grow_work+0x8a/0x250 [spl]
<4>[ 39.528370] ? taskq_thread+0x27f/0x4c0 [spl]
<4>[ 39.528377] ? finish_task_switch.isra.0+0x8c/0x310
<4>[ 39.528380] ? __pfx_default_wake_function+0x10/0x10
<4>[ 39.528382] ? __pfx_spl_cache_grow_work+0x10/0x10 [spl]
<4>[ 39.528388] ? __pfx_taskq_thread+0x10/0x10 [spl]
<4>[ 39.528394] ? kthread+0xef/0x120
<4>[ 39.528395] ? __pfx_kthread+0x10/0x10
<4>[ 39.528397] ? ret_from_fork+0x44/0x70
<4>[ 39.528398] ? __pfx_kthread+0x10/0x10
<4>[ 39.528399] ? ret_from_fork_asm+0x1b/0x30
<4>[ 39.528401] </TASK>
<4>[ 39.528401] Modules linked in: cfg80211 veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter nf_tables sunrpc binfmt_misc bonding tls nfnetlink_log nfnetlink snd_hda_cod
ec_hdmi snd_hda_codec_realtek snd_hda_codec_generic intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common x86_pkg_temp_thermal snd_sof_pci_intel_tgl i915(OE) intel_powerclamp snd_sof_intel_hda_
common soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_generic
_allocation coretemp soundwire_bus snd_soc_core kvm_intel snd_compress ipmi_ssif xe ac97_bus snd_pcm_dmaengine kvm snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec crct10dif_pclmul polyval_clmulni polyval_gener
ic drm_gpuvm snd_hda_core ghash_clmulni_intel drm_exec sha256_ssse3 gpu_sched snd_hwdep drm_buddy sha1_ssse3 drm_suballoc_helper snd_pcm
<4>[ 39.528423] aesni_intel drm_ttm_helper cmdlinepart ttm snd_timer crypto_simd mei_hdcp mei_pxp cryptd spi_nor snd drm_display_helper rapl acpi_ipmi intel_cstate wmi_bmof pcspkr mtd ast cec soundcore ucsi_acpi mei_me ipm
i_si intel_pmc_core typec_ucsi rc_core mei ipmi_devintf intel_vsec typec ipmi_msghandler pmt_telemetry pmt_class acpi_pad acpi_tad input_leds joydev mac_hid vhost_net vhost vhost_iotlb tap vfio_pci vfio_pci_core irqbypass vfi
o_iommu_type1 vfio iommufd efi_pstore dmi_sysfs ip_tables x_tables autofs4 zfs(PO) spl(O) rndis_host cdc_ether usbnet mii btrfs blake2b_generic xor hid_generic usbmouse usbhid hid raid6_pq libcrc32c xhci_pci nvme xhci_pci_ren
esas crc32_pclmul igb e1000e nvme_core xhci_hcd spi_intel_pci i2c_i801 i2c_algo_bit igc intel_lpss_pci ahci spi_intel i2c_smbus dca nvme_auth intel_lpss libahci idma64 video wmi pinctrl_alderlake
<4>[ 39.528449] ---[ end trace 0000000000000000 ]---
I see the following in my journal all the time, though, it does not seem to be important since it would crash hours or days after this:
Code:
20.8898731
fwln10310: entered allmulticast
20.889915]
fuln103i0: entered promiscuous mode
20.889974] fubr10310:
port 1(fuln103io)
entered blocking state
20.8899851
fubr10310:
1 (fuln10310)
entered forwarding state
20.894718] fubr10310:
2 (veth10310)
entered blocking state
20.8947251
fubr10310:
2 (veth103io)
entered disabled state
20.894733] veth103i0:
entered allmulticast mode
20.8947551
veth103i0: entered promiscuous mode
20.922118]
ethe: renamed from vethjgSaMU
22.324199]
audit: type=1400 audit (1730406859.047:30): apparmon="STATUS" operation="profile_load" profile="/usr/bin/1xc-start
name="1xc-104_</var/lib/1xc›"
pid=2583 comm="apparmor_parsen"
22.6565671
vmbro: port 5(veth10410) entered blocking state
22.656733]
vmbr0: port 5(veth104i0) entered disabled
22.656870]
veth10410: entered allmulticast mode
22.657024] veth10410:
entered promiscuous mode
22. 683853]
etho: renamed from vethN5mCzQ
23.074763]
vmbro: port 5(veth10410) entered blocking state
23.074914]
vmbr0: port 5(veth104i0) entered forwarding state
23.329629]
audit: type=1400 audit (1730406860.052:31): apparmor="STATUS" operation="profile_load" profile="/us/bin/1xc-start
name="1xc-105_</var/1ib/1xc›"
pid=2835 comm="apparmor_parsen"
23.669414] vmbro: port 6(veth105i0) entered blocking state
23.6696461
vmbro: port 6(veth105i0) entered disabled
23.669881]
veth10510: entered allmulticast mode
23.6700411
veth10510: entered promiscuous mode
23.6972871
etho: renamed from vethuk1BrQ
24.0758781
vmbro: port 6(veth10510) entered blocking state
24.0760581
vmbro: port 6(veth10510) entered forwarding
24.3983081
tap10710: entered promiscuous mode
24.431116]
vmbro: port 7(fupr107p0) entered blocking state
24.431324]
vmbro: port 7(fur107p0) entered disabled
24.431526]
fupr107p0: entered allmulticast mode
24.431701]
24.431838]
fupr107p0: entered promiscuous mode vmbro: port 7(fupr107p0) entered blocking
24.431944]
vmbro: port 7(fupr107p0) entered forwarding state
24.436588]
fubr10710: port 1(fuln107i0) entered blocking state
24.436739]
fubr 10710:
port 1(fuln107i0) entered disabled
24.4368551
fuln10710:
entered allmulticast mode
24.436984] fuln107i0:
entered promiscuous mode
24.437120] fubr10710:
port 1(fuln10710) entered blocking state
24.437214] fubr10710:
port 1(fuln107i0) entered forwarding state
24.4423271
fubr107i0: port
2(tap10710) entered blocking state
24.442498]
fubr10710: port
2(tap107i0) entered
disabled
state
24.442669]
tap10710: entered allmulticast mode
24.442850]
fubr10710: port 2(tap107i0) entered
24.442946]
blocking state
fubr10710:
port
2 (tap107i0)
entered
25.497917] fubr10310:
port
forwarding state
2 (veth103i0)
entered blocking state
25.4980951
fubr103i0:
port
2 (veth10310)
entered forwarding state
I'd appreciate any help!