Intermitten host and VM crashes

safe.book7822

New Member
Oct 2, 2023
5
0
1
Hi Team,

I am new into proxmox and looking for some assitsance. I am running an Optiplex with an intel i9500 currently only configured with one Windows VM

The issue that I am having however is I am getting intermitten crashes where the Host and VM will become unresponsive. Neither host or VM will respond to ping requests and I am unable to bring up the proxmox WebUI.

I am yet to be able to pin the triggers as sometimes it can run for as long as 12 hours or as little as 20 mins. I can be activitly connected to the VM using RDP and the connection just drops or the system can just be on idle and it will go offline.

Any assitance would be apprecitated thanks
 
Can you provide the output of pveversion -v, lscpu, dmidecode -t bios?
Do you see anything in the logs?
 
Code:
root@pve1:~# pveversion -v
proxmox-ve: 8.0.2 (running kernel: 6.2.16-14-pve)
pve-manager: 8.0.4 (running version: 8.0.4/d258a813cfa6b390)
pve-kernel-6.2: 8.0.5
proxmox-kernel-helper: 8.0.3
proxmox-kernel-6.2.16-14-pve: 6.2.16-14
proxmox-kernel-6.2: 6.2.16-14
pve-kernel-6.2.16-3-pve: 6.2.16-3
ceph-fuse: 17.2.6-pve1+3
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown2: 3.2.0-1+pmx4
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-4
libknet1: 1.25-pve1
libproxmox-acme-perl: 1.4.6
libproxmox-backup-qemu0: 1.4.0
libproxmox-rs-perl: 0.3.1
libpve-access-control: 8.0.5
libpve-apiclient-perl: 3.3.1
libpve-common-perl: 8.0.8
libpve-guest-common-perl: 5.0.4
libpve-http-server-perl: 5.0.4
libpve-rs-perl: 0.8.5
libpve-storage-perl: 8.0.2
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 5.0.2-4
lxcfs: 5.0.3-pve3
novnc-pve: 1.4.0-2
proxmox-backup-client: 3.0.2-1
proxmox-backup-file-restore: 3.0.2-1
proxmox-kernel-helper: 8.0.3
proxmox-mail-forward: 0.2.0
proxmox-mini-journalreader: 1.4.0
proxmox-widget-toolkit: 4.0.6
pve-cluster: 8.0.3
pve-container: 5.0.4
pve-docs: 8.0.4
pve-edk2-firmware: 3.20230228-4
pve-firewall: 5.0.3
pve-firmware: 3.8-2
pve-ha-manager: 4.0.2
pve-i18n: 3.0.7
pve-qemu-kvm: 8.0.2-6
pve-xtermjs: 4.16.0-3
qemu-server: 8.0.7
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.1.12-pve1
root@pve1:~# lscpu
Architecture:            x86_64
  CPU op-mode(s):        32-bit, 64-bit
  Address sizes:         39 bits physical, 48 bits virtual
  Byte Order:            Little Endian
CPU(s):                  6
  On-line CPU(s) list:   0-5
Vendor ID:               GenuineIntel
  BIOS Vendor ID:        Intel(R) Corporation
  Model name:            Intel(R) Core(TM) i5-9500 CPU @ 3.00GHz
    BIOS Model name:     Intel(R) Core(TM) i5-9500 CPU @ 3.00GHz  CPU @ 2.9GHz
    BIOS CPU family:     205
    CPU family:          6
    Model:               158
    Thread(s) per core:  1
    Core(s) per socket:  6
    Socket(s):           1
    Stepping:            10
    CPU(s) scaling MHz:  91%
    CPU max MHz:         4400.0000
    CPU min MHz:         800.0000
    BogoMIPS:            6000.00
    Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fx
                         sr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts re
                         p_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx
                         est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer
                          aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti ssbd ib
                         rs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 sm
                         ep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves
                          dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp md_clear flush_l1d
Virtualization features:
  Virtualization:        VT-x
Caches (sum of all):
  L1d:                   192 KiB (6 instances)
  L1i:                   192 KiB (6 instances)
  L2:                    1.5 MiB (6 instances)
  L3:                    9 MiB (1 instance)
NUMA:
  NUMA node(s):          1
  NUMA node0 CPU(s):     0-5
Vulnerabilities:
  Gather data sampling:  Vulnerable: No microcode
  Itlb multihit:         KVM: Mitigation: Split huge pages
  L1tf:                  Mitigation; PTE Inversion; VMX conditional cache flushes, SMT disabled
  Mds:                   Mitigation; Clear CPU buffers; SMT disabled
  Meltdown:              Mitigation; PTI
  Mmio stale data:       Mitigation; Clear CPU buffers; SMT disabled
  Retbleed:              Mitigation; IBRS
  Spec rstack overflow:  Not affected
  Spec store bypass:     Mitigation; Speculative Store Bypass disabled via prctl
  Spectre v1:            Mitigation; usercopy/swapgs barriers and __user pointer sanitization
  Spectre v2:            Mitigation; IBRS, IBPB conditional, STIBP disabled, RSB filling, PBRSB-eIBRS Not affected
  Srbds:                 Vulnerable: No microcode
  Tsx async abort:       Mitigation; Clear CPU buffers; SMT disabled
root@pve1:~# dmidecode -t bios
# dmidecode 3.4
Getting SMBIOS data from sysfs.
SMBIOS 3.1.1 present.

Handle 0x0000, DMI type 0, 26 bytes
BIOS Information
        Vendor: Dell Inc.
        Version: 1.3.1
        Release Date: 02/06/2020
        Address: 0xF0000
        Runtime Size: 64 kB
        ROM Size: 32 MB
        Characteristics:
                PCI is supported
                PNP is supported
                BIOS is upgradeable
                BIOS shadowing is allowed
                Boot from CD is supported
                Selectable boot is supported
                EDD is supported
                5.25"/1.2 MB floppy services are supported (int 13h)
                3.5"/720 kB floppy services are supported (int 13h)
                3.5"/2.88 MB floppy services are supported (int 13h)
                Print screen service is supported (int 5h)
                8042 keyboard services are supported (int 9h)
                Serial services are supported (int 14h)
                Printer services are supported (int 17h)
                ACPI is supported
                USB legacy is supported
                BIOS boot specification is supported
                Function key-initiated network boot is supported
                Targeted content distribution is supported
                UEFI is supported
        BIOS Revision: 1.3

Handle 0xF03F, DMI type 13, 22 bytes
BIOS Language Information
        Language Description Format: Long
        Installable Languages: 2
                en|US|iso8859-1
                <BAD INDEX>
        Currently Installed Language: en|US|iso8859-1

root@pve1:~#root@pve1:~# pveversion -v
proxmox-ve: 8.0.2 (running kernel: 6.2.16-14-pve)
pve-manager: 8.0.4 (running version: 8.0.4/d258a813cfa6b390)
pve-kernel-6.2: 8.0.5
proxmox-kernel-helper: 8.0.3
proxmox-kernel-6.2.16-14-pve: 6.2.16-14
proxmox-kernel-6.2: 6.2.16-14
pve-kernel-6.2.16-3-pve: 6.2.16-3
ceph-fuse: 17.2.6-pve1+3
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown2: 3.2.0-1+pmx4
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-4
libknet1: 1.25-pve1
libproxmox-acme-perl: 1.4.6
libproxmox-backup-qemu0: 1.4.0
libproxmox-rs-perl: 0.3.1
libpve-access-control: 8.0.5
libpve-apiclient-perl: 3.3.1
libpve-common-perl: 8.0.8
libpve-guest-common-perl: 5.0.4
libpve-http-server-perl: 5.0.4
libpve-rs-perl: 0.8.5
libpve-storage-perl: 8.0.2
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 5.0.2-4
lxcfs: 5.0.3-pve3
novnc-pve: 1.4.0-2
proxmox-backup-client: 3.0.2-1
proxmox-backup-file-restore: 3.0.2-1
proxmox-kernel-helper: 8.0.3
proxmox-mail-forward: 0.2.0
proxmox-mini-journalreader: 1.4.0
proxmox-widget-toolkit: 4.0.6
pve-cluster: 8.0.3
pve-container: 5.0.4
pve-docs: 8.0.4
pve-edk2-firmware: 3.20230228-4
pve-firewall: 5.0.3
pve-firmware: 3.8-2
pve-ha-manager: 4.0.2
pve-i18n: 3.0.7
pve-qemu-kvm: 8.0.2-6
pve-xtermjs: 4.16.0-3
qemu-server: 8.0.7
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.1.12-pve1
root@pve1:~# lscpu
Architecture:            x86_64
  CPU op-mode(s):        32-bit, 64-bit
  Address sizes:         39 bits physical, 48 bits virtual
  Byte Order:            Little Endian
CPU(s):                  6
  On-line CPU(s) list:   0-5
Vendor ID:               GenuineIntel
  BIOS Vendor ID:        Intel(R) Corporation
  Model name:            Intel(R) Core(TM) i5-9500 CPU @ 3.00GHz
    BIOS Model name:     Intel(R) Core(TM) i5-9500 CPU @ 3.00GHz  CPU @ 2.9GHz
    BIOS CPU family:     205
    CPU family:          6
    Model:               158
    Thread(s) per core:  1
    Core(s) per socket:  6
    Socket(s):           1
    Stepping:            10
    CPU(s) scaling MHz:  91%
    CPU max MHz:         4400.0000
    CPU min MHz:         800.0000
    BogoMIPS:            6000.00
    Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fx
                         sr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts re
                         p_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx
                         est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer
                          aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti ssbd ib
                         rs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 sm
                         ep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves
                          dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp md_clear flush_l1d
Virtualization features:
  Virtualization:        VT-x
Caches (sum of all):
  L1d:                   192 KiB (6 instances)
  L1i:                   192 KiB (6 instances)
  L2:                    1.5 MiB (6 instances)
  L3:                    9 MiB (1 instance)
NUMA:
  NUMA node(s):          1
  NUMA node0 CPU(s):     0-5
Vulnerabilities:
  Gather data sampling:  Vulnerable: No microcode
  Itlb multihit:         KVM: Mitigation: Split huge pages
  L1tf:                  Mitigation; PTE Inversion; VMX conditional cache flushes, SMT disabled
  Mds:                   Mitigation; Clear CPU buffers; SMT disabled
  Meltdown:              Mitigation; PTI
  Mmio stale data:       Mitigation; Clear CPU buffers; SMT disabled
  Retbleed:              Mitigation; IBRS
  Spec rstack overflow:  Not affected
  Spec store bypass:     Mitigation; Speculative Store Bypass disabled via prctl
  Spectre v1:            Mitigation; usercopy/swapgs barriers and __user pointer sanitization
  Spectre v2:            Mitigation; IBRS, IBPB conditional, STIBP disabled, RSB filling, PBRSB-eIBRS Not affected
  Srbds:                 Vulnerable: No microcode
  Tsx async abort:       Mitigation; Clear CPU buffers; SMT disabled
root@pve1:~# dmidecode -t bios
# dmidecode 3.4
Getting SMBIOS data from sysfs.
SMBIOS 3.1.1 present.

Handle 0x0000, DMI type 0, 26 bytes
BIOS Information
        Vendor: Dell Inc.
        Version: 1.3.1
        Release Date: 02/06/2020
        Address: 0xF0000
        Runtime Size: 64 kB
        ROM Size: 32 MB
        Characteristics:
                PCI is supported
                PNP is supported
                BIOS is upgradeable
                BIOS shadowing is allowed
                Boot from CD is supported
                Selectable boot is supported
                EDD is supported
                5.25"/1.2 MB floppy services are supported (int 13h)
                3.5"/720 kB floppy services are supported (int 13h)
                3.5"/2.88 MB floppy services are supported (int 13h)
                Print screen service is supported (int 5h)
                8042 keyboard services are supported (int 9h)
                Serial services are supported (int 14h)
                Printer services are supported (int 17h)
                ACPI is supported
                USB legacy is supported
                BIOS boot specification is supported
                Function key-initiated network boot is supported
                Targeted content distribution is supported
                UEFI is supported
        BIOS Revision: 1.3

Handle 0xF03F, DMI type 13, 22 bytes
BIOS Language Information
        Language Description Format: Long
        Installable Languages: 2
                en|US|iso8859-1
                <BAD INDEX>
        Currently Installed Language: en|US|iso8859-1

root@pve1:~#
Can you provide the output of pveversion -v, lscpu, dmidecode -t bios?
Do you see anything in the logs?
 
Logs

Code:
Oct 04 10:49:45 pve1 kernel: ------------[ cut here ]------------
Oct 04 10:49:45 pve1 kernel: NETDEV WATCHDOG: enp1s0 (r8169): transmit queue 0 timed out
Oct 04 10:49:45 pve1 kernel: WARNING: CPU: 3 PID: 0 at net/sched/sch_generic.c:525 dev_watchdog+0x23a/0x250
Oct 04 10:49:45 pve1 kernel: Modules linked in: veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter nf_tables bonding tls sunrpc nfnetlink_log nfnetlink binfmt_misc snd_hda_codec_hdmi snd_ctl_led snd_hda_codec_realtek snd_hda_codec_generic snd_sof_pci_intel_cnl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof intel_rapl_msr snd_sof_utils intel_rapl_common snd_soc_hdac_hda intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp snd_hda_ext_core coretemp snd_soc_acpi_intel_match snd_soc_acpi kvm_intel soundwire_bus i915 snd_soc_core snd_compress ac97_bus kvm snd_pcm_dmaengine irqbypass drm_buddy crct10dif_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel ttm snd_hda_intel sha512_ssse3 mei_hdcp drm_display_helper mei_pxp snd_intel_dspcfg snd_intel_sdw_acpi cec aesni_intel snd_hda_codec dell_wmi rc_core crypto_simd ledtrig_audio snd_hda_core
Oct 04 10:49:45 pve1 kernel:  cryptd snd_hwdep drm_kms_helper rapl snd_pcm i2c_algo_bit dell_smbios syscopyarea dcdbas joydev intel_cstate dell_wmi_sysman sparse_keymap dell_wmi_descriptor firmware_attributes_class cmdlinepart pcspkr snd_timer sysfillrect snd spi_nor mei_me input_leds wmi_bmof soundcore mtd ee1004 mei intel_pch_thermal sysimgblt acpi_pad mac_hid zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) vhost_net vhost vhost_iotlb tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 ses enclosure scsi_transport_sas hid_logitech_hidpp hid_logitech_dj hid_generic usbkbd usbmouse btrfs usbhid hid blake2b_generic xor raid6_pq simplefb uas usb_storage dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c xhci_pci xhci_pci_renesas nvme spi_intel_pci crc32_pclmul spi_intel nvme_core r8169 ahci i2c_i801 xhci_hcd i2c_smbus nvme_common realtek libahci video wmi
Oct 04 10:49:45 pve1 kernel: CPU: 3 PID: 0 Comm: swapper/3 Tainted: P           O       6.2.16-14-pve #1
Oct 04 10:49:45 pve1 kernel: Hardware name: Dell Inc. OptiPlex 3070/07WP95, BIOS 1.3.1 02/06/2020
Oct 04 10:49:45 pve1 kernel: RIP: 0010:dev_watchdog+0x23a/0x250
Oct 04 10:49:45 pve1 kernel: Code: 00 e9 2b ff ff ff 48 89 df c6 05 ac 5d 7d 01 01 e8 bb 08 f8 ff 44 89 f1 48 89 de 48 c7 c7 90 87 c0 99 48 89 c2 e8 56 91 30 ff <0f> 0b e9 1c ff ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00
Oct 04 10:49:45 pve1 kernel: RSP: 0018:ffffa23f401c4e38 EFLAGS: 00010246
Oct 04 10:49:45 pve1 kernel: RAX: 0000000000000000 RBX: ffff9188c1e74000 RCX: 0000000000000000
Oct 04 10:49:45 pve1 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Oct 04 10:49:45 pve1 kernel: RBP: ffffa23f401c4e68 R08: 0000000000000000 R09: 0000000000000000
Oct 04 10:49:45 pve1 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff9188c1e744c8
Oct 04 10:49:45 pve1 kernel: R13: ffff9188c1e7441c R14: 0000000000000000 R15: 0000000000000000
Oct 04 10:49:45 pve1 kernel: FS:  0000000000000000(0000) GS:ffff918bee4c0000(0000) knlGS:0000000000000000
Oct 04 10:49:45 pve1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 04 10:49:45 pve1 kernel: CR2: 00007f98fbf6d420 CR3: 00000002b1c10003 CR4: 00000000003726e0
Oct 04 10:49:45 pve1 kernel: Call Trace:
Oct 04 10:49:45 pve1 kernel:  <IRQ>
Oct 04 10:49:45 pve1 kernel:  ? show_regs+0x6d/0x80
Oct 04 10:49:45 pve1 kernel:  ? __warn+0x89/0x160
Oct 04 10:49:45 pve1 kernel:  ? dev_watchdog+0x23a/0x250
Oct 04 10:49:45 pve1 kernel:  ? report_bug+0x17e/0x1b0
Oct 04 10:49:45 pve1 kernel:  ? irq_work_queue+0x2f/0x70
Oct 04 10:49:45 pve1 kernel:  ? handle_bug+0x46/0x90
Oct 04 10:49:45 pve1 kernel:  ? exc_invalid_op+0x18/0x80
Oct 04 10:49:45 pve1 kernel:  ? asm_exc_invalid_op+0x1b/0x20
Oct 04 10:49:45 pve1 kernel:  ? dev_watchdog+0x23a/0x250
Oct 04 10:49:45 pve1 kernel:  ? dev_watchdog+0x23a/0x250
Oct 04 10:49:45 pve1 kernel:  ? __pfx_dev_watchdog+0x10/0x10
Oct 04 10:49:45 pve1 kernel:  call_timer_fn+0x29/0x160
Oct 04 10:49:45 pve1 kernel:  ? __pfx_dev_watchdog+0x10/0x10
Oct 04 10:49:45 pve1 kernel:  __run_timers+0x259/0x310
Oct 04 10:49:45 pve1 kernel:  run_timer_softirq+0x1d/0x40
Oct 04 10:49:45 pve1 kernel:  __do_softirq+0xd6/0x346
Oct 04 10:49:45 pve1 kernel:  ? hrtimer_interrupt+0x11f/0x250
Oct 04 10:49:45 pve1 kernel:  __irq_exit_rcu+0xa2/0xd0
Oct 04 10:49:45 pve1 kernel:  irq_exit_rcu+0xe/0x20
Oct 04 10:49:45 pve1 kernel:  sysvec_apic_timer_interrupt+0x92/0xd0
Oct 04 10:49:45 pve1 kernel:  </IRQ>
Oct 04 10:49:45 pve1 kernel:  <TASK>
Oct 04 10:49:45 pve1 kernel:  asm_sysvec_apic_timer_interrupt+0x1b/0x20
Oct 04 10:49:45 pve1 kernel: RIP: 0010:cpuidle_enter_state+0xde/0x6f0
Oct 04 10:49:45 pve1 kernel: Code: 12 17 67 e8 f4 64 4a ff 8b 53 04 49 89 c7 0f 1f 44 00 00 31 ff e8 22 6d 49 ff 80 7d d0 00 0f 85 eb 00 00 00 fb 0f 1f 44 00 00 <45> 85 f6 0f 88 12 02 00 00 4d 63 ee 49 83 fd 09 0f 87 c7 04 00 00
Oct 04 10:49:45 pve1 kernel: RSP: 0018:ffffa23f40113e38 EFLAGS: 00000246
Oct 04 10:49:45 pve1 kernel: RAX: 0000000000000000 RBX: ffffc23f3fcc0010 RCX: 0000000000000000
Oct 04 10:49:45 pve1 kernel: RDX: 0000000000000003 RSI: 0000000000000000 RDI: 0000000000000000
Oct 04 10:49:45 pve1 kernel: RBP: ffffa23f40113e88 R08: 0000000000000000 R09: 0000000000000000
Oct 04 10:49:45 pve1 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffffff9a6c3a40
Oct 04 10:49:45 pve1 kernel: R13: 0000000000000008 R14: 0000000000000008 R15: 00000ca76fd2b5aa
Oct 04 10:49:45 pve1 kernel:  ? cpuidle_enter_state+0xce/0x6f0
Oct 04 10:49:45 pve1 kernel:  cpuidle_enter+0x2e/0x50
Oct 04 10:49:45 pve1 kernel:  do_idle+0x216/0x2a0
Oct 04 10:49:45 pve1 kernel:  cpu_startup_entry+0x1d/0x20
Oct 04 10:49:45 pve1 kernel:  start_secondary+0x122/0x160
Oct 04 10:49:45 pve1 kernel:  secondary_startup_64_no_verify+0xe5/0xeb
Oct 04 10:49:45 pve1 kernel:  </TASK>
Oct 04 10:49:45 pve1 kernel: ---[ end trace 0000000000000000 ]---
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_chipcmd_cond == 1 (loop: 100, delay: 100).
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:49:45 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:49:46 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:49:46 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:52:08 pve1 kernel: net_ratelimit: 9 callbacks suppressed
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_chipcmd_cond == 1 (loop: 100, delay: 100).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:52:08 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:55:10 pve1 kernel: net_ratelimit: 9 callbacks suppressed
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_chipcmd_cond == 1 (loop: 100, delay: 100).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:55:10 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:58:09 pve1 smartd[824]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 114 to 116
Oct 04 10:58:09 pve1 smartd[824]: Device: /dev/sdc [SAT], 1 Currently unreadable (pending) sectors
Oct 04 10:58:11 pve1 kernel: net_ratelimit: 9 callbacks suppressed
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_chipcmd_cond == 1 (loop: 100, delay: 100).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 10:58:11 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 11:00:53 pve1 kernel: net_ratelimit: 9 callbacks suppressed
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_chipcmd_cond == 1 (loop: 100, delay: 100).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 11:00:53 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 04 11:03:45 pve1 kernel: net_ratelimit: 9 callbacks suppressed

Seems to still be generating logs even when its unresponsive. 10.49 was the last time my plex server went offline and host and VM unresponsive
 
So I believe I followed the forum post you linked and the reliabiliy seemed to be the best yet however it has once again becoming unresponsive.


Code:
Oct 06 12:13:33 pve1 kernel: ------------[ cut here ]------------
Oct 06 12:13:33 pve1 kernel: NETDEV WATCHDOG: enp1s0 (r8169): transmit queue 0 timed out
Oct 06 12:13:33 pve1 kernel: WARNING: CPU: 3 PID: 0 at net/sched/sch_generic.c:525 dev_watchdog+0x23a/0x250
Oct 06 12:13:33 pve1 kernel: Modules linked in: veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter nf_tables bonding tls sunrpc nfnetlink_log nfnetlink binfmt_misc snd_hda_codec_hdmi snd_ctl_led snd_hda_codec_realtek snd_hda_codec_generic joydev input_leds snd_sof_pci_intel_cnl snd_sof_intel_hda_common soundwire_intel intel_rapl_msr soundwire_generic_allocation intel_rapl_common soundwire_cadence snd_sof_intel_hda intel_tcc_cooling x86_pkg_temp_thermal snd_sof_pci intel_powerclamp snd_sof_xtensa_dsp coretemp snd_sof snd_sof_utils snd_soc_hdac_hda kvm_intel snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi i915 soundwire_bus kvm snd_soc_core snd_compress ac97_bus irqbypass snd_pcm_dmaengine crct10dif_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel sha512_ssse3 drm_buddy snd_hda_intel ttm snd_intel_dspcfg snd_intel_sdw_acpi drm_display_helper snd_hda_codec aesni_intel cec mei_hdcp mei_pxp crypto_simd snd_hda_core rc_core cryptd
Oct 06 12:13:33 pve1 kernel:  snd_hwdep dell_wmi rapl ledtrig_audio drm_kms_helper snd_pcm dell_smbios intel_cstate dell_wmi_sysman dcdbas sparse_keymap firmware_attributes_class pcspkr cmdlinepart i2c_algo_bit snd_timer spi_nor mei_me syscopyarea snd sysfillrect soundcore dell_wmi_descriptor mtd wmi_bmof ee1004 mei sysimgblt intel_pch_thermal acpi_pad mac_hid zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) vhost_net vhost vhost_iotlb tap drm efi_pstore dmi_sysfs ip_tables x_tables autofs4 hid_logitech_hidpp ses enclosure scsi_transport_sas btrfs blake2b_generic xor hid_logitech_dj raid6_pq hid_generic usbkbd usbmouse usbhid hid simplefb uas usb_storage dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c r8169 nvme xhci_pci i2c_i801 xhci_pci_renesas spi_intel_pci realtek spi_intel i2c_smbus crc32_pclmul video nvme_core ahci libahci xhci_hcd nvme_common wmi
Oct 06 12:13:33 pve1 kernel: CPU: 3 PID: 0 Comm: swapper/3 Tainted: P           O       6.2.16-15-pve #1
Oct 06 12:13:33 pve1 kernel: Hardware name: Dell Inc. OptiPlex 3070/07WP95, BIOS 1.3.1 02/06/2020
Oct 06 12:13:33 pve1 kernel: RIP: 0010:dev_watchdog+0x23a/0x250
Oct 06 12:13:33 pve1 kernel: Code: 00 e9 2b ff ff ff 48 89 df c6 05 ac 5d 7d 01 01 e8 bb 08 f8 ff 44 89 f1 48 89 de 48 c7 c7 90 87 40 af 48 89 c2 e8 56 91 30 ff <0f> 0b e9 1c ff ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00
Oct 06 12:13:33 pve1 kernel: RSP: 0018:ffffaa5b801c4e38 EFLAGS: 00010246
Oct 06 12:13:33 pve1 kernel: RAX: 0000000000000000 RBX: ffff8a0a8662c000 RCX: 0000000000000000
Oct 06 12:13:33 pve1 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Oct 06 12:13:33 pve1 kernel: RBP: ffffaa5b801c4e68 R08: 0000000000000000 R09: 0000000000000000
Oct 06 12:13:33 pve1 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff8a0a8662c4c8
Oct 06 12:13:33 pve1 kernel: R13: ffff8a0a8662c41c R14: 0000000000000000 R15: 0000000000000000
Oct 06 12:13:33 pve1 kernel: FS:  0000000000000000(0000) GS:ffff8a0dae4c0000(0000) knlGS:0000000000000000
Oct 06 12:13:33 pve1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 06 12:13:33 pve1 kernel: CR2: 00005b0809afc390 CR3: 000000042c810004 CR4: 00000000003726e0
Oct 06 12:13:33 pve1 kernel: Call Trace:
Oct 06 12:13:33 pve1 kernel:  <IRQ>
Oct 06 12:13:33 pve1 kernel:  ? show_regs+0x6d/0x80
Oct 06 12:13:33 pve1 kernel:  ? __warn+0x89/0x160
Oct 06 12:13:33 pve1 kernel:  ? dev_watchdog+0x23a/0x250
Oct 06 12:13:33 pve1 kernel:  ? report_bug+0x17e/0x1b0
Oct 06 12:13:33 pve1 kernel:  ? handle_bug+0x46/0x90
Oct 06 12:13:33 pve1 kernel:  ? exc_invalid_op+0x18/0x80
Oct 06 12:13:33 pve1 kernel:  ? asm_exc_invalid_op+0x1b/0x20
Oct 06 12:13:33 pve1 kernel:  ? dev_watchdog+0x23a/0x250
Oct 06 12:13:33 pve1 kernel:  ? dev_watchdog+0x23a/0x250
Oct 06 12:13:33 pve1 kernel:  ? __pfx_dev_watchdog+0x10/0x10
Oct 06 12:13:33 pve1 kernel:  call_timer_fn+0x29/0x160
Oct 06 12:13:33 pve1 kernel:  ? __pfx_dev_watchdog+0x10/0x10
Oct 06 12:13:33 pve1 kernel:  __run_timers+0x259/0x310
Oct 06 12:13:33 pve1 kernel:  run_timer_softirq+0x1d/0x40
Oct 06 12:13:33 pve1 kernel:  __do_softirq+0xd6/0x346
Oct 06 12:13:33 pve1 kernel:  ? hrtimer_interrupt+0x11f/0x250
Oct 06 12:13:33 pve1 kernel:  __irq_exit_rcu+0xa2/0xd0
Oct 06 12:13:33 pve1 kernel:  irq_exit_rcu+0xe/0x20
Oct 06 12:13:33 pve1 kernel:  sysvec_apic_timer_interrupt+0x92/0xd0
Oct 06 12:13:33 pve1 kernel:  </IRQ>
Oct 06 12:13:33 pve1 kernel:  <TASK>
Oct 06 12:13:33 pve1 kernel:  asm_sysvec_apic_timer_interrupt+0x1b/0x20
Oct 06 12:13:33 pve1 kernel: RIP: 0010:cpuidle_enter_state+0xde/0x6f0
Oct 06 12:13:33 pve1 kernel: Code: 12 97 51 e8 f4 64 4a ff 8b 53 04 49 89 c7 0f 1f 44 00 00 31 ff e8 22 6d 49 ff 80 7d d0 00 0f 85 eb 00 00 00 fb 0f 1f 44 00 00 <45> 85 f6 0f 88 12 02 00 00 4d 63 ee 49 83 fd 09 0f 87 c7 04 00 00
Oct 06 12:13:33 pve1 kernel: RSP: 0018:ffffaa5b80113e38 EFLAGS: 00000246
Oct 06 12:13:33 pve1 kernel: RAX: 0000000000000000 RBX: ffffca5b7fcc0008 RCX: 0000000000000000
Oct 06 12:13:33 pve1 kernel: RDX: 0000000000000003 RSI: 0000000000000000 RDI: 0000000000000000
Oct 06 12:13:33 pve1 kernel: RBP: ffffaa5b80113e88 R08: 0000000000000000 R09: 0000000000000000
Oct 06 12:13:33 pve1 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffafec3a40
Oct 06 12:13:33 pve1 kernel: R13: 0000000000000006 R14: 0000000000000006 R15: 000078509da8a0ec
Oct 06 12:13:33 pve1 kernel:  ? cpuidle_enter_state+0xce/0x6f0
Oct 06 12:13:33 pve1 kernel:  cpuidle_enter+0x2e/0x50
Oct 06 12:13:33 pve1 kernel:  do_idle+0x216/0x2a0
Oct 06 12:13:33 pve1 kernel:  cpu_startup_entry+0x1d/0x20
Oct 06 12:13:33 pve1 kernel:  start_secondary+0x122/0x160
Oct 06 12:13:33 pve1 kernel:  secondary_startup_64_no_verify+0xe5/0xeb
Oct 06 12:13:33 pve1 kernel:  </TASK>
Oct 06 12:13:33 pve1 kernel: ---[ end trace 0000000000000000 ]---
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_chipcmd_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:33 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:43 pve1 kernel: net_ratelimit: 9 callbacks suppressed
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_chipcmd_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_ephyar_cond == 1 (loop: 100, delay: 10).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:43 pve1 kernel: r8169 0000:01:00.0 enp1s0: rtl_eriar_cond == 1 (loop: 100, delay: 100).
Oct 06 12:13:53 pve1 kernel: net_ratelimit: 9 callbacks suppressed
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!