Random restarts - prx kernel: mpt3sas_cm0: overriding NVDATA EEDPTagMode setting

facto

New Member
Apr 23, 2024
2
0
1
This is a fresh install of proxmox with zero changes.

Code:
journalctl -p err
May 20 16:46:26 prx kernel: mpt3sas_cm0: overriding NVDATA EEDPTagMode setting
May 20 16:46:39 prx pvecm[1479]: got inotify poll request in wrong process - disabling inotify

I am only assuming this is the problem. In my last install I had an unraid VM that would crash after only a few minutes and sometimes crash proxmox with it and this kernel error would appear right before the crash. I had previously passed thru the HD controller mpt3sas and assumed this was the issue. I then removed the controller from the VM but nothing changed. I decided a fresh install would fix everything but no dice.

Without the VM running proxmox would go sometimes 3 hours without it crashing and restarting.

Im new to proxmox and linux. Any help is appreciated.

Code:
inxi -Fxz
System:
  Kernel: 6.8.4-2-pve arch: x86_64 bits: 64 compiler: gcc v: 12.2.0 Console: pty pts/0
    Distro: Debian GNU/Linux 12 (bookworm)
Machine:
  Type: Desktop System: Dell product: Precision Tower 7910 v: N/A serial: <filter>
  Mobo: Dell model: 0NK5PH v: A00 serial: <filter> UEFI: Dell v: A34 date: 10/19/2020
CPU:
  Info: 2x 6-core model: Intel Xeon E5-2620 v3 bits: 64 type: MT MCP SMP arch: Haswell rev: 2
    cache: L1: 2x 384 KiB (768 KiB) L2: 2x 1.5 MiB (3 MiB) L3: 2x 15 MiB (30 MiB)
  Speed (MHz): avg: 3199 high: 3200 min/max: 1200/3200 cores: 1: 3200 2: 3200 3: 3200 4: 3200
    5: 3200 6: 3200 7: 3200 8: 3200 9: 3200 10: 3200 11: 3200 12: 3200 13: 3200 14: 3200 15: 3200
    16: 3193 17: 3200 18: 3200 19: 3200 20: 3200 21: 3200 22: 3200 23: 3200 24: 3200
    bogomips: 114933
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Graphics:
  Device-1: NVIDIA GF119 [NVS 310] vendor: Hewlett-Packard driver: nouveau v: kernel arch: Fermi
    bus-ID: 03:00.0 temp: 56.0 C
  Display: server: No display server data found. Headless machine? tty: 110x40
    resolution: 3440x1440
  API: OpenGL Message: GL data unavailable in console for root.
Audio:
  Device-1: Intel C610/X99 series HD Audio vendor: Dell driver: snd_hda_intel v: kernel
    bus-ID: 00:1b.0
  Device-2: NVIDIA GF119 HDMI Audio vendor: Hewlett-Packard driver: snd_hda_intel v: kernel
    bus-ID: 03:00.1
  API: ALSA v: k6.8.4-2-pve status: kernel-api
Network:
  Device-1: Intel Ethernet I217-LM vendor: Dell driver: e1000e v: kernel port: 9020
    bus-ID: 00:19.0
  IF: enp0s25 state: down mac: <filter>
  Device-2: Intel I210 Gigabit Network vendor: Dell driver: igb v: kernel port: 6000
    bus-ID: 09:00.0
  IF: enp9s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  Device-3: Intel 82571EB/82571GB Gigabit Ethernet D0/D1 driver: e1000e v: kernel port: 5020
    bus-ID: 0c:00.0
  IF: enp12s0f0 state: down mac: <filter>
  Device-4: Intel 82571EB/82571GB Gigabit Ethernet D0/D1 driver: e1000e v: kernel port: 5000
    bus-ID: 0c:00.1
  IF: enp12s0f1 state: down mac: <filter>
  IF-ID-1: bonding_masters state: N/A speed: N/A duplex: N/A mac: N/A
  IF-ID-2: vmbr0 state: up speed: 1000 Mbps duplex: unknown mac: <filter>
Drives:
  Local Storage: total: 4.13 TiB lvm-free: 16 GiB used: 2.9 GiB (0.1%)
  ID-1: /dev/nvme0n1 vendor: SK Hynix model: HFM256GDJTNG-8310A size: 238.47 GiB temp: 53.9 C
  ID-2: /dev/sda vendor: Seagate model: ST1000DM003-1SB102 size: 931.51 GiB
  ID-3: /dev/sdb vendor: Western Digital model: WD10EZEX-00BN5A0 size: 931.51 GiB
  ID-4: /dev/sdc vendor: Crucial model: CT250MX200SSD1 size: 232.89 GiB
  ID-5: /dev/sdd vendor: Seagate model: ST2000DM006-2DM164 size: 1.82 TiB
  ID-6: /dev/sde type: USB vendor: SanDisk model: USB 3.2Gen1 size: 28.67 GiB
Partition:
  ID-1: / size: 67.73 GiB used: 2.89 GiB (4.3%) fs: ext4 dev: /dev/dm-1 mapped: pve-root
  ID-2: /boot/efi size: 1022 MiB used: 11.6 MiB (1.1%) fs: vfat dev: /dev/nvme0n1p2
Swap:
  ID-1: swap-1 type: partition size: 8 GiB used: 0 KiB (0.0%) dev: /dev/dm-0 mapped: pve-swap
Sensors:
  System Temperatures: cpu: 49.0 C mobo: N/A sodimm: SODIMM C gpu: nouveau temp: 56.0 C
  Fan Speeds (RPM): cpu: 999 gpu: nouveau fan: 2940
Info:
  Processes: 355 Uptime: 16m Memory: 31.31 GiB used: 1.88 GiB (6.0%) Init: systemd
  target: graphical (5) Compilers: N/A Packages: 764 Shell: Bash v: 5.2.15 inxi: 3.3.26

As you can see from the attached screenshot proxmox ran for almost 4 hours before crashing and restarting.
 

Attachments

  • prox 3plus hours before restart Screenshot 2024-05-20 212116.png
    prox 3plus hours before restart Screenshot 2024-05-20 212116.png
    30.3 KB · Views: 3
Last edited:
This is a fresh install of proxmox with zero changes.

Code:
journalctl -p err
May 20 16:46:26 prx kernel: mpt3sas_cm0: overriding NVDATA EEDPTagMode setting
May 20 16:46:39 prx pvecm[1479]: got inotify poll request in wrong process - disabling inotify

I am only assuming this is the problem. In my last install I had an unraid VM that would crash after only a few minutes and sometimes crash proxmox with it and this kernel error would appear right before the crash. I had previously passed thru the HD controller mpt3sas and assumed this was the issue. I then removed the controller from the VM but nothing changed. I decided a fresh install would fix everything but no dice.

Without the VM running proxmox would go sometimes 3 hours without it crashing and restarting.

Im new to proxmox and linux. Any help is appreciated.

Code:
inxi -Fxz
System:
  Kernel: 6.8.4-2-pve arch: x86_64 bits: 64 compiler: gcc v: 12.2.0 Console: pty pts/0
    Distro: Debian GNU/Linux 12 (bookworm)
Machine:
  Type: Desktop System: Dell product: Precision Tower 7910 v: N/A serial: <filter>
  Mobo: Dell model: 0NK5PH v: A00 serial: <filter> UEFI: Dell v: A34 date: 10/19/2020
CPU:
  Info: 2x 6-core model: Intel Xeon E5-2620 v3 bits: 64 type: MT MCP SMP arch: Haswell rev: 2
    cache: L1: 2x 384 KiB (768 KiB) L2: 2x 1.5 MiB (3 MiB) L3: 2x 15 MiB (30 MiB)
  Speed (MHz): avg: 3199 high: 3200 min/max: 1200/3200 cores: 1: 3200 2: 3200 3: 3200 4: 3200
    5: 3200 6: 3200 7: 3200 8: 3200 9: 3200 10: 3200 11: 3200 12: 3200 13: 3200 14: 3200 15: 3200
    16: 3193 17: 3200 18: 3200 19: 3200 20: 3200 21: 3200 22: 3200 23: 3200 24: 3200
    bogomips: 114933
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Graphics:
  Device-1: NVIDIA GF119 [NVS 310] vendor: Hewlett-Packard driver: nouveau v: kernel arch: Fermi
    bus-ID: 03:00.0 temp: 56.0 C
  Display: server: No display server data found. Headless machine? tty: 110x40
    resolution: 3440x1440
  API: OpenGL Message: GL data unavailable in console for root.
Audio:
  Device-1: Intel C610/X99 series HD Audio vendor: Dell driver: snd_hda_intel v: kernel
    bus-ID: 00:1b.0
  Device-2: NVIDIA GF119 HDMI Audio vendor: Hewlett-Packard driver: snd_hda_intel v: kernel
    bus-ID: 03:00.1
  API: ALSA v: k6.8.4-2-pve status: kernel-api
Network:
  Device-1: Intel Ethernet I217-LM vendor: Dell driver: e1000e v: kernel port: 9020
    bus-ID: 00:19.0
  IF: enp0s25 state: down mac: <filter>
  Device-2: Intel I210 Gigabit Network vendor: Dell driver: igb v: kernel port: 6000
    bus-ID: 09:00.0
  IF: enp9s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  Device-3: Intel 82571EB/82571GB Gigabit Ethernet D0/D1 driver: e1000e v: kernel port: 5020
    bus-ID: 0c:00.0
  IF: enp12s0f0 state: down mac: <filter>
  Device-4: Intel 82571EB/82571GB Gigabit Ethernet D0/D1 driver: e1000e v: kernel port: 5000
    bus-ID: 0c:00.1
  IF: enp12s0f1 state: down mac: <filter>
  IF-ID-1: bonding_masters state: N/A speed: N/A duplex: N/A mac: N/A
  IF-ID-2: vmbr0 state: up speed: 1000 Mbps duplex: unknown mac: <filter>
Drives:
  Local Storage: total: 4.13 TiB lvm-free: 16 GiB used: 2.9 GiB (0.1%)
  ID-1: /dev/nvme0n1 vendor: SK Hynix model: HFM256GDJTNG-8310A size: 238.47 GiB temp: 53.9 C
  ID-2: /dev/sda vendor: Seagate model: ST1000DM003-1SB102 size: 931.51 GiB
  ID-3: /dev/sdb vendor: Western Digital model: WD10EZEX-00BN5A0 size: 931.51 GiB
  ID-4: /dev/sdc vendor: Crucial model: CT250MX200SSD1 size: 232.89 GiB
  ID-5: /dev/sdd vendor: Seagate model: ST2000DM006-2DM164 size: 1.82 TiB
  ID-6: /dev/sde type: USB vendor: SanDisk model: USB 3.2Gen1 size: 28.67 GiB
Partition:
  ID-1: / size: 67.73 GiB used: 2.89 GiB (4.3%) fs: ext4 dev: /dev/dm-1 mapped: pve-root
  ID-2: /boot/efi size: 1022 MiB used: 11.6 MiB (1.1%) fs: vfat dev: /dev/nvme0n1p2
Swap:
  ID-1: swap-1 type: partition size: 8 GiB used: 0 KiB (0.0%) dev: /dev/dm-0 mapped: pve-swap
Sensors:
  System Temperatures: cpu: 49.0 C mobo: N/A sodimm: SODIMM C gpu: nouveau temp: 56.0 C
  Fan Speeds (RPM): cpu: 999 gpu: nouveau fan: 2940
Info:
  Processes: 355 Uptime: 16m Memory: 31.31 GiB used: 1.88 GiB (6.0%) Init: systemd
  target: graphical (5) Compilers: N/A Packages: 764 Shell: Bash v: 5.2.15 inxi: 3.3.26

As you can see from the attached screenshot proxmox ran for almost 4 hours before crashing and restarting.
I also see this message prx kernel: mpt3sas_cm0: overriding NVDATA EEDPTagMode setting on boot in journalctl and if I have the truenas VM start, entire proxmox would crash randomly and without the VM being on, proxmox would be stable.

Did you find a solution?