Hello Everyone.
I am trying to get my system working as it should for the last couple of days without success.
System is HP Elite Desk G9 I5 13500 16Gb Ram 1Tb NVME, located in Basement near POE Switch, Router without keyboard/monitor.
Latest Proxmox v9.0 iso is installed on the System and updated upgraded.
I currently have Home Assistant OS VM
Pihole LXC
Frigate LXC
Few other LXC services such as Audiobookshelf etc.
Separate QNAP Nas with 16TB hdds are running along side.
My problem is, system is unfortunately crashes and freezes really often, and randomly, sometimes after 18 hours of uptime, sometimes 2 hours of uptime.
I have managed to get this, as soon as it goes unresponsive i get hundres of this repeating one after another until I hard reset the system. It works like that for few hours then error repeates itself, it becomes inaccessable.
I have disabled all possible Cstate- powersaving options, secureboot, TPM etc in Bios.
I really do not know what to do, I found some forum post from 2019 that said kernel update resolves the issue but i am already running more updated kernel then in the forum post from 2019.
Could you guide me? I really wish to continue using the Proxmox but if it goes offline every couple of hours i need to look for another solution as i can not afford to lose Frigate-Home assistant usage in the home for extended periods of time. Thank you very much
I am trying to get my system working as it should for the last couple of days without success.
System is HP Elite Desk G9 I5 13500 16Gb Ram 1Tb NVME, located in Basement near POE Switch, Router without keyboard/monitor.
Latest Proxmox v9.0 iso is installed on the System and updated upgraded.
I currently have Home Assistant OS VM
Pihole LXC
Frigate LXC
Few other LXC services such as Audiobookshelf etc.
Separate QNAP Nas with 16TB hdds are running along side.
My problem is, system is unfortunately crashes and freezes really often, and randomly, sometimes after 18 hours of uptime, sometimes 2 hours of uptime.
I have managed to get this, as soon as it goes unresponsive i get hundres of this repeating one after another until I hard reset the system. It works like that for few hours then error repeates itself, it becomes inaccessable.
I have disabled all possible Cstate- powersaving options, secureboot, TPM etc in Bios.
I really do not know what to do, I found some forum post from 2019 that said kernel update resolves the issue but i am already running more updated kernel then in the forum post from 2019.
Could you guide me? I really wish to continue using the Proxmox but if it goes offline every couple of hours i need to look for another solution as i can not afford to lose Frigate-Home assistant usage in the home for extended periods of time. Thank you very much
Code:
Sep 23 09:14:18 pve kernel: e1000e 0000:00:1f.6 eno1: Detected Hardware Unit Hang:
TDH <64>
TDT <82>
next_to_use <82>
next_to_clean <63>
buffer_info[next_to_clean]:
time_stamp <10413a331>
next_to_watch <64>
jiffies <1042dbe40>
next_to_watch.status <0>
MAC Status <80083>
PHY Status <796d>
PHY 1000BASE-T Status <3800>
PHY Extended Status <3000>
PCI Status <10>
Sep 23 09:14:20 pve systemd-logind[893]: Power key pressed short.
Sep 23 09:14:20 pve systemd-logind[893]: Powering off...
Sep 23 09:14:20 pve systemd-logind[893]: System is powering down.
-- Reboot --