I run Proxmox on a miniPC (intel n5105, 16GB ram,eth connected, 2x2tb disks in a zfs pool) as a home server with nothing fancy - a HomeAssistant VM and couple of lxc containers with Samba and a media server and some web services on them.
At seemingly random times the node would become unresponsive - webUI does not resolve and SSH is not possible, but would respond to ping.
There are no errors in the syslog, in fact there are no records in the logs it seams after such an event happens until I hard restart the machine.
These events are not associated with peak usage of any of the services and are not regular in any way, sometime it would go a couple of weeks before happening, sometimes just a few hours.
There is no resource shortage ever on anything, no load peaks above 50% on CPU and RAM.
Weirdly enough I just found out that the Samba is working and services on that container can be resolved, while other services are inaccessible. I connected a monitor to the server and it spams a PCIe bus error: severity uncorrected (non-fatal) ACSViol (First).
The lack of logs has me stumped. I'm not a particularly confident admin user but, I feel like I've read through all the possible search threads here and am completely out of ideas on what to try. Any thoughts are welcome.
At seemingly random times the node would become unresponsive - webUI does not resolve and SSH is not possible, but would respond to ping.
There are no errors in the syslog, in fact there are no records in the logs it seams after such an event happens until I hard restart the machine.
These events are not associated with peak usage of any of the services and are not regular in any way, sometime it would go a couple of weeks before happening, sometimes just a few hours.
There is no resource shortage ever on anything, no load peaks above 50% on CPU and RAM.
Weirdly enough I just found out that the Samba is working and services on that container can be resolved, while other services are inaccessible. I connected a monitor to the server and it spams a PCIe bus error: severity uncorrected (non-fatal) ACSViol (First).
The lack of logs has me stumped. I'm not a particularly confident admin user but, I feel like I've read through all the possible search threads here and am completely out of ideas on what to try. Any thoughts are welcome.