I have a couple of PCs running Proxmox - but I am far from a Linux guru.
I recently installed immich in an Ubuntu VM. It has a high workload importing a few hundred thousand photos and I have seen my Proxmox machine “crash”.
By crash, I mean become unresponsive across all VMs and the PVE web interface. It doesn’t respond to ping.
I would have blamed it on overheating, but the PC does have intel vPro (a cheap man’s ipmi) and I can remotely control it. Using vPro, I can access the CLI and log in. This is the surprising bit! I can run commands, but ping from PVE doesn’t reach anywhere. After rebooting, netstat shows that the CPU went to near zero when it “crashed”, but obviously it was still collecting some data.
Rebooting the machine from the CLI makes things work again, usually for a few hours until it happens again.
How can I find out what the problem is?
I recently installed immich in an Ubuntu VM. It has a high workload importing a few hundred thousand photos and I have seen my Proxmox machine “crash”.
By crash, I mean become unresponsive across all VMs and the PVE web interface. It doesn’t respond to ping.
I would have blamed it on overheating, but the PC does have intel vPro (a cheap man’s ipmi) and I can remotely control it. Using vPro, I can access the CLI and log in. This is the surprising bit! I can run commands, but ping from PVE doesn’t reach anywhere. After rebooting, netstat shows that the CPU went to near zero when it “crashed”, but obviously it was still collecting some data.
Rebooting the machine from the CLI makes things work again, usually for a few hours until it happens again.
How can I find out what the problem is?