Hey everyone, this is my first post! I've had one node running 24/7 (with probably weekly restarts) for about four months or so, and have not had any problems.
Today, I have had two crashes. Last night, I updated all of the VMs with Ubuntu 18.04 updates that rolled out.
I thought maybe it was related to today's Proxmox updates that I installed, but those were at 9:46AM and the crash occurred around 9:22AM or so.
I looked through all Proxmox logs (syslog, kernel, apt, auth, etc) right before the crash, and found nothing. I looked through 7 VMs and their logs, and found nothing. Two are webservers, with nginx and basically have holding pages. One is a MYSQL server. One is a mail server, and one is a pfsense server, and one is an nginx reverse proxy, and one is a pihole server.
The second crash was at 5:55PM or so. Turned it back on around 7:22PM and it's been running since.
There is no sign from the charts on the node Summary page in Proxmox that the CPU was running high, or high use of RAM, or I/O, etc. I am running Proxmox 5.2-5.
As far as hardware, it's pretty simple with a Gigabyte motherboard using an AMD FX8350 processor with 32GB of RAM and two SSDs running normally without RAID.
The symptoms when I have found it crashed are: the power light on the power button is still lit. The entire unit is "off". No fans running, no network cards are blinking. If I press the power button, it won't force off. I have to shutdown the power supply and then switch it back to on, then power on the unit.
It is main-powered through a battery backup unit, and I checked the unit and it has no problems.
I did read about setting up a kernel log, but I thought I would post and see if anyone would like to troubleshoot with me. I'm thinking there is something I might find where I haven't looked, because I don't know it's there.
EDIT: Just crashed again at 9:55pm and I was there when it happened. It just shuts down.
Today, I have had two crashes. Last night, I updated all of the VMs with Ubuntu 18.04 updates that rolled out.
I thought maybe it was related to today's Proxmox updates that I installed, but those were at 9:46AM and the crash occurred around 9:22AM or so.
I looked through all Proxmox logs (syslog, kernel, apt, auth, etc) right before the crash, and found nothing. I looked through 7 VMs and their logs, and found nothing. Two are webservers, with nginx and basically have holding pages. One is a MYSQL server. One is a mail server, and one is a pfsense server, and one is an nginx reverse proxy, and one is a pihole server.
The second crash was at 5:55PM or so. Turned it back on around 7:22PM and it's been running since.
There is no sign from the charts on the node Summary page in Proxmox that the CPU was running high, or high use of RAM, or I/O, etc. I am running Proxmox 5.2-5.
As far as hardware, it's pretty simple with a Gigabyte motherboard using an AMD FX8350 processor with 32GB of RAM and two SSDs running normally without RAID.
The symptoms when I have found it crashed are: the power light on the power button is still lit. The entire unit is "off". No fans running, no network cards are blinking. If I press the power button, it won't force off. I have to shutdown the power supply and then switch it back to on, then power on the unit.
It is main-powered through a battery backup unit, and I checked the unit and it has no problems.
I did read about setting up a kernel log, but I thought I would post and see if anyone would like to troubleshoot with me. I'm thinking there is something I might find where I haven't looked, because I don't know it's there.
EDIT: Just crashed again at 9:55pm and I was there when it happened. It just shuts down.
Last edited: