Proxmox Host Hard Freezing - Memtest Clean

Solinus

Member
Dec 30, 2021
13
0
6
40
Having issues with one of the nodes in one cluster, and a node in another cluster. I migrated from Hyper-V to Proxmox in December 2021. I never had issues at all with Hyper-V and didn't change my hardware at all when I rebuild each of the clusters.

My problem is that one of the nodes will freeze. Sometimes it takes a matter of hours other times days. I did manage to get one of the servers to stay online without issues for about 45 days until I updated the nodes and now we're back to freezing. One of the nodes is especially egregious while the other node has only frozen once or twice since migrating. All my other nodes run perfectly and without issue. Both servers are completely different models and manufacturers so there's nothing common there between them. I did run both the built in memtest and one from a USB and had zero errors after running for days and multiple passes. I don't know what else to check. Logs don't appear at first glance to show much, but I'll admit I'm not sure what I should be looking for. Any one have any ideas? I highly doubt it's hardware as this only happened after migrating to Proxmox. Again Hyper-V never had any issues and my servers were on at times for hundred of days.

On the physical machine the console is on screen but completely locked. I can still reach the iKVM/IPMI/iDRAC and reboot the server remotely, thankfully, but I'd like to figure out why this is happening and fix it.

Any ideas?
 
Last edited:
Hi,
are there any errors in the syslog or in journalctl -b -1?