[SOLVED] [SOLVED] Random crashes with Elitedesk 800 G4 after upgrade to 8.2

user_001

New Member
Sep 23, 2024
2
1
3
HI all,

I was running into random crashes after updating proxmox to 8.2 on my 3 nodes cluster (running Elitedesk 800 G4).
I was planning to ask a question on the forum, but after struggling for more than 2 days on it I have managed to solve it (hopefully).
So now, I just share my solution (with debugging steps) in the hope it will help some of you.

1. Problem description

I run a 3 nodes cluster with each node being identical (HP Elitedesk 800 G4, 16GB memory).
After I have upgraded to 8.2 (for 7.15), I ran into weird "crashes": Some nodes were randomly restarting.
So I look into the forum and all, but could not find a remedy.

2. Problem identification

I was thinking it might be linked to a power issue, but since it was happening on the 3 nodes randomly, I could not have 3 different issues with transformers.
To clear the power management issue, I decided to reduce the load on 1 server: transfered the instances that was running on it on the 2 different nodes and look at what happened.

Result: this node was running not longer than 20 minutes until reboot. Nothing in the logs, nothing in the kernel logs, nothing in the journctl...
Looked everywhere. But it was clearly linked to power management.


3. Problem solution

In my particular case, I had to change the max cstate to 7 in nano /etc/default/grub

GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on intel_idle.max_cstate=7 i915.enable_dc=0 ahci.mobile_lpm_policy=1"

Then do a update-grub, followed by proxmox-boot-tool refresh and then reboot.

In my case, the 20 minutes reboot server is now online for more than 24H.

I hope it helps
 

Attachments

  • chrome_iHtKm8GLDp.png
    chrome_iHtKm8GLDp.png
    82.5 KB · Views: 7
  • chrome_76SHjIM549.png
    chrome_76SHjIM549.png
    40.6 KB · Views: 7
  • Like
Reactions: efrej
Hey man, just wanted to say that I really appreciate this post. I had exactly the same issue on a HP Elitedesk 800 G4 and spent hours trying to resolve the issue to no success. Then I did exactly what you described above and the problem was gone! Thanks a lot for making this guide on how to resolve the issue :cool:
 
Hey man, just wanted to say that I really appreciate this post. I had exactly the same issue on a HP Elitedesk 800 G4 and spent hours trying to resolve the issue to no success. Then I did exactly what you described above and the problem was gone! Thanks a lot for making this guide on how to resolve the issue :cool:
No pb.
After I spent so many hours pulling my hairs out, I figured that maybe somebody else could use this little bit of feedback.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!