Proxmox 8 hangs after upgrade from 7

mhadzi

New Member
Dec 23, 2024
6
2
3
Hello all,

I would appreciate any help I can get. I’m new to Proxmox, so step-by-step instructions would be greatly appreciated.

I recently upgraded an older server that worked fine with version 7. It’s a SuperMicro X11SSH-F/-LN4F. However, the server no longer stays active long enough for me to send any commands to fix the issues. It freezes at random times—sometimes during login and other times even before that. The only way to do anything is to power it down and back up again, but even then, I can’t fix anything before it freezes again. So far, I’ve had no luck.

I’ve also noticed that whenever my server is connected to my home network, the entire network crashes, and I can’t access any other computers on the network.

Here’s what I’ve tried so far:

• Running the recommended network repair commands from the wiki, but the server freezes before I can complete them.

• Deleting ntpdate (as suggested in a forum), but this didn’t fix the problem.

• Booting the server with an older kernel—same freezing issue.

• Starting in Rescue mode, but it still freezes without showing any error messages.

• Using a GParted Live CD, but I’m unsure what steps to take from there.

This is all over my head, and I’d like to avoid a fresh install of VE6 or VE7, as those versions worked fine. Any guidance to rescue the environment would be greatly appreciated!
 
Would try to exchange the os disk to any other or a usb disk and install pve 8 onto. Evaluate if you have any hw error then or it's related to your old os disk system.
 
Thank you for your response and recommendation.

I installed Proxmox 8 onto a USB drive and tested it on another computer to confirm that the USB works properly, and it does. However, when I tried booting the USB on the server, the results were the same.

I attempted both the regular Proxmox boot option and the rescue mode, but the server still hangs or freezes at random times. Sometimes it freezes before the login page appears, and other times it freezes after loading and logging in.

Can you think of anything else I can try to resolve this issue?
 
So there's somethink with your hardware and it's not related to your os disk nor the pve os itself.
Run memtest for few hours (it's installed with pve in /boot by default lso).
 
Hi again,
I ran a memory test on the server for about 11 hours, completing 4 full passes without any errors.
Do you have any other recommendations on what I can try next?
 
Its possible that the NW NIC/cable on the older server is faulty - hence you are seeing a total crash with even your general NW going down.

I'd try the following:

1. Try swapping out the cables.

2. Swapping out that NW NIC for a different one. Maybe you can even attach a USB to ETH one & disable the current one.

3. Try a regular Debian install & see how that server behaves.
 
  • Like
Reactions: waltar
I wanted to provide an update. I attempted another boot from the hard drive using the older kernel 5.15.158-2-pve. This time, I received a message stating that a PVE group was found and would be used. I believe that booting from the USB where I installed Proxmox may have made some changes that helped resolve part of the issue.

The server has now been running for over 12 hours without freezing. After doing some research, I found instructions to pin the working kernel, which I followed. However, when I ran the refresh command, I received an error stating that nothing would be changed or modified.

I have not rebooted the server yet. Before I do, I’m wondering if there’s anything else I should update, install, or change to prevent any further issues.

Any recommendations would be greatly appreciated. Thank you again for your help!
 
I wanted to give another update. I had connected the server to an old smart plug, which I had forgotten was set to auto-reboot on Saturday mornings. This morning, the server rebooted automatically, but it froze again and won’t boot up, just like before.

I’ve tried everything I can think of:

• Booting with both old and new kernels.

• Using the rescue mode command.

• Updating the BIOS and server firmware.

• Using a USB-to-network adapter.

Unfortunately, none of these attempts have worked, and the server continues to freeze.

Any additional suggestions or insights would be greatly appreciated.
 
Could be the cpu itself. That's a change one by one try and error repair. Good luck.
 
I just wanted to provide an update in case someone else encounters this issue. I was able to get in touch with Supermicro support, and they were able to help me. Below are the instructions they provided, and I can confirm that this worked, as I was once again able to upgrade to Proxmox 8.

“You can try disabling internal graphics in BIOS:
Advanced -> Chipset Configuration -> System Agent (SA) Configuration -> Graphics Configuration -> Internal Graphics -> Disable.”
 
@mhadzi Hi.
I recently upgraded Proxmox from version 7 to 8 as well and encountered similar behavior.
The server is hosted by a cloud provider, so I don’t have direct access to the BIOS settings.

Thank you for sharing this insight on the topic — I’ll try to have the integrated graphics disabled via technical support and will post an update regarding this issue.

Perhaps there's already a bug report — could someone kindly share a link if available?