VM freezes irregularly

1661449448177.png
1661449457360.png

After 23h of uptime, my pfsense crashed again.
My ubuntu 22.04 VM crashed at 07:30 and my Windows Server 2022 VM, crashed at 07:04 in the morning.
 
My VM is freezing upon soft reboot on every kernel, 5.19.3 included. I thought it might be the same issue as the freeze during normal work, but maybe it's a separate one.
Can you let us know if your VMs freeze during normal operation (not rebooting) on 5.19.3? Try to keep the VMs running as long as you can without rebooting.

If you have stability over a week, this is a good sign and I might try and repackage the 5.19.3 kernel for testing on Proxmox.
 
If anyone's interested in running the 5.19.3 mainline kernel on Proxmox, I've repackaged the deb files and removed the zst compression. You can download the repackaged kernel here:

Edit: link removed - there's newer Proxmox kernels available in the post below.
 
Last edited:
I confirm, I also came back under proxmox since esxi was also crashing. I installed the latest updates and there, everything has been working for me for 1 day and 20 hours now, gold than before, everything freezes several times on the same day
 
  • Like
Reactions: gyrex
I've been running my host with Proxmox Edge kernel 5.19.4 for almost two days with no crashes, whereas with any other kernel my OPNSense and Home Assistant VMs would crash every 4-5 hours. Here's the link if you want to try for yourselves:
https://github.com/fabianishere/pve-edge-kernel
I had no idea these kernels existed. It would have been nice to have known when we were troubleshooting.
 
  • Like
Reactions: BarTouZ
I confirm, I also came back under proxmox since esxi was also crashing. I installed the latest updates and there, everything has been working for me for 1 day and 20 hours now, gold than before, everything freezes several times on the same day
Are you running the edge kernel? I'll try switching back to Proxmox today and run the edge kernel too.
 
I did the updates from Proxmox, which offered me the installation of PVE 5.15..39-4.

Since the 2 test VMs are running correctly and quite well I must say...
But I would like it to run for at least 1 week to be able to decide...

ut in any case, for the moment, I touch wood, it is promising...

1661661954979.png

1661662119496.png
 
  • Like
Reactions: rRobbie
OK, so I've moved my VMs back to Proxmox and I've installed the edge 5.19.4 kernel

Code:
root@pve:~# uname -a
Linux pve 5.19.4-edge #1 SMP PREEMPT_DYNAMIC PVE Edge 5.19.4-1 (2022-08-25) x86_64 GNU/Linux

@fabian In case there's an issue with this kernel, is there any kind of deep debugging we can do to try and nail down this issue? I've set up netconsole & kernel debugging from within the VM from this page: https://wiki.ubuntu.com/Kernel/Netconsole . Is there anything else we can do to capture this panic and provide as much information for you?
 
I just installed the edge kernel .4 this morning and so far up 6 hours no vm reboots. Will see how far I get. Then do some stress testing and report back. Definitely an improvement so far.
 
I did the updates from Proxmox, which offered me the installation of PVE 5.15..39-4.
I was using the same kernel as yours, but my old CentOS 7 vm failed (crashes every few hours.)

I'm now on Fabian's 5.19 version and I'll report back.
 
I was using the same kernel as yours, but my old CentOS 7 vm failed (crashes every few hours.)

I'm now on Fabian's 5.19 version and I'll report back.
My 2 VMs that crashed/rebooted every 1-2 hours are a Cent7 and Fedora 36 both running newest epel kernel-5. Doesn't appear to be the kernels inside the VMs but the proxmox kernel.. Going strong now 14 hours. We'll see in a day or 2 if it's stable long term.
 
Has anyone had any freezes on the edge 5.19x kernel? My VMs have been up for 18 hours so far without issue.
 
1661746878323.png

For my part, I have 2 VMs out of the 3 that hold up. I have my "IoT" VM which went wrong 2x with an uptime of +- 5-6h.
Here, I relaunched it yesterday and it seems to hold.

1661746957543.png
1661746971800.png

The "System" VM is holding up, it's been running for as long as the host... So even if it's not 100% reliable yet in 5.15.39-4, I remain positive because there is an improvement.

1661746909928.png
1661746935078.png

I will also try the kernel edge and give my feelings.
 
  • Like
Reactions: rRobbie
So far so good. My 3 VMs (cent7 and fedora36) have been up 14 hours now. Prior I was lucky to get 2 hours before reboot/lock up. Will update tomorrow when uptime is > 1 day.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!