AMD also has microcode updates...
On my AMD CPU? Seems like a good idea! ;-)
vendor_id : AuthenticAMD
cpu family : 23
model : 96
model name : AMD Ryzen 7 4800H with Radeon Graphics
stepping : 1
microcode : 0x8600106
I am not using this crontab at all, and the CPU governor is set to the default. I never changed it.
Not 100% sure that it is related, but I had a crontab entry for setting the CPU governor to `performance` after every reboot. I removed the entry yesterday and set the governor to `powersave` instead. So far it has been running without interruptions.
I used a tteck helper script for setting up the crontab. This one: https://github.com/tteck/Proxmox/blob/main/misc/scaling-governor.sh.
Edit: I also have the microcode for Intel installed.
# cat /sys/devices/system/cpu/cpu0/cpufreq/scaling_governor
performance
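The check above only reads `cpu0`. A minimal sketch for summarising the governor across all cores, assuming the standard cpufreq sysfs layout; the function name `governor_summary` and its optional directory argument are made up for illustration:

```shell
# Summarise the scaling governor across all CPU cores.
# $1 is the sysfs CPU directory (hypothetical parameter, defaults to the
# usual location); prints one line per governor with the number of cores.
governor_summary() {
    cpu_dir="${1:-/sys/devices/system/cpu}"
    cat "$cpu_dir"/cpu[0-9]*/cpufreq/scaling_governor 2>/dev/null |
        sort | uniq -c | awk '{ print $2 ": " $1 " core(s)" }'
}

# Usage on the host itself:
#   governor_summary
```

If some cores report a different governor than others, something (a crontab entry, for example) probably changed it only partially or at a different time.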
I used this script: https://github.com/tteck/Proxmox/blob/main/misc/microcode.sh.
How did you update microcode?
Tried it myself. It didn't do anything:
# journalctl -k | grep -E "microcode" | head -n 1
Oct 04 08:22:21 proxmox kernel: Zenbleed: please update your microcode for the most optimal fix
GRUB_CMDLINE_LINUX_DEFAULT="quiet consoleblank=0 nox2apic"
GRUB_CMDLINE_LINUX="root=ZFS=rpool/ROOT/pve-1 boot=zfs consoleblank=0 nox2apic"
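Entries in `/etc/default/grub` only take effect after the GRUB config is regenerated and the host is rebooted, so it can help to check whether a parameter such as `nox2apic` actually reached the running kernel. A small sketch, assuming the parameters of the running kernel are in `/proc/cmdline`; the helper name `has_kernel_param` is hypothetical:

```shell
# Return success if kernel parameter $1 appears in the command line file $2
# (defaults to /proc/cmdline, the parameters of the running kernel).
has_kernel_param() {
    # One parameter per line, then look for an exact, literal match.
    tr ' ' '\n' < "${2:-/proc/cmdline}" | grep -qxF -- "$1"
}

# Usage on the host itself:
#   has_kernel_param nox2apic && echo "nox2apic is active"
```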
I have had severe problems with my N305 host since I upgraded the kernel. I actually thought the hardware was broken and bought a new barebone (not yet arrived). Now I read this and it almost makes me cry. I'm having spontaneous reboots/resets, without any logs or anything.
Interesting, since 6.8.12-X my N305 server also hangs, with no output on HDMI and no reaction to the keyboard.
Would updating the microcode solve these freezes? I have them too now, while the server was always stable until the latest update.
My CPU is actually the same as from this guy:
https://www.reddit.com/r/linux/comments/15xvpfg/updating_your_amd_microcode_in_linux/
But the instructions are too complicated for me to follow. I do not even know if I have the latest microcode installed:
Code:
vendor_id : AuthenticAMD
cpu family : 23
model : 96
model name : AMD Ryzen 7 4800H with Radeon Graphics
stepping : 1
microcode : 0x8600106
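The `microcode` field in that output is the revision currently loaded on the CPU. A minimal sketch for pulling it out, assuming the usual `/proc/cpuinfo` format; the function name `microcode_revisions` is made up for illustration. Comparing the value before and after installing a microcode package and rebooting shows whether the update was actually applied:

```shell
# Print the distinct microcode revisions found in cpuinfo-formatted input.
# $1 is the file to read (defaults to /proc/cpuinfo).
microcode_revisions() {
    awk -F': *' '/^microcode/ { print $2 }' "${1:-/proc/cpuinfo}" | sort -u
}

# Usage on the host itself:
#   microcode_revisions
```

This only reports what is loaded; whether a newer revision exists for a given CPU has to be checked against the vendor or distribution packages.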
So what is the official workaround? There are two options from this page https://www.thomas-krenn.com/de/wiki/Known_Issues_Proxmox_VE_8.2.
Downgrade kernel to 6.5 or make changes for 6.8. But for this second option I do not even have the file /etc/kernel/cmdline.
So what do you suggest? I had two freezes in last 3 days. Kernel downgrade?
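On the missing `/etc/kernel/cmdline`: as far as I know, Proxmox reads the kernel command line from `/etc/kernel/cmdline` on systemd-boot installs and from `/etc/default/grub` on GRUB installs, so a missing file usually just means the host boots via GRUB. A rough sketch of that check, assuming file presence is a good enough hint (`proxmox-boot-tool status` is the more reliable source); the function name `cmdline_source` and its root-prefix argument are hypothetical:

```shell
# Report which kernel command line file this host appears to use.
# $1 is an optional root prefix, only useful for testing (default: empty).
# Assumption: the presence of /etc/kernel/cmdline indicates systemd-boot.
cmdline_source() {
    root="${1:-}"
    if [ -f "$root/etc/kernel/cmdline" ]; then
        echo "systemd-boot: edit /etc/kernel/cmdline, then run proxmox-boot-tool refresh"
    elif [ -f "$root/etc/default/grub" ]; then
        echo "GRUB: edit GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub, then run update-grub"
    else
        echo "unknown: neither file found"
    fi
}

# Usage on the host itself:
#   cmdline_source
```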
Since many companies switched to Proxmox, the Proxmox staff has a lot to do with migrations and support for these companies. Unfortunately, I have the feeling you only get support from them by buying a license… the golden times are over :/
Where is Proxmox Support? With all respect to those who have contributed to this thread, there is no reasonable solution I can detect that I would attempt on a subscribed production cluster. It annoys me that Proxmox is leaving it to the community to sort out their bugs, and it doesn't give me confidence that any advertised proxmox-kernel update should be attempted on a stable cluster.
You are also using the B650D4U? The board seems to be the problem. Many people switched to the H13SAE-MF from supermicro and the problem with sudden reboots (mine) has been solved.
Really disappointed by ASRock, and the seller does not seem to want to take the board back. I'm on the third exchange board and don't trust it anymore. The X570D4U (AM4), on the other hand, is rock solid!
After my second board started randomly rebooting after several weeks, I had enough.
Yep, I'm using this board, but this issue also appears with other boards. Also, with Proxmox 7 I had no issues, so I don't think it's the board. The board has some issues, but you can fix them and then this issue is solved.
I also know people using Supermicro and older Intel systems with the same issues since Proxmox 8.
I'll keep that in mind, thanks! Just as I was writing my message, I had a server with that board that was unresponsive. I needed to reset the IPMI, otherwise a reboot didn't work, haha…
Also other people mention that even H5 REV 4.01 has the Post-Code-00-Problem...
I don't know.
Interesting. I'm using the same (40G) with this system.
Currently using a Mellanox CX3 Pro. Has anyone else had a similar issue?