Hello,
we are seeing a number of MCE/RAS events on several Proxmox VE 9.2 hosts after upgrading from the 6.17 kernel series to the new 7.0 kernel series and would like to know whether others are observing something similar.
CPU: Model name: AMD Ryzen 9 5950X 16-Core Processor cpuid=0x00a20f10 microcode=0x0a20102e
Since upgrading to kernel 7.0.x we have seen MCE events on multiple hosts.
Example event:
2026-06-01 09:53:50 +0000 error: Uncorrected, software containable error., CPU 2, bank Unified Memory Controller V2 (bank=7), mcg mcgstatus=0, mci Error_overflow Processor_context_corrupt Poison consumed Task_context_corrupt, mcgcap=0x0000011c, status=0xffff8dad806f7b80, misc=0x10000000000000, walltime=0x6a1d56ae, cpu=0x00000001, cpuid=0x00a20f10, apicid=0x00000002, bank=0x00000007, microcode=0x0a20102e
Log:
Jun 01 09:53:50 proxmox02 kernel: mce: [Hardware Error]: Machine check events logged
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: System Fatal error.
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: CPU:1 (19:21:0) MC7_STATUS[Over|UE|MiscV|AddrV|PCC|SyndV|-|Poison|Scrub]: 0xffff8dad806f7b80
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: Error Addr: 0x0000000000000000
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: IPID: 0x0000000000000000, Syndrome: 0x0000000000000000
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: Bank 7 is reserved.
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: cache level: RESV, tx: INSN
All affected systems (5 so far) have the same CPU model (Ryzen 5950X) using the same microcode version. We are running this machines for quite a while and hadn't this kind of errors with kernel 6.x
Did anyone experience similar issues?
we are seeing a number of MCE/RAS events on several Proxmox VE 9.2 hosts after upgrading from the 6.17 kernel series to the new 7.0 kernel series and would like to know whether others are observing something similar.
- Hetzner dedicated servers
- AMD Ryzen 9 5950X
- ZFS
- Multiple independent hosts affected
- Proxmox VE 9.2.x
- Kernel: 7.0.6-2-pve
CPU: Model name: AMD Ryzen 9 5950X 16-Core Processor cpuid=0x00a20f10 microcode=0x0a20102e
Since upgrading to kernel 7.0.x we have seen MCE events on multiple hosts.
Example event:
2026-06-01 09:53:50 +0000 error: Uncorrected, software containable error., CPU 2, bank Unified Memory Controller V2 (bank=7), mcg mcgstatus=0, mci Error_overflow Processor_context_corrupt Poison consumed Task_context_corrupt, mcgcap=0x0000011c, status=0xffff8dad806f7b80, misc=0x10000000000000, walltime=0x6a1d56ae, cpu=0x00000001, cpuid=0x00a20f10, apicid=0x00000002, bank=0x00000007, microcode=0x0a20102e
Log:
Jun 01 09:53:50 proxmox02 kernel: mce: [Hardware Error]: Machine check events logged
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: System Fatal error.
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: CPU:1 (19:21:0) MC7_STATUS[Over|UE|MiscV|AddrV|PCC|SyndV|-|Poison|Scrub]: 0xffff8dad806f7b80
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: Error Addr: 0x0000000000000000
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: IPID: 0x0000000000000000, Syndrome: 0x0000000000000000
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: Bank 7 is reserved.
Jun 01 09:53:50 proxmox02 kernel: [Hardware Error]: cache level: RESV, tx: INSN
All affected systems (5 so far) have the same CPU model (Ryzen 5950X) using the same microcode version. We are running this machines for quite a while and hadn't this kind of errors with kernel 6.x
Did anyone experience similar issues?