Hi,
I’m new to the forum and running into a problem with one of my servers. We are actively moving from ESXi to Proxmox, and we have successfully converted Dell 7920 and R740 servers, which have been running perfectly. However, our latest batch of R750 servers is experiencing issues.
I’m running a Proxmox cluster with multiple nodes, and I’ve encountered an issue with one of the servers that has Intel Xeon Silver 4310 CPUs. Specifically, when the system is under load (e.g., hosting Windows VMs), I see “split lock” messages in the logs. This causes noticeable performance degradation and sluggishness, particularly in Windows 11 and Server 2022 VMs.
Here are some details about the affected node:
• Server Model: Dell R750 with Intel Xeon Silver 4310 CPUs (2 sockets, 12 cores per socket, 48 threads total)
• Proxmox Version: 8.3.2
• Kernel: Linux 6.8.12-5-pve
• CPU Flags: Split lock detection enabled (split_lock_detect flag present)
• Virtualization: Intel VT-x enabled, KVM hypervisor used
• Guest Operating Systems: Windows 11 and Windows Server 2022 VMs
• Vulnerabilities: Microcode update mitigations in place
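For anyone who wants to compare nodes on the same two points, here is a minimal Python sketch of how the flag and boot setting could be checked. It assumes the standard Linux `/proc/cpuinfo` and `/proc/cmdline` files and the `split_lock_detect=` kernel boot parameter. The function names `has_cpu_flag` and `split_lock_boot_param` are made up for illustration and are not part of any Proxmox tooling.

```python
from pathlib import Path


def has_cpu_flag(cpuinfo_text: str, flag: str = "split_lock_detect") -> bool:
    """Return True if any 'flags' line in /proc/cpuinfo-style text lists the flag."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags" and flag in value.split():
            return True
    return False


def split_lock_boot_param(cmdline_text: str):
    """Return the value given to split_lock_detect= on the kernel command line, or None."""
    for token in cmdline_text.split():
        if token.startswith("split_lock_detect="):
            return token.split("=", 1)[1]
    return None


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    cmdline = Path("/proc/cmdline")
    # Only read the files if they exist (i.e. when run on a Linux host).
    if cpuinfo.exists() and cmdline.exists():
        print("flag present:", has_cpu_flag(cpuinfo.read_text()))
        print("boot parameter:", split_lock_boot_param(cmdline.read_text()))
```

Running this on both the 4310 node and the 4210R node should show whether the flag is exposed on only one of them. A result of `None` for the boot parameter just means nothing was set explicitly, so the kernel default applies.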
Interestingly, no split lock messages appear when running a Linux VM on the same node, so the issue seems to be specific to Windows VMs.
Additional Information:
• The configuration of this server is exactly the same as another node in the cluster, which uses Intel Xeon Silver 4210R CPUs (40 threads) and is not showing any split lock warnings. Performance is stable across both nodes.
Troubleshooting I’ve tried:
• Ensured that the server and Proxmox are fully updated, including microcode updates.
• Checked kernel logs for any related errors.
• Attempted to adjust CPU pinning and virtualization options for the VMs.
• Disabled hyper-threading, but the issue persists.
Does anyone have experience with similar issues on Intel Xeon Silver 4310 CPUs under Proxmox? Specifically, is there a way to mitigate or fix these split lock traps? Could this be related to the microcode or specific kernel settings? Any suggestions on further diagnostics or configuration changes would be greatly appreciated.
Thanks in advance for your help!
Kernel log from the affected node (server1):
Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: CPU 1/KVM/1938821 took a split_lock trap at address: 0x7eedd050
Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: CPU 6/KVM/1938826 took a split_lock trap at address: 0x7eedd050
Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: CPU 5/KVM/1938825 took a split_lock trap at address: 0x7eedd050
Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: CPU 4/KVM/1938824 took a split_lock trap at address: 0x7eedd050
Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: CPU 7/KVM/1938827 took a split_lock trap at address: 0x7eedd050
Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: CPU 3/KVM/1938823 took a split_lock trap at address: 0x7eedd050
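To see which vCPU threads and guest addresses are involved, the log lines above can be tallied with a short script. This is only a sketch: it assumes the message format shown in the log excerpt, where the text before the last `/` (for example `CPU 1/KVM`) is the thread name and the number after it is the thread ID. The name `summarize_split_locks` is hypothetical.

```python
import re
from collections import Counter

# Matches e.g. "#AC: CPU 1/KVM/1938821 took a split_lock trap at address: 0x7eedd050"
TRAP_RE = re.compile(
    r"#AC: (?P<comm>.+)/(?P<tid>\d+) took a split_lock trap "
    r"at address: (?P<addr>0x[0-9a-fA-F]+)"
)


def summarize_split_locks(lines):
    """Count split lock traps per address and per (thread name, thread ID)."""
    by_addr = Counter()
    by_thread = Counter()
    for line in lines:
        match = TRAP_RE.search(line)
        if match is None:
            continue  # not a split lock message
        by_addr[match.group("addr")] += 1
        by_thread[(match.group("comm"), int(match.group("tid")))] += 1
    return by_addr, by_thread


# Two lines copied from the log excerpt in this post.
SAMPLE = [
    "Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: "
    "CPU 1/KVM/1938821 took a split_lock trap at address: 0x7eedd050",
    "Jan 20 12:13:30 server1 kernel: x86/split lock detection: #AC: "
    "CPU 6/KVM/1938826 took a split_lock trap at address: 0x7eedd050",
]

if __name__ == "__main__":
    by_addr, by_thread = summarize_split_locks(SAMPLE)
    for addr, count in by_addr.most_common():
        print(addr, count)
    for (comm, tid), count in by_thread.most_common():
        print(comm, tid, count)
```

In the excerpt, every trap comes from a different vCPU thread but hits the same address, 0x7eedd050. To run it on a full log, the output of `journalctl -k` could be passed in line by line instead of `SAMPLE`.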
lscpu on the node with split lock messages (server1):
root@server1:~# lscpu
CPU(s): 48
On-line CPU(s) list: 0-47
Vendor ID: GenuineIntel
BIOS Vendor ID: Intel
Model name: Intel(R) Xeon(R) Silver 4310 CPU @ 2.10GHz
Virtualization features:
Virtualization: VT-x
Caches (sum of all):
L1d: 1.1 MiB (24 instances)
L1i: 768 KiB (24 instances)
L2: 30 MiB (24 instances)
L3: 36 MiB (2 instances)
NUMA:
NUMA node(s): 2
lscpu on the node without split lock messages (server2):
root@server2:~# lscpu
CPU(s): 40
On-line CPU(s) list: 0-39
Vendor ID: GenuineIntel
BIOS Vendor ID: Intel
Model name: Intel(R) Xeon(R) Silver 4210R CPU @ 2.40GHz
Virtualization features:
Virtualization: VT-x
Caches (sum of all):
L1d: 640 KiB (20 instances)
L1i: 640 KiB (20 instances)
L2: 20 MiB (20 instances)
L3: 27.5 MiB (2 instances)
NUMA:
NUMA node(s): 2