I upgraded and patched my system coincidently this afternoon and got proxmox 8.1 right away and immediately ran into the error with the stalling CPUs which has been there before
https://forum.proxmox.com/threads/rcu-info-rcu_sched-self-detected-stall-on-cpu.109112/
https://forum.proxmox.com/threads/rcu_sched-self-detected-stall-on-cpu.68399/
https://forum.proxmox.com/threads/rcu_sched-self-detected-stall-on-cpu.111439/
and more.
I am running 4 server with
- Intel(R) Xeon(R) Silver 4310
- 8 x 4TB Samsung SSD each (as ceph OSDs)
- 768 GB RAM
- 2 x 10Gbit Lacp trunk for storage network
Utilization is
CPU < 50%
Memory <50%
Storage throughput at ~ 250 - 500 MiB/s (it can do 2GiB/s)
running on the new kernel 6.5.11-4 immediately crashed all VM that had a little bit of load. i tried all aio settings with iothreads, that did not help at all.
At last i downgraded to kernel 6.2.16-19
and so far it works again, i migrated several VM form hosts with the NEW kernel to the host with the OLD kernel, so far no problem.
There seems to be a problem with the new 6.5.11-4 kernel.
For people with the same problem, i pinned the older kernel like so
https://forum.proxmox.com/threads/rcu-info-rcu_sched-self-detected-stall-on-cpu.109112/
https://forum.proxmox.com/threads/rcu_sched-self-detected-stall-on-cpu.68399/
https://forum.proxmox.com/threads/rcu_sched-self-detected-stall-on-cpu.111439/
and more.
I am running 4 server with
- Intel(R) Xeon(R) Silver 4310
- 8 x 4TB Samsung SSD each (as ceph OSDs)
- 768 GB RAM
- 2 x 10Gbit Lacp trunk for storage network
Utilization is
CPU < 50%
Memory <50%
Storage throughput at ~ 250 - 500 MiB/s (it can do 2GiB/s)
running on the new kernel 6.5.11-4 immediately crashed all VM that had a little bit of load. i tried all aio settings with iothreads, that did not help at all.
At last i downgraded to kernel 6.2.16-19
and so far it works again, i migrated several VM form hosts with the NEW kernel to the host with the OLD kernel, so far no problem.
There seems to be a problem with the new 6.5.11-4 kernel.
For people with the same problem, i pinned the older kernel like so
Code:
#> proxmox-boot-tool kernel pin 6.2.16-19-pve
Last edited: