[BUG] CRC errors with Ceph

PaulX

New Member
Sep 11, 2024
1
0
1
Hey Proxmox Forum,

I am happy with proxmox for many years, but have a bug that we could not solve here. We have a "buggy" VM that we can start and run into massive CRC Ceph errors till the VM crashes. That makes the full host laggy too. We have the problem only with one VM and could delete but would like to understand where it is from.

We tested to restore the VM from a backup and had the same error. Same with the following CPU's from different motherboard companies and on different clusters.
48 x AMD EPYC 7402P 24-Core Processor
128 x AMD EPYC 9534 64-Core Processor
16 x Intel(R) Xeon(R) Gold 6434

Here on one of them:

Nload Proxmox Host: https://cdnx.me/I/UCX8eUnHK.png
VM freezing: https://cdnx.me/I/ATrZ8KM1Q.png
Proxmox Host: https://cdnx.me/I/tQP2vaotP.png
Ceph Load: https://cdnx.me/I/VEo4ipDBN.png
Ceph Log: https://cdnx.me/I/GQoicuqGO.png

Ceph Storage Host(All on the same):
ceph -v
ceph version 18.2.2 (531c0d11a1c5d39fbfe6aa8a521f023abf3bf3e2) reef (stable)

Proxmox(All on the same): ceph -v
#ceph version 18.2.2 (e9fe820e7fffd1b7cde143a9f77653b73fcec748) reef (stable)


Code:
root@XXXXXXXXPRX1:~# pveversion -v
proxmox-ve: 8.2.0 (running kernel: 6.8.12-1-pve)
pve-manager: 8.2.4 (running version: 8.2.4/faa83925c9641325)
proxmox-kernel-helper: 8.1.0
proxmox-kernel-6.8: 6.8.12-1
proxmox-kernel-6.8.12-1-pve-signed: 6.8.12-1
amd64-microcode: 3.20240820.1~deb12u1
ceph-fuse: 18.2.2-pve1
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown: residual config
ifupdown2: 3.2.0-1+pmx9
libjs-extjs: 7.0.0-4
libknet1: 1.28-pve1
libproxmox-acme-perl: 1.5.1
libproxmox-backup-qemu0: 1.4.1
libproxmox-rs-perl: 0.3.3
libpve-access-control: 8.1.4
libpve-apiclient-perl: 3.3.2
libpve-cluster-api-perl: 8.0.7
libpve-cluster-perl: 8.0.7
libpve-common-perl: 8.2.2
libpve-guest-common-perl: 5.1.4
libpve-http-server-perl: 5.1.0
libpve-network-perl: 0.9.8
libpve-rs-perl: 0.8.9
libpve-storage-perl: 8.2.3
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 6.0.0-1
lxcfs: 6.0.0-pve2
novnc-pve: 1.4.0-3
proxmox-backup-client: 3.2.7-1
proxmox-backup-file-restore: 3.2.7-1
proxmox-firewall: 0.5.0
proxmox-kernel-helper: 8.1.0
proxmox-mail-forward: 0.2.3
proxmox-mini-journalreader: 1.4.0
proxmox-offline-mirror-helper: 0.6.6
proxmox-widget-toolkit: 4.2.3
pve-cluster: 8.0.7
pve-container: 5.1.12
pve-docs: 8.2.3
pve-edk2-firmware: not correctly installed
pve-esxi-import-tools: 0.7.1
pve-firewall: 5.0.7
pve-firmware: 3.13-1
pve-ha-manager: 4.0.5
pve-i18n: 3.2.2
pve-qemu-kvm: 9.0.2-2
pve-xtermjs: 5.3.0-3
qemu-server: 8.2.4
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.2.4-pve1

If you have any Idea it would be nice to get your feedback. This here is may not 100% the right location for it - may you have a Idea where to ask / report.

Thanks,

Paul
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!