Hey Proxmox Forum,
I am happy with proxmox for many years, but have a bug that we could not solve here. We have a "buggy" VM that we can start and run into massive CRC Ceph errors till the VM crashes. That makes the full host laggy too. We have the problem only with one VM and could delete but would like to understand where it is from.
We tested to restore the VM from a backup and had the same error. Same with the following CPU's from different motherboard companies and on different clusters.
48 x AMD EPYC 7402P 24-Core Processor
128 x AMD EPYC 9534 64-Core Processor
16 x Intel(R) Xeon(R) Gold 6434
Here on one of them:
Nload Proxmox Host: https://cdnx.me/I/UCX8eUnHK.png
VM freezing: https://cdnx.me/I/ATrZ8KM1Q.png
Proxmox Host: https://cdnx.me/I/tQP2vaotP.png
Ceph Load: https://cdnx.me/I/VEo4ipDBN.png
Ceph Log: https://cdnx.me/I/GQoicuqGO.png
Ceph Storage Host(All on the same):
ceph -v
ceph version 18.2.2 (531c0d11a1c5d39fbfe6aa8a521f023abf3bf3e2) reef (stable)
Proxmox(All on the same): ceph -v
#ceph version 18.2.2 (e9fe820e7fffd1b7cde143a9f77653b73fcec748) reef (stable)
If you have any Idea it would be nice to get your feedback. This here is may not 100% the right location for it - may you have a Idea where to ask / report.
Thanks,
Paul
I am happy with proxmox for many years, but have a bug that we could not solve here. We have a "buggy" VM that we can start and run into massive CRC Ceph errors till the VM crashes. That makes the full host laggy too. We have the problem only with one VM and could delete but would like to understand where it is from.
We tested to restore the VM from a backup and had the same error. Same with the following CPU's from different motherboard companies and on different clusters.
48 x AMD EPYC 7402P 24-Core Processor
128 x AMD EPYC 9534 64-Core Processor
16 x Intel(R) Xeon(R) Gold 6434
Here on one of them:
Nload Proxmox Host: https://cdnx.me/I/UCX8eUnHK.png
VM freezing: https://cdnx.me/I/ATrZ8KM1Q.png
Proxmox Host: https://cdnx.me/I/tQP2vaotP.png
Ceph Load: https://cdnx.me/I/VEo4ipDBN.png
Ceph Log: https://cdnx.me/I/GQoicuqGO.png
Ceph Storage Host(All on the same):
ceph -v
ceph version 18.2.2 (531c0d11a1c5d39fbfe6aa8a521f023abf3bf3e2) reef (stable)
Proxmox(All on the same): ceph -v
#ceph version 18.2.2 (e9fe820e7fffd1b7cde143a9f77653b73fcec748) reef (stable)
Code:
root@XXXXXXXXPRX1:~# pveversion -v
proxmox-ve: 8.2.0 (running kernel: 6.8.12-1-pve)
pve-manager: 8.2.4 (running version: 8.2.4/faa83925c9641325)
proxmox-kernel-helper: 8.1.0
proxmox-kernel-6.8: 6.8.12-1
proxmox-kernel-6.8.12-1-pve-signed: 6.8.12-1
amd64-microcode: 3.20240820.1~deb12u1
ceph-fuse: 18.2.2-pve1
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown: residual config
ifupdown2: 3.2.0-1+pmx9
libjs-extjs: 7.0.0-4
libknet1: 1.28-pve1
libproxmox-acme-perl: 1.5.1
libproxmox-backup-qemu0: 1.4.1
libproxmox-rs-perl: 0.3.3
libpve-access-control: 8.1.4
libpve-apiclient-perl: 3.3.2
libpve-cluster-api-perl: 8.0.7
libpve-cluster-perl: 8.0.7
libpve-common-perl: 8.2.2
libpve-guest-common-perl: 5.1.4
libpve-http-server-perl: 5.1.0
libpve-network-perl: 0.9.8
libpve-rs-perl: 0.8.9
libpve-storage-perl: 8.2.3
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 6.0.0-1
lxcfs: 6.0.0-pve2
novnc-pve: 1.4.0-3
proxmox-backup-client: 3.2.7-1
proxmox-backup-file-restore: 3.2.7-1
proxmox-firewall: 0.5.0
proxmox-kernel-helper: 8.1.0
proxmox-mail-forward: 0.2.3
proxmox-mini-journalreader: 1.4.0
proxmox-offline-mirror-helper: 0.6.6
proxmox-widget-toolkit: 4.2.3
pve-cluster: 8.0.7
pve-container: 5.1.12
pve-docs: 8.2.3
pve-edk2-firmware: not correctly installed
pve-esxi-import-tools: 0.7.1
pve-firewall: 5.0.7
pve-firmware: 3.13-1
pve-ha-manager: 4.0.5
pve-i18n: 3.2.2
pve-qemu-kvm: 9.0.2-2
pve-xtermjs: 5.3.0-3
qemu-server: 8.2.4
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.2.4-pve1
If you have any Idea it would be nice to get your feedback. This here is may not 100% the right location for it - may you have a Idea where to ask / report.
Thanks,
Paul