ceph-crash problem

Aug 4, 2022
15
8
8
Hello,

after upgrading Ceph packages from version 17.2.5 to 17.2.6, the syslog contains following warnings:

Jun 05 09:20:24 pve01 ceph-crash[1883176]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-06-01T19:33:42.133764Z_6d5e3451-8d05-4932-8b5a-a86dca582440 as client.crash.pve01 failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)') Jun 05 09:20:24 pve01 ceph-crash[1883176]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-06-01T19:33:42.133764Z_6d5e3451-8d05-4932-8b5a-a86dca582440 as client.crash failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)') Jun 05 09:20:24 pve01 ceph-crash[1883176]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-06-01T19:33:42.133764Z_6d5e3451-8d05-4932-8b5a-a86dca582440 as client.admin failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')

There are probably some problems with the permission settings after upgrading to the latest Ceph packages. The Proxmox Ceph GUI shows no errors, but there are some unprocessed crash report files in /var/lib/ceph/crash.

Thank you for the answer.
 
Another point to this problem. From version 17.2.6, there is a new function in /usr/bin/ceph-crash script (drop_privs()), that changes user to ceph. This user does not have permissions to read /etc/ceph/ceph.conf (linked to /etc/pve/ceph.conf 640 root:www-data) and /etc/pve/priv dir, that contains keyrings. When we temporary comment out this function, that drops root privileges, everything work as expected.

Changing ceph.conf permissions is not permitted.
root@pve01:/etc/pve# chmod 644 ceph.conf
chmod: changing permissions of 'ceph.conf': Operation not permitted

We do not have ceph-mgr-dashboard package installed as there is no reason for this. So this is probably not the same problem like in this thread.
 
Last edited:
We have the same problem after release upgrade to Proxmox 8.0.3 and ceph 17.2.6. Every 10 minutes in syslog on each node a message. On another cluster with Proxmox 7.4.15 and a ceph 17.2.6 there is not this problem.
 
Hi @radim.smehlik ,
I understand your point of view. Yet, AFAICS, the ceph bug talk about the same issue, without proxmox.... So IMHO, ceph should first solve the issue or maybe Proxmox team could propose a solution to the ceph team. It looks much mainstream to me this way.
But yeah.... we all expecting this to be solved ASAP ;-)
 
Last edited:
I'm just running into this as well. Doesn't look like there has been any movement on the bug or this thread. Is there an ETA for resolution?
 
I am noticing this as well:

Code:
Sep 06 16:43:10 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.crash.pm failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 16:43:10 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.crash failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 16:43:10 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.admin failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 16:53:11 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.crash.pm failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 16:53:11 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.crash failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 16:53:11 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.admin failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 17:03:11 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.crash.pm failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 17:03:11 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.crash failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')
Sep 06 17:03:11 pm ceph-crash[7542]: WARNING:ceph-crash:post /var/lib/ceph/crash/2023-08-23T04:12:26.150675Z_44d4035b-5ea9-4452-93c3-22a06b877464 as client.admin failed: Error initializing cluster client: ObjectNotFound('RADOS object not found (error calling conf_read_file)')

Code:
 pveversion -v
proxmox-ve: 7.4-1 (running kernel: 5.15.111-1-pve)
pve-manager: 7.4-16 (running version: 7.4-16/0f39f621)
pve-kernel-5.15: 7.4-5
pve-kernel-5.13: 7.1-9
pve-kernel-5.15.111-1-pve: 5.15.111-1
pve-kernel-5.15.108-1-pve: 5.15.108-2
pve-kernel-5.13.19-6-pve: 5.13.19-15
ceph: 17.2.6-pve1
ceph-fuse: 17.2.6-pve1
corosync: 3.1.7-pve1
criu: 3.15-1+pve-1
glusterfs-client: 9.2-1
ifupdown2: 3.1.0-1+pmx4
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-1
libknet1: 1.24-pve2
libproxmox-acme-perl: 1.4.4
libproxmox-backup-qemu0: 1.3.1-1
libproxmox-rs-perl: 0.2.1
libpve-access-control: 7.4.1
libpve-apiclient-perl: 3.2-1
libpve-common-perl: 7.4-2
libpve-guest-common-perl: 4.2-4
libpve-http-server-perl: 4.2-3
libpve-rs-perl: 0.7.7
libpve-storage-perl: 7.4-3
libspice-server1: 0.14.3-2.1
lvm2: 2.03.11-2.1
lxc-pve: 5.0.2-2
lxcfs: 5.0.3-pve1
novnc-pve: 1.4.0-1
proxmox-backup-client: 2.4.3-1
proxmox-backup-file-restore: 2.4.3-1
proxmox-kernel-helper: 7.4-1
proxmox-mail-forward: 0.1.1-1
proxmox-mini-journalreader: 1.3-1
proxmox-offline-mirror-helper: 0.5.2
proxmox-widget-toolkit: 3.7.3
pve-cluster: 7.3-3
pve-container: 4.4-6
pve-docs: 7.4-2
pve-edk2-firmware: 3.20230228-4~bpo11+1
pve-firewall: 4.3-5
pve-firmware: 3.6-5
pve-ha-manager: 3.6.1
pve-i18n: 2.12-1
pve-qemu-kvm: 7.2.0-8
pve-xtermjs: 4.16.0-2
qemu-server: 7.4-4
smartmontools: 7.2-pve3
spiceterm: 3.2-2
swtpm: 0.8.0~bpo11+3
vncterm: 1.7-1
zfsutils-linux: 2.1.11-pve1
 
Are these "crash-files" of any importance? They are somewhat old. Yes - there seemed to be some problems. But the number of files does not increase.
Is it risky to simply delete them?
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!