PVE7.2: errors in OSD logs after upgrade from Ceph Pacific to Quincy

p.jahn

Active Member
Feb 19, 2019
26
3
43
43
Hello everyone,

last weekend I did upgrade my cluster from Ceph Pacific 16.2.9 to Quincy 17.2.4 without any issues.

Ceph status is HEALTH_OK and everything is working.

While checking the logs I found plenty of this line in the OSD logs:
2022-10-22T16:01:09.412+0200 7f727824f240 0 read_log_and_missing WARN num of dups exceeded 6000. You can be hit by THE DUPS BUG https://tracker.ceph.com/issues/53729. Consider ceph-objectstore-tool --op trim-pg-log-dups

According to the Ceph release notes [0] "a new offline mechanism has been added: ceph-objectstore-tool now has a trim-pg-log-dups op that targets situations where an OSD is unable to boot due to those inflated dups." Issue #53729 [1] is about a situation where OSDs won't boot because they run out of memory.

Since none of my OSDs are unable to boot, I guess that I don't need to run the ceph-objectstore-tool, right?

Regards
Patrick

[0] https://ceph.io/en/news/blog/2022/v17-2-4-quincy-released/
[1] https://tracker.ceph.com/issues/53729
 
Reading through both the release notes and the bug tracker entry, it seems the offline tool is only required if the OSD doesn't boot.
Once it has booted, it will remove those dups online anyway.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!