PVE7.2: errors in OSD logs after upgrade from Ceph Pacific to Quincy

p.jahn

Active Member
Feb 19, 2019
26
3
43
44
Hello everyone,

last weekend I did upgrade my cluster from Ceph Pacific 16.2.9 to Quincy 17.2.4 without any issues.

Ceph status is HEALTH_OK and everything is working.

While checking the logs I found plenty of this line in the OSD logs:
2022-10-22T16:01:09.412+0200 7f727824f240 0 read_log_and_missing WARN num of dups exceeded 6000. You can be hit by THE DUPS BUG https://tracker.ceph.com/issues/53729. Consider ceph-objectstore-tool --op trim-pg-log-dups

According to the Ceph release notes [0] "a new offline mechanism has been added: ceph-objectstore-tool now has a trim-pg-log-dups op that targets situations where an OSD is unable to boot due to those inflated dups." Issue #53729 [1] is about a situation where OSDs won't boot because they run out of memory.

Since none of my OSDs are unable to boot, I guess that I don't need to run the ceph-objectstore-tool, right?

Regards
Patrick

[0] https://ceph.io/en/news/blog/2022/v17-2-4-quincy-released/
[1] https://tracker.ceph.com/issues/53729
 
Reading through both the release notes and the bug tracker entry, it seems the offline tool is only required if the OSD doesn't boot.
Once it has booted, it will remove those dups online anyway.