Hello Sirs.
Has anyone encountered the same issue as me?
I found that one of the OSDs in our production Proxmox Ceph cluster had high apply latency (around 500 ms).
It caused the performance of the whole Ceph cluster to degrade. After I restarted the OSD, cluster performance returned to normal.
Why would a single OSD with high apply latency cause the performance of the whole Ceph cluster to degrade?
How can I fix this issue?
If I need to monitor apply latency on all OSDs, what threshold in milliseconds would be considered best practice?
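For context, this is roughly the kind of check I have in mind. It is only a minimal sketch: it assumes the JSON printed by `ceph osd perf -f json` contains an `osd_perf_infos` list (nested under `osdstats` on some releases, which I have not verified for every version), and the helper names and the `THRESHOLD_MS` value are placeholders of mine, not recommended settings.

```python
import json
import subprocess

# Hypothetical placeholder; the right threshold is exactly what I am asking about.
THRESHOLD_MS = 100


def parse_apply_latency(raw_json):
    """Return {osd_id: apply_latency_ms} parsed from `ceph osd perf -f json` output."""
    data = json.loads(raw_json)
    # Assumed layout: the list sits either at the top level or under "osdstats".
    infos = data.get("osdstats", data).get("osd_perf_infos", [])
    return {info["id"]: info["perf_stats"]["apply_latency_ms"] for info in infos}


def slow_osds(latencies, threshold_ms):
    """Return the sorted IDs of OSDs whose apply latency exceeds threshold_ms."""
    return sorted(osd for osd, ms in latencies.items() if ms > threshold_ms)


if __name__ == "__main__":
    # Needs the ceph CLI and a keyring with permission to query the cluster.
    raw = subprocess.run(
        ["ceph", "osd", "perf", "-f", "json"],
        capture_output=True, text=True, check=True,
    ).stdout
    for osd in slow_osds(parse_apply_latency(raw), THRESHOLD_MS):
        print(f"osd.{osd}: apply latency above {THRESHOLD_MS} ms")
```

Run from cron or a monitoring agent, this would list only the OSDs above the chosen threshold, so a single slow OSD like the one I described would show up without checking each OSD by hand.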
Thank you in advance.
Edwin.