Hello
we are using zabbix graphs to monitor ceph latency. see attached example .
currently we have just seven 2-TB P3700 nvme drives active.
at the time of spikes there is very little activity by users or cronjobs. zabbix network graphs show below average activity at the time of most spikes.
To try to see if there is bad hardware, we'd like to set up per osd latency history data. Does anyone have suggestions on how to do so?
we are using zabbix graphs to monitor ceph latency. see attached example .
currently we have just seven 2-TB P3700 nvme drives active.
at the time of spikes there is very little activity by users or cronjobs. zabbix network graphs show below average activity at the time of most spikes.
To try to see if there is bad hardware, we'd like to set up per osd latency history data. Does anyone have suggestions on how to do so?