Cluster Status Missing From External Metric Server Metrics

speatzle_

Well-Known Member
Nov 4, 2017
Hello,

Recently I have deployed a lot of small Proxmox VE clusters which now need monitoring. I have used Influx and Grafana to monitor containers and VMs in the past, and was very surprised to see that the External Metric Server doesn't supply any metrics about the cluster status (e.g. whether it is quorate, number of votes, ...), even though this data is available in the API (pvesh get /cluster/ha/status/current or pvesh get /cluster/ha/status/manager_status). After investigating the source a bit with my limited knowledge of Perl, I think that all the needed libraries already exist and that this data could be added to the External Metric Server's metrics with very little effort. The relevant code is in the pve-manager git repo, in the file PVE/Service/pvestatd.pm on line 125. There is a function there called update_node_status in which the quorum metrics could be exposed.

Is there a reason why these Metrics aren't exposed here or has this just been forgotten about?

Is this something that the Proxmox team could implement?

This would make the External Metric Server a good option for monitoring clusters, as nothing has to be installed on the hosts themselves and it is now configurable via the GUI.
 
Hi,

Well, that is my own enhancement request, which I made after posting on the mailing list, so I am already on CC.
Thanks for the response anyway.
 
Perhaps too late, but as I had the same problem I wrote a small Python script to run inside the telegraf exec input. The script simply runs pvecm status, parses the result and returns a value suitable for InfluxDB. Until the native process reports this information, the script is proving useful.
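
For anyone wanting to try the same approach, here is a minimal sketch of what such a script could look like. It is not the original script. It assumes that pvecm status prints "Key: value" lines such as "Quorate: Yes", "Nodes: 3", "Expected votes: 3", "Total votes: 3" and "Quorum: 2"; the measurement name pve_cluster and the field names are made up for illustration, so adjust them to whatever your node actually prints.

```python
#!/usr/bin/env python3
"""Sketch: turn `pvecm status` output into one InfluxDB line-protocol line."""
import subprocess

# Assumed keys from `pvecm status` output, mapped to hypothetical field names.
INT_FIELDS = {
    "Nodes": "nodes",
    "Expected votes": "expected_votes",
    "Total votes": "total_votes",
    "Quorum": "quorum",
}


def parse_pvecm_status(text):
    """Extract the quorum-related values from the command output."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # section headers and separators have no colon
        key, value = key.strip(), value.strip()
        if key == "Quorate":
            fields["quorate"] = 1 if value.lower() == "yes" else 0
        elif key in INT_FIELDS:
            # Only the first token is used, in case extra text follows the number.
            first = value.split()[0] if value else ""
            if first.isdigit():
                fields[INT_FIELDS[key]] = int(first)
    return fields


def to_line_protocol(fields, measurement="pve_cluster"):
    """Format integer fields as a line-protocol line (no tags, no timestamp)."""
    body = ",".join(f"{name}={value}i" for name, value in sorted(fields.items()))
    return f"{measurement} {body}"


def main():
    try:
        result = subprocess.run(
            ["pvecm", "status"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired):
        return  # pvecm missing or hanging: emit nothing for this interval
    fields = parse_pvecm_status(result.stdout)
    if fields:
        print(to_line_protocol(fields))


if __name__ == "__main__":
    main()
```

In telegraf this would be called from an exec input with data_format set to "influx". The user telegraf runs as may not be allowed to run pvecm, so it may need sudo or similar on your setup.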