Every once in a while I have a node that no longer responds to api calls; its usually because a container is not responding to pvestatd and has to be killed in order to release pveproxy.
How can I have my monitoring identify this condition? since both pveproxy and pvestatd are technically running I cant monitor the service status. The other nodes are able to see that the node's api is hung (gray icon) so there has to be a method short of continuously polling the api. Ideas?
How can I have my monitoring identify this condition? since both pveproxy and pvestatd are technically running I cant monitor the service status. The other nodes are able to see that the node's api is hung (gray icon) so there has to be a method short of continuously polling the api. Ideas?