This is just an informative Posting, not really an issue. Unless somebody has a solution for it because it is really an issue.
Ceph cluster is heavily depended on Monitors(MON) to operate correctly. Minimum 3 MONs needs to be online at all times to achieve Quorum. Currently i use Proxmox VM stored locally for all MONs thus saving cash and utility bill on Physical machine. But since i have Proxmox nodes running any way, i wanted to see if i could use the Proxmox Hosts as additional MONs. Last few weeks all together i tried 5 times. Although MONs worked perfectly, all Proxmox Nodes had unwanted side effects. Mostly #pvestatd was unstable, frequent connection error although all VMs were functioning just fine. WebGUI had big trouble showing any kind of stats such as Nodes on or off, CPU/Memory Usage graphs etc. Not being able to monitor entire Proxmox cluster in moments notice is really not an option for a real life Cluster.
Just an hour ago i took off all MONs from all Proxmox nodes and within seconds everything were back to normal again. I was able to reproduce this entire tests 5 times with same results every time. I am going to document it as "Do not use Proxmox for CEPH MONs". It just confirms Proxmox should be left alone just like any other stable platform to allow it to do what it does best.
Anybody tried something similar and had same results?
Ceph cluster is heavily depended on Monitors(MON) to operate correctly. Minimum 3 MONs needs to be online at all times to achieve Quorum. Currently i use Proxmox VM stored locally for all MONs thus saving cash and utility bill on Physical machine. But since i have Proxmox nodes running any way, i wanted to see if i could use the Proxmox Hosts as additional MONs. Last few weeks all together i tried 5 times. Although MONs worked perfectly, all Proxmox Nodes had unwanted side effects. Mostly #pvestatd was unstable, frequent connection error although all VMs were functioning just fine. WebGUI had big trouble showing any kind of stats such as Nodes on or off, CPU/Memory Usage graphs etc. Not being able to monitor entire Proxmox cluster in moments notice is really not an option for a real life Cluster.
Just an hour ago i took off all MONs from all Proxmox nodes and within seconds everything were back to normal again. I was able to reproduce this entire tests 5 times with same results every time. I am going to document it as "Do not use Proxmox for CEPH MONs". It just confirms Proxmox should be left alone just like any other stable platform to allow it to do what it does best.
Anybody tried something similar and had same results?