After much testing and disaster simulation i have decided not to put Ceph OSDs on Proxmox node with VMs. This not be confused with not using Proxmox+Ceph Severs together. But OSDs should not be on the same Proxmox Nodes where several Virtual Machines are served.
During a OSD failure or OSD addition when Ceph goes into rebalancing mode i have noticed between 25% to 35% CPU consumption. If all my VMs already consuming 80% of CPU, this caused major slow down of VMs. During regular operation though CPU consumption was hardly noticeable. This is not new. Ceph developers did mention that during rebalancing Ceph will consumed large amount of resource.
As long as all OSDs are in their own nodes without any VMs, all is good. On a 7 nodes cluster i put all OSDs in Node 5, 6 & 7. Then put all VMs spread across Node 1 to 4. Running same disaster simulation VMs performed much better.
Proxmox + Ceph server still shines. Because this gives us ability to monitor/manage Ceph from same GUI and eliminates need of having separate node for admin/MONs.
Anybody had experience such as this or have suggestions?
During a OSD failure or OSD addition when Ceph goes into rebalancing mode i have noticed between 25% to 35% CPU consumption. If all my VMs already consuming 80% of CPU, this caused major slow down of VMs. During regular operation though CPU consumption was hardly noticeable. This is not new. Ceph developers did mention that during rebalancing Ceph will consumed large amount of resource.
As long as all OSDs are in their own nodes without any VMs, all is good. On a 7 nodes cluster i put all OSDs in Node 5, 6 & 7. Then put all VMs spread across Node 1 to 4. Running same disaster simulation VMs performed much better.
Proxmox + Ceph server still shines. Because this gives us ability to monitor/manage Ceph from same GUI and eliminates need of having separate node for admin/MONs.
Anybody had experience such as this or have suggestions?