Ceph very low performance 12MB/s

felipemb

Hi,

I need help with my cluster. I have the following setup: a Proxmox cluster with 3 nodes, with Ceph running on only 2 of them:

Cluster
1629779091750.png


Ceph
1629779171738.png


Code:
[global]
     auth_client_required = cephx
     auth_cluster_required = cephx
     auth_service_required = cephx
     cluster_network = 10.0.20.11/24
     fsid = b9b27296-a27b-400a-9fcf-aebb4a767bc2
     mon_allow_pool_delete = true
     mon_host = 10.0.20.11 10.0.20.12 10.0.20.13
     osd_pool_default_min_size = 2
     osd_pool_default_size = 3
     public_network = 10.0.20.11/24

[client]
     keyring = /etc/pve/priv/$cluster.$name.keyring

[mon.nodo1]
     public_addr = 10.0.20.11

[mon.nodo2]
     public_addr = 10.0.20.12

[mon.nodo3]
     public_addr = 10.0.20.13


1629779286918.png


My problem is that performance is very low. Here are the results:

1629779392901.png

1629779436096.png
At times the speed rises to 110 MB/s, but I am testing with SSDs, so it should be above 350 MB/s.

Can someone help me find a way to improve performance?


Thank you!





 
Klaus, thanks for your answer...

  1. Is it not possible to have 3 nodes in the cluster acting as monitors, with only 2 nodes replicating with Ceph? If not, what would you recommend for a cluster with replication across 2 nodes? ZFS replication?
  2. I have created a round-robin bond from 2 network ports, each running at 1 Gbit/s, so in theory 250 MB/s.
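For reference, the 250 MB/s figure can be checked with a short sketch. It only gives the theoretical ceiling of the bond, using decimal units and ignoring protocol overhead; the function name is made up for illustration, and real throughput over a round-robin bond will normally be lower.

```python
def bond_ceiling_mb_per_s(links: int, gbit_per_link: float) -> float:
    """Theoretical aggregate throughput of a bond in MB/s.

    Assumes decimal units (1 Gbit/s = 1000 Mbit/s) and no protocol overhead.
    """
    megabits_per_s = links * gbit_per_link * 1000
    return megabits_per_s / 8  # 8 bits per byte


if __name__ == "__main__":
    # Two bonded 1 Gbit/s ports, as described in the post
    print(bond_ceiling_mb_per_s(2, 1.0))
    # A single 10 Gbit/s port, for comparison
    print(bond_ceiling_mb_per_s(1, 10.0))
```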

Thanks for your time!
 
No, replicating across only 2 nodes is unsafe. By design you need at least 3 nodes.
Hardcore Ceph people even recommend more nodes, but 3 works well.

Also, 1 Gbit/s links are way too slow for decent Ceph performance. Remember that Ceph has to replicate _any_ write to all 3 nodes before it can acknowledge it!

For a working production Ceph setup (as opposed to an experimental one) you need nodes with at least 10 Gbit/s links for the Ceph backend and 10 Gbit/s links for the frontend (faster is better, of course!). You should also have more than 2 OSDs per node!
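To put rough numbers on why link speed matters so much, here is a back-of-the-envelope sketch. It assumes a deliberately simplified model: the primary OSD receives a write and must forward one copy to each of the other replicas over a single link, so that link carries (size - 1) copies of every write. The function name is hypothetical, and this is only an upper bound on sustained writes, not a prediction of what a benchmark will show.

```python
def replicated_write_ceiling(link_mb_per_s: float, pool_size: int) -> float:
    """Rough upper bound on sustained client write throughput in MB/s.

    Simplified assumption: the primary OSD forwards (pool_size - 1) copies
    of each write over one link of the given speed. Latency, disk speed and
    traffic from other clients are ignored.
    """
    if pool_size < 2:
        return link_mb_per_s  # no replica traffic to forward
    return link_mb_per_s / (pool_size - 1)


if __name__ == "__main__":
    one_gbit = 1000 / 8  # 125 MB/s, decimal units
    print(replicated_write_ceiling(one_gbit, 3))       # 1 Gbit/s link, size = 3
    print(replicated_write_ceiling(10 * one_gbit, 3))  # 10 Gbit/s link, size = 3
```

Under this model a single 1 Gbit/s link tops out well below what one SSD can deliver, and sharing the same network for public and cluster traffic, as in the posted config, makes it tighter still.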
 
Getting a third node will also speed things up considerably. For small clusters, increasing the PG count helps a lot too, on the same principle: more nodes means more disks that can take part in serving each request.
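As a starting point for choosing a PG count, the commonly quoted rule of thumb is roughly (number of OSDs x 100) / pool size, rounded up to a power of two. The sketch below just does that arithmetic. The OSD counts in the example are made up, since the real ones are only visible in the screenshots, and the helper name is hypothetical. Treat the result as a first guess and check it against the Ceph documentation or the PG autoscaler for your release.

```python
def suggested_pg_count(osd_count: int, pool_size: int, pgs_per_osd: int = 100) -> int:
    """Rule-of-thumb PG count for a single pool holding most of the data.

    Computes (osd_count * pgs_per_osd) / pool_size and rounds the result
    up to the next power of two.
    """
    raw = osd_count * pgs_per_osd / pool_size
    power = 1
    while power < raw:
        power *= 2
    return power


if __name__ == "__main__":
    # Hypothetical example: 6 OSDs, replicated pool with size = 3
    print(suggested_pg_count(6, 3))
    # Hypothetical example: 3 OSDs, replicated pool with size = 3
    print(suggested_pg_count(3, 3))
```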
 
