ceph tuning

RobFantini

I'm going to make a new ceph test cluster and have some tuning questions.

We'll be using a 4-disk RAID-10 on each node plus a hot spare. There will be one OSD per node. We do not need super fast I/O; instead our priority is high availability and consistently great keyboard response. We'll use this SSD for the journal: Intel DC S3700 Series 200GB.

Does anyone know how we could set these (see the sketch after this list):

* a replica count of 2
* the OSD noout flag set permanently
* "mon osd downout subtree limit" set to "host"



thank you and best regards
Rob





 
Does anyone know how we could set these:

permanently set OSD noout ?
Hi,
this has the disadvantage that your cluster isn't healthy...

I use a simple monitoring script which checks how many OSDs are down, and if "enough" OSDs are down, the noout flag is set.
So I have a healthy cluster, and if one node fails the OSDs are not reorganized - but I must bring up the failed node again ;)

Icinga checks every minute with this script:
Code:
cat /usr/local/scripts/ceph_set_noout.sh
#!/bin/bash
#
# ceph_set_noout.sh automatically sets the noout flag (so that no recovery
# to other nodes takes place) when more OSDs are down than max_osd_down allows.

max_osd_down=5

# parse the "osdmap eNNN: X osds: Y up, Z in" line from "ceph -s"
osdmap=`/usr/bin/ceph --keyring /var/lib/icinga/ceph.keyring -c /var/lib/icinga/ceph.conf -s | grep osdmap`
osd=`echo $osdmap | awk '{print $3 }'`
osd_up=`echo $osdmap | awk '{print $5 }'`
osd_in=`echo $osdmap | awk '{print $7 }'`
down=`echo "$osd - $osd_up" | bc`
perfdata="|osd=$osd;up=$osd_up;in=$osd_in"

# too many OSDs down: set noout and report CRITICAL
if [ $down -gt $max_osd_down ]
  then
    echo "$down osd are down; ceph osd set noout $perfdata"
    /usr/bin/ceph --keyring /usr/local/icinga/ceph.keyring -c /usr/local/icinga/ceph.conf osd set noout
    exit 2
fi

# all OSDs up: OK, otherwise WARNING
if [ $down -eq 0 ]
  then
    echo "all $osd osd are up $perfdata"
    exit 0
  else
    echo "$down osd are down $perfdata"
    exit 1
fi
Udo
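A hedged sketch of how a one-minute check like this could be wired into an Icinga 1.x / Nagios-style configuration (the file, command, service and host names here are made up; only the script path matches the post above):

Code:
# /etc/icinga/objects/ceph_noout.cfg (hypothetical file name)
define command{
        command_name    check_ceph_set_noout
        command_line    /usr/local/scripts/ceph_set_noout.sh
        }

define service{
        use                     generic-service
        host_name               ceph-mon1
        service_description     Ceph OSD noout guard
        check_command           check_ceph_set_noout
        check_interval          1
        }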
 
Udo, what % of OSDs should this be: max_osd_down=5?

and it looks like you use Icinga? As I don't, I will attempt to run your script from cron.
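For the cron route, a minimal sketch (assuming the script is adapted to read a keyring and ceph.conf that root can access, e.g. /etc/ceph/ceph.client.admin.keyring, instead of the Icinga paths):

Code:
# /etc/cron.d/ceph-noout - run the check every minute as root
* * * * *  root  /usr/local/scripts/ceph_set_noout.sh >> /var/log/ceph_set_noout.log 2>&1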
 
Udo, what % of OSDs should this be: max_osd_down=5?

and it looks like you use Icinga? As I don't, I will attempt to run your script from cron.
Hi,
yes, this script is run from Icinga - also to get an alarm if one node fails.

If the number of down OSDs is less than or equal to max_osd_down, the noout flag isn't set. In my case each OSD node has 12 OSDs - only if more than 5 OSDs are down will the noout flag prevent an automatic resync.

Udo
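To make the threshold concrete (a worked note based on the script's -gt comparison, not part of the original posts):

Code:
# the script sets noout only when:  down > max_osd_down
#
# Udo's case: 12 OSDs per node, max_osd_down=5
#   up to 5 individual OSD failures -> normal recovery/rebalance still happens
#   a whole node down (12 OSDs)     -> noout is set, no rebalance to other nodes
#
# i.e. pick a value above the number of single-OSD failures you still want
# auto-recovered, but below the OSD count of one node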
 
So if I have one OSD per node and 3 nodes, that's 3 OSDs.

What should max_osd_down be set to? 1?

If 5 OSDs, then 3?
 
Udo: OK, I made the changes. Next question:

If I take a node offline for maintenance, is there anything that needs to be done when it is put back online?
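A hedged sketch of the usual node-maintenance routine (standard Ceph commands, not taken from this thread):

Code:
# before taking the node down: prevent its OSDs from being marked out
ceph osd set noout

# ... do the maintenance and reboot the node ...

# after the node is back: check that its OSDs rejoin and are up/in
ceph osd tree
ceph -s

# once the cluster is healthy again, clear the flag
ceph osd unset noout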
 
