Ceph problem

anton.chasnyk

New Member
Jul 12, 2016
I installed Proxmox 4.4 and Ceph Jewel according to the manual, but I have a problem with my OSDs.
If I reboot the first node, the OSDs on that node are still reported as online and my VMs are suspended.

size/min - 2/1
pg num - 128

OSDs:
node1
- osd.0
- osd.1
node 2
- osd.3
- osd.4
node 3
- no OSDs
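To see whether those OSDs ever get marked down when node1 reboots, I check from one of the other nodes, roughly like this (standard Ceph CLI, output shortened):
Code:
# watch cluster events live while node1 is rebooting
ceph -w
# afterwards, check whether osd.0 and osd.1 were ever marked down
ceph osd tree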


In the debug messages I found this:
Code:
[    3.391884] systemd[1]: [/lib/systemd/system/ceph-osd@.service:18] Unknown lvalue 'TasksMax' in section 'Service'
[    3.392414] systemd[1]: [/lib/systemd/system/ceph-osd@.service:18] Unknown lvalue 'TasksMax' in section 'Service'
[    3.393806] systemd[1]: [/lib/systemd/system/ceph-mon@.service:24] Unknown lvalue 'TasksMax' in section 'Service'
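As far as I can tell these are only warnings: the TasksMax= setting in the Ceph unit files seems to be newer than the systemd version shipped with Proxmox 4.4, so systemd just ignores that line. This is roughly how I looked at it (path taken from the log above):
Code:
# show the installed systemd version
systemctl --version
# line 18 of the unit file is the one systemd complains about
sed -n '18p' /lib/systemd/system/ceph-osd@.service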


UPD: I get this problem on both nodes that have OSDs. (I have three nodes in total; one node has no OSDs.)

UPD: Ceph Config:
Code:
[global]
     auth client required = cephx
     auth cluster required = cephx
     auth service required = cephx
     cluster network = 192.168.87.0/24
     filestore xattr use omap = true
     fsid = 5ee20a5b-eeb2-4bcb-86a5-57cd9421c0d9
     keyring = /etc/pve/priv/$cluster.$name.keyring
     osd journal size = 5120
     osd pool default min size = 1
     public network = 192.168.87.0/24

[osd]
     keyring = /var/lib/ceph/osd/ceph-$id/keyring

[mon.1]
     host = node1
     mon addr = 192.168.87.15:6789

[mon.2]
     host = node2
     mon addr = 192.168.87.13:6789

[mon.0]
     host = node3
     mon addr = 192.168.87.14:6789
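The three [mon.X] sections above form the monitor quorum; it can be checked roughly like this (output abbreviated):
Code:
# short monitor summary including quorum membership
ceph mon stat
# full quorum details as JSON
ceph quorum_status --format json-pretty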

Code:
# begin crush map
tunable choose_local_tries 0
tunable choose_local_fallback_tries 0
tunable choose_total_tries 50
tunable chooseleaf_descend_once 1
tunable chooseleaf_vary_r 1
tunable straw_calc_version 1
tunable allowed_bucket_algs 54

# devices
device 0 osd.0
device 1 osd.1
device 2 osd.2
device 3 osd.3

# types
type 0 osd
type 1 host
type 2 chassis
type 3 rack
type 4 row
type 5 pdu
type 6 pod
type 7 room
type 8 datacenter
type 9 region
type 10 root

# buckets
host node1 {
         id -2 # do not change unnecessarily
         # weight 1.851
         alg straw
         hash 0 # rjenkins1
         item osd.0 weight 0.926
         item osd.1 weight 0.926
}

host node2 {
         id -3 # do not change unnecessarily
         # weight 1.851
         alg straw
         hash 0 # rjenkins1
         item osd.2 weight 0.926
         item osd.3 weight 0.926
}

root default {
          id -1 # do not change unnecessarily
          # weight 3.703
          alg straw
          hash 0 # rjenkins1
          item node2 weight 1.851
          item node1 weight 1.851
}

# rules
rule replicated_ruleset {
          ruleset 0
          type replicated
          min_size 1
          max_size 10
          step take default
          step chooseleaf firstn 0 type host
          step emit
}
# end crush map
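For reference, this text form of the crush map was obtained roughly like this (file names are just examples):
Code:
# dump the compiled crush map from the cluster
ceph osd getcrushmap -o /tmp/crushmap.bin
# decompile it into the text form shown above
crushtool -d /tmp/crushmap.bin -o /tmp/crushmap.txt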
 
Hi,
is your Ceph cluster healthy?
What is the output of
Code:
ceph -s
Code:
osd pool default min size = 1
"osd pool default min size = 1" is dangerous!

It's highly recommended to use 3 replicas, so you must have OSDs on at least three hosts.

What do your pools look like?
Code:
# print the "ceph osd dump" line for every pool (pool names appear quoted in the dump output)
for i in `ceph osd lspools | tr -d ",[0-9]"`
  do
    ceph osd dump | grep \'$i\'
done
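If your Jewel installation has it, the same information is also available in one command (please check, I don't remember in which release it appeared):
Code:
# list all pools with their full settings
ceph osd pool ls detail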
Udo
 

Here is the output of ceph -s:
Code:
 cluster 5ee20a5b-eeb2-4bcb-86a5-57cd9421c0d9
     health HEALTH_OK
     monmap e3: 3 mons at {0=192.168.87.14:6789/0,1=192.168.87.15:6789/0,2=192.168.87.13:6789/0}
            election epoch 30, quorum 0,1,2 2,0,1
     osdmap e61: 4 osds: 4 up, 4 in
            flags sortbitwise,require_jewel_osds
      pgmap v176: 128 pgs, 1 pools, 0 bytes data, 0 objects
            138 MB used, 3791 GB / 3791 GB avail
                 128 active+clean

My pool looks like this:
Code:
pool 4 'rbd' replicated size 2 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 128 pgp_num 128 last_change 61 flags hashpspool stripe_width 0

"osd pool default min size = 1" is dangerous!
I will fix it soon, thanks.
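If I read the docs correctly, this can also be changed on the existing pool without recreating it, something like:
Code:
# raise the replica count and the minimum replicas required for I/O on pool 'rbd'
ceph osd pool set rbd size 3
ceph osd pool set rbd min_size 2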
 
Hi,
but you need three OSD hosts and replica=3, together with "osd pool default min size = 2", to get a setup where one node can go down without interruption.
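For pools you create later you can also put these defaults into ceph.conf, roughly like this (values for a three-node setup):
Code:
[global]
     osd pool default size = 3
     osd pool default min size = 2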

Udo
I recreated my pool:
Code:
pool 8 'rbd' replicated size 3 min_size 2 crush_ruleset 0 object_hash rjenkins pg_num 128 pgp_num 128 last_change 167 flags hashpspool stripe_width 0
and set up my OSDs like this:
OSDs:
node1
- osd.0
- osd.1
node 2
- osd.3
node 3
- osd.4

In this configuration the cluster is working fine. Thanks for the help.
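For the record, this is roughly how I verify the distribution and the health after the change (output depends on the cluster, of course):
Code:
# OSDs should now be spread over three hosts and all up/in
ceph osd tree
# all 128 PGs should be active+clean with size 3
ceph -s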