:D :P I actually like it. This way you also get to see how others deal with it. You can only learn from that.
Hmm, yes and no... I would imagine it like this:
The pool shows the total capacity of all participating OSDs as a RAW value, e.g. 100 TB. If I then have 2 pools, it simply comes off the available...
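For comparison, what Ceph actually reports: as far as I understand it, the MAX AVAIL column in ceph df is a per-pool estimate that already factors in the replica count, the full_ratio and the most-full OSD the pool maps to, which is why STORED + MAX AVAIL shifts as the data distribution changes. Two commands to see that breakdown:

ceph df detail     # STORED vs USED vs MAX AVAIL per pool
ceph osd df tree   # per-OSD fill level; the fullest OSD pulls MAX AVAIL down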
Phew, more discussion than I expected :p :D
Well, we have been using Ceph as a storage backend for many years and I have to say I had never noticed this behaviour before.
Personally, I find it most consistent if the size of the pool is simply the sum of the...
Hello everyone
I tidied up our SSD pool a bit and noticed that the total size of the pool varies depending on how full the pool is.
How can that be? The disks have a fixed size, regardless of how much data is stored on them.
In older...
I mixed something up. We had an incident where the OS disk of a server died. That was the case where the VMs were down, and we got them back up on another node within 10 min.
When a DB/WAL SSD died, the VMs were not down, but we needed to shut down the server to replace the NVMe drive, so we live-migrated the VMs.
While we had our fair share of problems with Ceph in the past, mostly due to inexperience, what would bite us? I mean in terms of storing our data reliably and running consistently?
That's a valid point. Good thing is, our new cluster isn't set in stone yet. It's probably a good idea to assess...
Yeah, true, that can happen. We run a 2/3 replica on a 6-node cluster and we had one node go down with a failed DB/WAL disk. But frankly, that was really not a big deal. We moved all affected VMs to another node. After 10 min we had all affected VMs running again, and after 1 day of rebalancing...
That's absolutely a valid strategy. Most of the data that's idle is SMB shares with files for daily work. So it's not totally idle, just not used very much, but still used. Lots of that is documents and invoices in our document management system and financial accounting.
There is not much of a...
Sadly, that's not possible. We need about 15-20 TB of storage, so around 45-60 TB raw. Much of that is very idle data. But IF some of it has to move, these speeds are really not great.
Our databases and other high-IO stuff like web pages, applications etc. are already on SAS SSDs. Space there is...
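For anyone following along, the raw figure is just the replica math, assuming the size=3 pool mentioned earlier in the thread:

15-20 TB usable × 3 replicas ≈ 45-60 TB raw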
So, I did a little digging.
Ceph and its various services/tools have such a ton of features, settings, values and metrics that it is really hard to get into if you are not a specially trained professional but just a "normal" IT guy managing a small to medium-sized cluster.
I found some...
That rebalance time is just... I need to clear one server at a time, destroy the OSDs, reduce to three per node, then create new OSDs with the properly sized DB/WAL.
Clearing a server takes about 20 h. Backfill starts at about 400-500 MB/s and slows to a crawl over time.
All of this while the...
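Not sure if it applies to your release, but if the slowdown comes from the recovery throttles rather than the disks themselves, temporarily raising the backfill/recovery limits can help. A rough sketch; the values are only examples, and on newer releases with the mclock scheduler these settings may be ignored unless you switch the recovery profile or set the override flag:

ceph config set osd osd_max_backfills 4         # default 1, allows more parallel backfills per OSD
ceph config set osd osd_recovery_max_active 8   # more in-flight recovery ops per OSD
ceph config set osd osd_recovery_sleep_hdd 0    # drop the artificial pause between recovery ops on HDDs
# once the move is done, put the defaults back:
ceph config rm osd osd_max_backfills
ceph config rm osd osd_recovery_max_active
ceph config rm osd osd_recovery_sleep_hdd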
Yeah i can totally see that.
I just did some maths on our cluster and yeah, we need to resize down to 2-3 OSDs per server to get away with our 375 GB DB/WAL SSDs. Luckily we can shrink it down that much.
I just started moving EVERYTHING around and reconfiguring the OSDs. This will take about 1-2...
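The back-of-the-envelope math behind that, assuming the 375 GB device is split evenly between the OSDs on a node:

375 GB / 2 OSDs ≈ 187 GB per DB/WAL partition
375 GB / 3 OSDs = 125 GB per DB/WAL partition
375 GB / 4 OSDs ≈  93 GB per DB/WAL partition

The usual rule of thumb I've seen is a DB/WAL of a few percent of the OSD's data size; on older releases the RocksDB level sizes also made certain breakpoints (roughly 30 GB / 300 GB) matter, so a slice that is too small just spills over to the slow device anyway.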
Oh man... this is great news.
I just checked ceph daemon osd.x perf dump and yeah, I guess we are using too small a WAL/DB as well.
The slow bytes part is this one here?
"bluefs": {
"db_total_bytes": 62495121408,
"db_used_bytes": 5754585088,
"wal_total_bytes": 0...
Thanks for now
I played around with the query a bit, but it didn't really make me any wiser.
When I look for recovery, I don't find anything unusual:
And when I search for blocked, nothing jumps out at me either.
We have been through quite a bit with this Ceph already, but never...
The ratios are as follows:
root@vm-1:~# ceph osd dump | grep ratio
full_ratio 0.95
backfillfull_ratio 0.9
nearfull_ratio 0.85
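(For reference: those are the defaults. nearfull only triggers a warning, backfillfull stops backfill onto an OSD, and full blocks client writes. They can be adjusted at runtime if headroom is ever needed during a move, roughly like below; the values are only examples, and raising full_ratio is risky.)

ceph osd set-nearfull-ratio 0.87
ceph osd set-backfillfull-ratio 0.91
ceph osd set-full-ratio 0.96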
That in turn says:
root@vm-1:~# ceph df
--- RAW STORAGE ---
CLASS SIZE AVAIL USED RAW USED %RAW USED
hdd 88 TiB 51 TiB 38 TiB 38 TiB 42.73
ssd 20...
The SSD is intentionally out; it's supposed to move to a different node. I have already emptied it and it's lying here on my desk. The cluster was healthy before the new SSD was installed.