Hey Guys,
Ok so i've been reading up a bit. And according to most guides everyone recommends using a seperate SSD drive for journals...
I have 1 concern though. From what I read, if the SSD goes down, the entire OSD cluster on that server also goes down...
So couple of quick questions...
1. Since Ceph stores data to 2 pg then replicates, I assume any pending data is immediately lost when journal fails?
2. When the journal fails, What does one need to doto bring it back online?
3. Can proxmox handle the journal? Meaning we regularly add drives. Will we need to manually add each journal partition or can it be automated to a sense to an SSD? (meaning we define the SSD, and the system handles the partitioning as and when OSD's are added/replaced)
4. When an OSD is replaced, I assume there's a process journal wise to replacing it. What is this process?
5. Is it worth the trouble?
My main concern would be if we add the SSD, 1 SSD failure could bring an entire server (which has 15 drives) down. I currently have 6 servers with 6-8 drives each, which I am planning to convert to 3 servers with 15 6TB SAS drives each.
I would love if someone can elaborate a bit for me. I understand the basic's but I am a bit new to ceph still and don't want to have data loss again (had a rough start already )
Thanks Guys
Ok so i've been reading up a bit. And according to most guides everyone recommends using a seperate SSD drive for journals...
I have 1 concern though. From what I read, if the SSD goes down, the entire OSD cluster on that server also goes down...
So couple of quick questions...
1. Since Ceph stores data to 2 pg then replicates, I assume any pending data is immediately lost when journal fails?
2. When the journal fails, What does one need to doto bring it back online?
3. Can proxmox handle the journal? Meaning we regularly add drives. Will we need to manually add each journal partition or can it be automated to a sense to an SSD? (meaning we define the SSD, and the system handles the partitioning as and when OSD's are added/replaced)
4. When an OSD is replaced, I assume there's a process journal wise to replacing it. What is this process?
5. Is it worth the trouble?
My main concern would be if we add the SSD, 1 SSD failure could bring an entire server (which has 15 drives) down. I currently have 6 servers with 6-8 drives each, which I am planning to convert to 3 servers with 15 6TB SAS drives each.
I would love if someone can elaborate a bit for me. I understand the basic's but I am a bit new to ceph still and don't want to have data loss again (had a rough start already )
Thanks Guys