Hello there,
I would be very interested to know if you achieve your project.
Actually, I made a 3 nodes cluster installation with CEPH and a CEPH require entire disks and my servers are hosted by ovh I can't easily add disk, the PVE os size usually under 3Go, it's frustrating to have to dedicate a whole disk...
So the only solution for me was to go with USB key installation...
I choosed ZFS Raid1 from the installer and everything is working very well except :
1- It's slow, web interface unresponsive and, often timeouts in the CEPH section... is it due to ceph itself or to usb install ?
2- I fear this setup will slow down not only the hosts (witch I can live with) but the guests too : maybe some logs writes or any writes that need to be done for an operation can complete... (about that can someone tell me if it would be safe an useful to put /var/log in ram fs in addition with a syslog server for exemple ?)
3- I use 2 USB keys, often scrub with zfs to be sure everything is ok but I know usb life reputation and that's why I set a ZFS RAID1 but if a problem occur, I don't know how I will be able to tell the guy in the datacenter witch key is to removed and get half a chance that he unplug the good one... a little bit silly
With your solution, in case power failure or kernel crash, the files cannot be written back to usb, the solution is to reboot every time important changes or upgrades are done, why not... but about the CEPH part, I don't know CEPH well enough to tell if there is important files that a written on the disks and not replicated among other nodes during normal fonctionnement ?
My idea was maybe no going with a full RAM running as you want to do but maybe the most consuming directories only... can someone tell me what these directory would be ? (I think of logs and rrd graphs in which, by the way, I can see gaps sometimes, is there others)
To finish, does somebody has recommendations for my use case ? (please don't tell me not using usb, I have no choice)
For exemple, searching google, I found this manipulation, do someone has an opinion about it ? I think it could be a good compromise :
"This tip can also lead to data loss. If you do it, you will have to always shut down your computer properly from now on, because unexpected power failures will lead to data loss.
Linux usually ensures that all changes are written to disk every few seconds. Since disk writes are so slow, you can change your system to keep things in memory longer. All changes will be written to memory, and the excruciatingly slow writes to happen in the background while you continue working.
This has an instant, noticeable effect, but it can lead to data loss.
Add these lines to /etc/sysctl.conf, and reboot.
vm.swappiness = 0
vm.dirty_background_ratio = 20
vm.dirty_expire_centisecs = 0
vm.dirty_ratio = 80
vm.dirty_writeback_centisecs = 0
The problem: using this tip means that your system stops writing changes to disk until you shut down or type "sync" at a command line. If your system loses power unexpectedly, you will get bad blocks. I did. You can limit the amount of data loss in the event of a power failure to one minute by setting vm.dirty_writeback_centisecs = 6000."
Thank you for reading me and any advice or suggestion
regards