@Alwin Thank you for the detailed explanations. I have a couple of questions about some of them.
Question 1:
When @Adam Koczarski created his 220GB partitions on the NVMe and then used the Proxmox GUI to create the OSDs and point the DB/WAL location at these partitions, is this still true?
From the referenced blog post:
"By default a partition will be created on the sdc device that is 1% of the main device size. This can be overridden with the bluestore_block_db_size config option."
...or, since he manually created the 220GB partition, will it use ALL 220GB without the need to tweak bluestore_block_db_size to match?
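To put rough numbers on what I'm asking, here is a minimal sketch of the arithmetic. The 4 TB HDD size is purely my assumption for illustration (the actual disk sizes aren't stated here), and default_db_size is a hypothetical helper that just applies the 1% rule from the quoted blog post; it is not a Ceph function.

```python
# Illustration only: compares the quoted "1% of the main device" default
# against a manually created 220GB partition.
GB = 10**9  # decimal gigabytes, as disk vendors count them


def default_db_size(main_device_bytes: int) -> int:
    """Hypothetical helper: 1% of the main device, per the quoted blog post."""
    return main_device_bytes // 100


main_device = 4000 * GB        # ASSUMPTION: a 4 TB backing HDD
manual_partition = 220 * GB    # the manually created NVMe partition

auto = default_db_size(main_device)
print(f"1% default:       {auto // GB} GB")
print(f"manual partition: {manual_partition // GB} GB")
print(f"difference:       {(manual_partition - auto) // GB} GB")
```

So the question is whether the OSD ends up with something like the smaller figure or the whole partition.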
Question 2:
"In BlueStore, the internal journaling needed for consistency is much lighter-weight, usually behaving like a metadata journal and only journaling small writes when it is faster (or necessary) to do so..."
By default, how does BlueStore decide whether a write lands on, or a read is served from, the faster NVMe rather than the backing spinners?
Based on what you've stated above and this quote from the blog you referenced, it seems that, at least with RBD, there is no way to speed up DATA reads and writes to the backing HDDs without relying on the guest OS's built-in cache, the hypervisor's cache setting for the VM's disk, or both. That is because all data needs to hit the backing storage (the HDDs/spinners in Adam's case) before being released.
At my site there's been constant debate and confusion about performance, caching, and where the DB/WAL fits into all of this. Thanks again for the time.
<D>