Users had been complaining about laggy VM performace on our HDD-based Ceph pool. With many users VMs having many applications that are logging to disk, I see a lot of small writes with write IOPS > 5x read IOPS.
Reading up here and elsewhere, it seems that Write-back may yield better performance than the default no-cache.
The concern here is that it had been long ingrained in me that write caching is not safe without BBU. However, with Ceph having a WAL on PLP NVMe for the pool, would I be right to assume that this would make it safe since the WAL on NVMe serves as a crash-resilient write cache and PLP is effectively BBU for the NVMe?
Reading up here and elsewhere, it seems that Write-back may yield better performance than the default no-cache.
The concern here is that it had been long ingrained in me that write caching is not safe without BBU. However, with Ceph having a WAL on PLP NVMe for the pool, would I be right to assume that this would make it safe since the WAL on NVMe serves as a crash-resilient write cache and PLP is effectively BBU for the NVMe?