Hi,
I am running a three node hyper-converged PVE cluster where all three nodes are also CEPH nodes. Each node has an NVMe OSD organized in a CEPH pool for VM storage. There is a dedicated 10Gbe network for CEPH (and one for the PVE cluster).
For reasons still to be found, two of my OSDs went down and out. Since I couldn't get them back online, I destroyed them and recreated them. That went well.
Now, CEPH is rebalancing at around 10 MiB/s. Given that the OSDs are NVMes and the network is 10gbe, I find this surprisingly slow. What might be the reason and what could I do to speed things up?
Thanks!
I am running a three node hyper-converged PVE cluster where all three nodes are also CEPH nodes. Each node has an NVMe OSD organized in a CEPH pool for VM storage. There is a dedicated 10Gbe network for CEPH (and one for the PVE cluster).
For reasons still to be found, two of my OSDs went down and out. Since I couldn't get them back online, I destroyed them and recreated them. That went well.
Now, CEPH is rebalancing at around 10 MiB/s. Given that the OSDs are NVMes and the network is 10gbe, I find this surprisingly slow. What might be the reason and what could I do to speed things up?
Thanks!