I am building a new CEPH PMX cluster for someone and I am not seeing the performance I would expect. Any thoughts would be welcomed, I am pushing for a CEPH setup over them creating 6x individual nodes since the app needs to have HA capabilities. Worst case we will go to ZFS replicated, but seems a shame to waste such a good opportunity.
6 Nodes: 2xEPYC 7452, 512GB, 2x10Gbps PMX network, 2x 40Gbps Storage network, 2 NMVE OS drives, 6x Intel SSDPE2KX040T8 NVME for CEPH
They want to run VMs on the nodes with HA/Replication which I know will degrade performance some, but I am not seeing better than SSD performance at this time.
proxmox-ve: 6.3-1 (running kernel: 5.4.98-1-pve)
ceph version 15.2.8 (8b89984e92223ec320fb4c70589c39f384c86985) octopus (stable)
NICs: 9000 MTU bonded as active-backup currently
dd bs=1M count=256 if=/dev/zero of=test conv=fdatasync
989MB/S - 1GB/S
fio --randrepeat=1 --ioengine=libaio --direct=1 --gtod_reduce=1 --name=test --filename=test --bs=4k --iodepth=64 --size=4G --readwrite=randrw --rwmixread=75
read: IOPS=24.1k, BW=94.1MiB/s (98.7MB/s)(3070MiB/32613msec)
bw ( KiB/s): min=85008, max=125832, per=100.00%, avg=96473.95, stdev=11087.47, samples=65
iops : min=21252, max=31458, avg=24118.42, stdev=2771.86, samples=65
write: IOPS=8053, BW=31.5MiB/s (32.0MB/s)(1026MiB/32613msec); 0 zone resets
bw ( KiB/s): min=27553, max=41384, per=100.00%, avg=32244.46, stdev=3712.44, samples=65
iops : min= 6888, max=10346, avg=8061.05, stdev=928.11, samples=65
6 Nodes: 2xEPYC 7452, 512GB, 2x10Gbps PMX network, 2x 40Gbps Storage network, 2 NMVE OS drives, 6x Intel SSDPE2KX040T8 NVME for CEPH
They want to run VMs on the nodes with HA/Replication which I know will degrade performance some, but I am not seeing better than SSD performance at this time.
proxmox-ve: 6.3-1 (running kernel: 5.4.98-1-pve)
ceph version 15.2.8 (8b89984e92223ec320fb4c70589c39f384c86985) octopus (stable)
NICs: 9000 MTU bonded as active-backup currently
dd bs=1M count=256 if=/dev/zero of=test conv=fdatasync
989MB/S - 1GB/S
fio --randrepeat=1 --ioengine=libaio --direct=1 --gtod_reduce=1 --name=test --filename=test --bs=4k --iodepth=64 --size=4G --readwrite=randrw --rwmixread=75
read: IOPS=24.1k, BW=94.1MiB/s (98.7MB/s)(3070MiB/32613msec)
bw ( KiB/s): min=85008, max=125832, per=100.00%, avg=96473.95, stdev=11087.47, samples=65
iops : min=21252, max=31458, avg=24118.42, stdev=2771.86, samples=65
write: IOPS=8053, BW=31.5MiB/s (32.0MB/s)(1026MiB/32613msec); 0 zone resets
bw ( KiB/s): min=27553, max=41384, per=100.00%, avg=32244.46, stdev=3712.44, samples=65
iops : min= 6888, max=10346, avg=8061.05, stdev=928.11, samples=65