question about bond count on Ceph nodes

kajatonas

New Member
Sep 2, 2021
1
0
1
40
Hello,
In our setup every server has 2 NICs with 2 interfaces each. For simplicity and for saving network devices resources we want to bond all 4 interfaces to one bond interface, not the two as it written in some PDFs.
So there would be one bond with 4x10GBs. Is there any limitations to this setup instead 2 bond (2x10GB each) ?
In some manuals I've seen that it's specifying the configuration where one bond has public vlans and other one replication based vlans. So, is there any pain if we would add all vlans to one bond instead of two ?

Thanks !
 
It is recommended to separate CEPH backend traffic from Frontend, because of latencies.

also a 4x Bond will not help you in terms of speed as long as you do not have at least 5 nodes.
in the usual bonding method (LACP) the usage of the bond links is done on IP/MAC, so for example Node A talks with Node B over LINK A but with Node C over Link B (and so on)

So between two specific nodes you will have not more than 10 Gbit/s speed (but of course redundancy)

So my advise is to setup 2 bonds with 2 interfaces each.
 
Two points:
1a] in one-bond mode if bond fail, you lost all conectivity
1b] in two-bond mode if one bond fail, you still have something left (usabiliy depends...)
2a] in one-bond mode bond sides select, what iface will be used to send datastream
2b] in two-bond mode admin decide, what ifaces will be used by specific datastreams
 
Two points:
1a] in one-bond mode if bond fail, you lost all conectivity
1b] in two-bond mode if one bond fail, you still have something left (usabiliy depends...)
2a] in one-bond mode bond sides select, what iface will be used to send datastream
2b] in two-bond mode admin decide, what ifaces will be used by specific datastreams
That depends heaviily on the modes supported by both operating system and the switch side!

not all static bond modes work with every switch and can lead to very annoying errors. With many switch brands only active/passive bonding works reliable.

LACP is more dynamic and detects failures of ports better, but uses under linux a Hash over IP and MAC Adress to select which port is used for transmission. So don't expect more than the speed of one wire for a connection between two specific nodes.
Of course the switch has to support LACP and has to be configured correctly.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!