Hello,
i just need to confirm my presumption
We have 8 proxmox nodes with ceph. Just small ceph environment with 16 NVME disks. Servers are connect to Arista DCS-7060CX-32S by 2x 25gb dac cable in LACP bond.
Proxmox is on last 6 version (6.4-13) and last 15 version of ceph (15.2.15)
I can see in grafana a lot of discards. Posting screenshot. All ports looks +/- same
My question is, can this discard be harmful for ceph/proxmox (i see rentransmit in corosync log) and/or to application which needs stable network (etcd in kubernetes, etc)
I am pretty sure, that we have unsuitable switches but i just like to have some confirmation
i just need to confirm my presumption
We have 8 proxmox nodes with ceph. Just small ceph environment with 16 NVME disks. Servers are connect to Arista DCS-7060CX-32S by 2x 25gb dac cable in LACP bond.
Proxmox is on last 6 version (6.4-13) and last 15 version of ceph (15.2.15)
I can see in grafana a lot of discards. Posting screenshot. All ports looks +/- same
My question is, can this discard be harmful for ceph/proxmox (i see rentransmit in corosync log) and/or to application which needs stable network (etcd in kubernetes, etc)
I am pretty sure, that we have unsuitable switches but i just like to have some confirmation
Code:
Feb 16 12:58:22 srv1 corosync[5241]: [TOTEM ] Retransmit List: b4447f
Feb 16 14:15:48 srv1 corosync[5241]: [TOTEM ] Retransmit List: b5a7ab
Feb 16 14:17:47 srv1 corosync[5241]: [TOTEM ] Retransmit List: b5b0c3
Feb 16 14:27:50 srv1 corosync[5241]: [TOTEM ] Retransmit List: b5defd
Feb 16 14:30:19 srv1 corosync[5241]: [TOTEM ] Retransmit List: b5ea62