I rebooted the whole cluster, all 4 nodes!
Do you have a running cluster with RDMA? Which steps did you take to get it running, in detail?
I'm totally stuck ...
I just succeeded in getting udaddy and rping happy. I followed elurex's suggestion... but
unpack and build the OFED driver
cd to DEBS
dpkg --force-overwrite -i *.deb
reboot
and all RDMA pingers are happy
But after changing ceph.conf to RDMA, mon and mgr are still unhappy with memlock! But I set these to infinity...
did not work ...
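One way to see which memlock limit a process really runs with is to read it from /proc rather than trusting the configuration. The following is a minimal sketch, not a fix: the helper name `memlock_of` is made up for illustration, and it assumes the daemons are started by systemd. In that case limits set through /etc/security/limits.conf or `ulimit -l` in a login shell do not reach them, and the limit would have to come from a `LimitMEMLOCK=infinity` setting in the unit.

```shell
#!/bin/sh
# Hypothetical helper: print the soft "Max locked memory" limit of a PID,
# as the kernel reports it in /proc/<pid>/limits ("unlimited" or bytes).
memlock_of() {
    awk '/^Max locked memory/ { print $4 }' "/proc/$1/limits"
}

# Limit of the current shell.
echo "this shell: $(memlock_of $$)"

# Limits of running Ceph daemons, if any (nothing is printed when none run).
for pid in $(pgrep -x ceph-mon 2>/dev/null; pgrep -x ceph-mgr 2>/dev/null); do
    echo "pid $pid: $(memlock_of "$pid")"
done
```

If the daemons exit right after start, as in the RDMAStack warning, the loop finds nothing; the value shown for the shell then only tells you what an interactive session gets, which can differ from what the service gets.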
apt-get remove proxmox-ve pve*
Reading package lists... Done
Building dependency tree
Reading state information... Done
E: Unable to locate package pve
E: Unable to locate package pveam.log
E: Couldn't find any package by glob 'pveam.log'
E: Couldn't find any package by regex...
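The errors name `pve` and `pveam.log`, which look like file names rather than packages. One likely explanation (an assumption, since the working directory is not shown) is that the shell expanded the unquoted `pve*` against files in the current directory before apt-get ever saw it. A small sketch that reproduces the effect in a scratch directory:

```shell
#!/bin/sh
# Reproduce the expansion in a scratch directory. The names are taken from
# the error messages; that they existed in the working directory is assumed.
dir=$(mktemp -d)
cd "$dir" || exit 1
mkdir pve
touch pveam.log

unquoted=$(echo pve*)     # the shell replaces the pattern with matching names
quoted=$(echo 'pve*')     # quoting passes the pattern through untouched

echo "unquoted: $unquoted"   # unquoted: pve pveam.log
echo "quoted:   $quoted"     # quoted:   pve*
```

With the pattern quoted, apt-get receives `pve*` itself and does its own matching against package names.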
I reverted my ceph.conf to TCP mode; the cluster is running...
but with these new drivers I can't rping or udaddy ...
udaddy
failed to create event channel: No such device
rping -s -C 10 -v
rdma_create_event_channel: No such device
Something got broken, I suppose ...
Any hints?
I did not succeed!
Neither mon nor mgr started.
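Both tools fail while creating the RDMA CM event channel, which librdmacm does by opening /dev/infiniband/rdma_cm. "No such device" at that point often means the `rdma_ucm` kernel module is not loaded for the running kernel. That is only a guess for this setup, so the sketch below just checks for the device node and lists the relevant modules; the helper name `have_chardev` is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the given path is a character device.
have_chardev() {
    [ -c "$1" ]
}

# Device node that librdmacm opens for event channels.
if have_chardev /dev/infiniband/rdma_cm; then
    echo "rdma_cm device present"
else
    echo "rdma_cm device missing; rdma_ucm may not be loaded (try: modprobe rdma_ucm)"
fi

# List loaded RDMA/Mellanox modules, if any, from /proc/modules.
grep -E '^(rdma_ucm|rdma_cm|ib_uverbs|ib_core|mlx4_ib|mlx4_core) ' /proc/modules 2>/dev/null \
    || echo "none of the expected modules are loaded"
```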
Jun 6 21:15:03 pve01 ceph-mgr[5485]: -2> 2018-06-06 21:15:03.683201 7f8b502d66c0 -1 RDMAStack RDMAStack!!! WARNING !!! For RDMA to work properly user memlock (ulimit -l) must be big enough to allow large amount of registered memory. We...
Hi
just updated firmware ...
the build process made a 9.4 ???
cd /usr/local/src
wget "http://content.mellanox.com/ofed/MLNX_EN-4.3-1.0.1.0/mlnx-en-4.3-1.0.1.0-debian9.1-x86_64.tgz"
tar -xzvf mlnx-en-4.3-1.0.1.0-debian9.1-x86_64.tgz
cd mlnx-en-4.3-1.0.1.0-debian9.1-x86_64/...
Thanks ... but there are some warnings ... shall I ignore them or specify '--skip-distro-check'?
Also, which firmware do you have?
# ibv_devinfo
hca_id: mlx4_0
transport: InfiniBand (0)
fw_ver: 2.40.7000
node_guid...
ConnectX-3 Pro running in 56 Gbit/s mode ... sx1002 switch ... and appropriate cables... see also my personal profile footer and my original RDMA thread...
Don't know how to proceed with this driver ...
You said to invoke "mlnx_add_kernel_support.sh" and compile .... and install all stuff in...
Can you direct me to a download link for topic #1?
The out-of-box PVE version is this one ...
strings /lib/modules/4.15.17-2-pve/kernel/drivers/net/ethernet/mellanox/mlx4/mlx4_core.ko|grep -i versio
(Installed FW version is %d.%d.%03d)
This driver version supports only revisions %d to %d
FW...
I have Proxmox 5.2 based on Debian Stretch; which drivers did you install?
Is an apt source available?
So you think I should give this another try after I gave up in October? On a now-productive cluster?
My 2 cents ...
On a 56 Gbit/s network configuration (see my signature):
Total time run: 60.022982
Total writes made: 41366
Write size: 4194304
Object size: 4194304
Bandwidth (MB/sec): 2756.68
Stddev Bandwidth: 174.339
Max bandwidth (MB/sec): 2976
Min...
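The reported average bandwidth follows directly from the other figures: total writes times write size, divided by the run time. A small sketch of that arithmetic, taking 1 MB as 1048576 bytes (which is what makes the numbers above agree); the helper name `bench_bw` is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: average bandwidth in MB/s (1 MB = 1048576 bytes here)
# from write count, write size in bytes, and run time in seconds.
bench_bw() {
    awk -v n="$1" -v size="$2" -v t="$3" \
        'BEGIN { printf "%.2f\n", n * size / 1048576 / t }'
}

bench_bw 41366 4194304 60.022982   # prints 2756.68
```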