Hi all,
I have a network issue that started before 8.1. I had a workaround for it, but that workaround no longer works in 8.1.
I have a 9-node PVE cluster (pve0..8), all using LAGs to a single EdgeSwitch, with jumbo frames. Nodes 1 to 8 are Mac Minis from various generations, each using the internal Ethernet port plus an external Apple Gb Thunderbolt adapter. Node pve0 is a far more recent industrial PC with 6 Intel i225-V NICs. I use four of them in a bond.
On top of that, I use Ceph. I currently have 6 VMs and 2 CTs running fine.
Everything was working pretty smoothly before 8.1, with one exception: I had to set autoneg off on the i225-V NICs manually after each reboot. The NICs would properly negotiate at 1 Gb/s, but I found logs on the switch showing that the NICs were constantly renegotiating, and this degraded performance. Ceph was unusable until I turned off autoneg (with something like ethtool -s enp5s0 speed 1000 duplex full autoneg off). By the way, I had tried to put the change in /etc/network/interfaces, but it never worked, so I had to do it manually.
Since 8.1, this fix no longer works at all: the i225-V interfaces stay on autoneg, and the command seems to be ignored.
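For anyone who wants to reproduce this, one way to confirm whether the setting actually took effect is to read the state back from ethtool after applying it. The snippet below is only a rough sketch: the helper name autoneg_state is hypothetical, and enp5s0 is simply the interface from the example above.

```shell
#!/bin/sh
# Hypothetical helper: reads `ethtool <iface>` output on stdin and prints
# the current auto-negotiation state ("on" or "off").
autoneg_state() {
    # Match only the status line, not "Supports auto-negotiation" and similar.
    awk -F': ' '/^[[:space:]]*Auto-negotiation:/ { print $2 }'
}

# Usage (enp5s0 is the interface from the example above):
#   ethtool -s enp5s0 speed 1000 duplex full autoneg off
#   ethtool enp5s0 | autoneg_state
```

If this still prints "on" right after the ethtool -s call, the change was not applied to that port.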
I tried reducing the bond to only two links, changing the hash policy, and so on. Nothing works.
The weird thing is that, apart from Ceph and PBS, networking is kind of slow but usable. Ping looks normal. When I SSH to or from that node, there is a small unusual delay that I do not see with the other nodes. Oh, and between nodes 0 and 3, communication seems absolutely normal.
Any clue? For me there are two issues:
- Why did I have to turn off autoneg on the i225-V?
- Why can't I do it anymore in 8.1?