Hi,
Im having a peculiar issue.
My nodes used infiniband in connected mode for cluster network, and storage (different interfaces) .
The cluster network runs on a 10.1,100.x range, and the ipoib device looks like
ib1: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 65520
inet 10.1.100.2 netmask 255.255.255.0 broadcast 10.1.100.255
the other node is 10.1.100.1 with the same mtu.
in the corrosync config I have
nodelist {
node {
name: kvm1
nodeid: 1
quorum_votes: 1
ring0_addr: 10.1.100.1
}
node {
name: kvm2
nodeid: 2
quorum_votes: 1
ring0_addr: 10.1.100.2
}
and
interface {
bindnetaddr: 10.1.100.1
ringnumber: 0
}
Now the problem is pvecm status shows only the local node on each of the nodes, and spamming to the syslog on both nodes I see :
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
This setup worked perfectly on corrosync2.
Do you think the MTU of 65520 is in some way not compatible with corosync3 ? or is there any other gems anyone could suggest ??
All the best
Kevin M
Im having a peculiar issue.
My nodes used infiniband in connected mode for cluster network, and storage (different interfaces) .
The cluster network runs on a 10.1,100.x range, and the ipoib device looks like
ib1: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 65520
inet 10.1.100.2 netmask 255.255.255.0 broadcast 10.1.100.255
the other node is 10.1.100.1 with the same mtu.
in the corrosync config I have
nodelist {
node {
name: kvm1
nodeid: 1
quorum_votes: 1
ring0_addr: 10.1.100.1
}
node {
name: kvm2
nodeid: 2
quorum_votes: 1
ring0_addr: 10.1.100.2
}
and
interface {
bindnetaddr: 10.1.100.1
ringnumber: 0
}
Now the problem is pvecm status shows only the local node on each of the nodes, and spamming to the syslog on both nodes I see :
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:10 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:11 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
Jul 29 11:16:12 kvm2 corosync[4025739]: [KNET ] pmtud: Aborting PMTUD process: Too many attempts. MTU might have changed during discovery.
This setup worked perfectly on corrosync2.
Do you think the MTU of 65520 is in some way not compatible with corosync3 ? or is there any other gems anyone could suggest ??
All the best
Kevin M