Hi all,
I'm having problems with my two-node Proxmox cluster. One node is a home-built PC that runs several VMs, and the other is a second-hand Intel NUC that I use to play around with Plex and hardware transcoding using Intel Quick Sync.
I joined these two in a cluster, but have noticed that I get a lot of corosync dropouts. I've been experimenting with the configuration, but still haven't managed to get a setup that works.
Error:
Code:
Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] link: host: 1 link: 0 is down
Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] link: host: 1 link: 1 is down
Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] host: host: 1 has 0 active links
Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] host: host: 1 has no active links
Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] host: host: 1 has 0 active links
Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] host: host: 1 has no active links
Jun 28 00:33:31 mel-pm2 corosync[793213]: [TOTEM ] Token has not been received in 2250 ms
Jun 28 00:33:32 mel-pm2 corosync[793213]: [KNET ] rx: host: 1 link: 0 is up
Jun 28 00:33:32 mel-pm2 corosync[793213]: [KNET ] rx: host: 1 link: 1 is up
Jun 28 00:33:32 mel-pm2 corosync[793213]: [KNET ] host: host: 1 has 2 active links
Jun 28 00:33:32 mel-pm2 corosync[793213]: [KNET ] host: host: 1 has 2 active links
Jun 28 00:33:32 mel-pm2 corosync[793213]: [TOTEM ] A processor failed, forming new configuration: token timed out (3000ms), waiting 3600ms for consensus.
Jun 28 00:33:34 mel-pm2 corosync[793213]: [QUORUM] Sync members[2]: 1 2
Jun 28 00:33:34 mel-pm2 corosync[793213]: [TOTEM ] A new membership (1.2315) was formed. Members
Jun 28 00:33:34 mel-pm2 corosync[793213]: [QUORUM] Members[2]: 1 2
Jun 28 00:33:34 mel-pm2 corosync[793213]: [MAIN ] Completed service synchronization, ready to provide service.
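To put a number on how long each dropout lasts, a rough Python sketch along these lines could pair up the KNET "is down" and "is up" lines from the journal. The function name `link_outages`, the regex, and the placeholder year are assumptions made for illustration; none of this is part of corosync or Proxmox.

```python
import re
from datetime import datetime

# Matches KNET link state lines such as:
#   Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] link: host: 1 link: 0 is down
LINK_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+ \d{2}:\d{2}:\d{2}) \S+ corosync\[\d+\]:\s+"
    r"\[KNET\s*\] (?:link|rx): host: (?P<host>\d+) link: (?P<link>\d+) is (?P<state>up|down)"
)


def link_outages(lines, year=2000):
    """Return (host, link, seconds_down) for each down -> up pair found."""
    down_since = {}
    outages = []
    for line in lines:
        m = LINK_RE.match(line.strip())
        if not m:
            continue
        # Syslog timestamps carry no year, so an arbitrary one is supplied.
        ts = datetime.strptime(f"{year} {m['ts']}", "%Y %b %d %H:%M:%S")
        key = (int(m["host"]), int(m["link"]))
        if m["state"] == "down":
            # Keep the first "down" time if the line is repeated.
            down_since.setdefault(key, ts)
        elif key in down_since:
            outages.append((*key, (ts - down_since.pop(key)).total_seconds()))
    return outages


SAMPLE = [
    "Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] link: host: 1 link: 0 is down",
    "Jun 28 00:33:29 mel-pm2 corosync[793213]: [KNET ] link: host: 1 link: 1 is down",
    "Jun 28 00:33:32 mel-pm2 corosync[793213]: [KNET ] rx: host: 1 link: 0 is up",
    "Jun 28 00:33:32 mel-pm2 corosync[793213]: [KNET ] rx: host: 1 link: 1 is up",
]

if __name__ == "__main__":
    for host, link, seconds in link_outages(SAMPLE):
        print(f"host {host} link {link} was down for {seconds:.0f}s")
```

For the excerpt above, both links to host 1 come back after 3 seconds, which lines up with the 3000 ms token timeout being hit.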
This is my current `/etc/corosync/corosync.conf`. I attempted to add a ring1 using IPv4, so that the two nodes have both an IPv6 path and an IPv4 path to try:
Code:
logging {
  debug: off
  to_syslog: yes
}
nodelist {
  node {
    name: mel-pm
    nodeid: 1
    quorum_votes: 1
    ring0_addr: <ipv6 prefix>:100::1
    ring1_addr: 172.31.1.1
  }
  node {
    name: mel-pm2
    nodeid: 2
    quorum_votes: 1
    ring0_addr: <ipv6 prefix>:100::2
    ring1_addr: 172.31.1.2
  }
}
quorum {
  provider: corosync_votequorum
}
totem {
  cluster_name: Melbourne
  config_version: 2
  interface {
    linknumber: 0
  }
  ip_version: ipv4-6
  link_mode: passive
  secauth: on
  version: 2
  link_mode: active
  token: 3000
}
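After this much hand-editing, it may be worth checking the file for keys that appear twice in the same section. The following is a minimal Python sketch, assuming well-formed `section { key: value }` syntax with balanced braces; `duplicate_keys` is a hypothetical helper written for this post, not a corosync tool, and it does no other validation.

```python
from collections import Counter


def duplicate_keys(conf_text):
    """Return (section_path, key) pairs for keys repeated within one block."""
    path = []            # names of the currently open sections
    seen = [Counter()]   # one key counter per open block, top level first
    dupes = []
    for raw in conf_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.endswith("{"):
            # Opening a section: start a fresh counter for it.
            path.append(line[:-1].strip())
            seen.append(Counter())
        elif line == "}":
            # Closing a section: report any key counted more than once.
            block = seen.pop()
            dupes += [("/".join(path), k) for k, n in block.items() if n > 1]
            path.pop()
        elif ":" in line:
            # Only the text before the first colon is the key, so IPv6
            # addresses in the value are left alone.
            seen[-1][line.split(":", 1)[0].strip()] += 1
    dupes += [("", k) for k, n in seen[0].items() if n > 1]
    return dupes


SAMPLE = """
totem {
  cluster_name: Melbourne
  interface {
    linknumber: 0
  }
  link_mode: passive
  secauth: on
  link_mode: active
  token: 3000
}
"""

if __name__ == "__main__":
    for section, key in duplicate_keys(SAMPLE):
        print(f"duplicate key '{key}' in section '{section}'")
```

Run against the config above, this flags `link_mode` in the `totem` section, which is set to both `passive` and `active`. I'm not sure which of the two values corosync ends up using.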
I tried extending the token timeout to 3000 ms, but this doesn't seem to have fixed anything.
Both systems are plugged into the same switch, so there is no firewall, WAN link, or anything similar between the nodes.
Has anyone come across this before and managed to resolve it?