Dear Proxmox forum,
I am now struggeling since 1 day with creating a PVE7 cluster.
1st Machine is a Lenovo SR650 already containing (productive) VMs.
4 NICs:
1: connecting to 10.1.20.31 (for proxmox Webinterface, VLAN tagged 20)
2: connecting to 10.1.30.31 (for cluster networking, VLAN tagged 30)
3: for VM traffic untagged
4: NC
I created a cluster in many ways, via gui even via CLI.
For cleaning up use following commands:
systemctl stop pve-cluster
systemctl stop corosync
pmxcfs -l
rm /etc/pve/corosync.conf
rm -r /etc/corosync/*
killall pmxcfs
systemctl start pve-cluster
and remove the ghost with pvecm delnode [name]
I create the cluster with success via:
pvecm create PVE-CL-K-1 -link0 10.1.30.31
no error.
Adding the second node:
normal pc
10.1.20.46 comes from switch untagged for webinterface traffic
10.1.30.46 comes tagged
nodes can ping each other in both networks.
after that the 2nd node hangs and I am getting hung and blocked messages on the local terminal on the 2nd node...
am I blind? did I something wrong?
Both have the same local time, both rebooted... tested many times, even via gui and cli.
Thank you very much
Dirk
I am now struggeling since 1 day with creating a PVE7 cluster.
1st Machine is a Lenovo SR650 already containing (productive) VMs.
4 NICs:
1: connecting to 10.1.20.31 (for proxmox Webinterface, VLAN tagged 20)
2: connecting to 10.1.30.31 (for cluster networking, VLAN tagged 30)
3: for VM traffic untagged
4: NC
I created a cluster in many ways, via gui even via CLI.
For cleaning up use following commands:
systemctl stop pve-cluster
systemctl stop corosync
pmxcfs -l
rm /etc/pve/corosync.conf
rm -r /etc/corosync/*
killall pmxcfs
systemctl start pve-cluster
and remove the ghost with pvecm delnode [name]
I create the cluster with success via:
pvecm create PVE-CL-K-1 -link0 10.1.30.31
no error.
Code:
root@PVE-BKP-K-1:~# pvecm status
Cluster information
-------------------
Name: PVE-CL-K-1
Config Version: 5
Transport: knet
Secure auth: on
Quorum information
------------------
Date: Tue Jul 20 11:05:55 2021
Quorum provider: corosync_votequorum
Nodes: 1
Node ID: 0x00000001
Ring ID: 1.f2
Quorate: Yes
Votequorum information
----------------------
Expected votes: 1
Highest expected: 1
Total votes: 1
Quorum: 1
Flags: Quorate
Membership information
----------------------
Nodeid Votes Name
0x00000001 1 10.1.30.31 (local)
root@PVE-BKP-K-1:~#
Adding the second node:
normal pc
10.1.20.46 comes from switch untagged for webinterface traffic
10.1.30.46 comes tagged
Code:
root@PVE-K-2:~# cat /etc/network/interfaces
# network interface settings; autogenerated
# Please do NOT modify this file directly, unless you know what
# you're doing.
#
# If you want to manage parts of the network configuration manually,
# please utilize the 'source' or 'source-directory' directives to do
# so.
# PVE will preserve these directives, but will NOT read its network
# configuration from sourced files, so do not attempt to move any of
# the PVE managed interfaces into external files!
auto lo
iface lo inet loopback
iface enx002432181cbc inet manual
iface enp1s0 inet manual
auto vmbr0
iface vmbr0 inet static
address 10.1.20.46/24
gateway 10.1.20.254
bridge-ports enx002432181cbc
bridge-stp off
bridge-fd 0
auto vlan30
iface vlan30 inet static
address 10.1.30.46/24
vlan-raw-device enx002432181cbc
root@PVE-K-2:~#
nodes can ping each other in both networks.
Code:
root@PVE-K-2:~# pvecm add 10.1.20.31 -link0 10.1.30.46
Please enter superuser (root) password for '10.1.20.31': *******
Establishing API connection with host '10.1.20.31'
The authenticity of host '10.1.20.31' can't be established.
X509 SHA256 key fingerprint is 3xxxFD:2A:B0:5xx3:xx
Are you sure you want to continue connecting (yes/no)? yes
Login succeeded.
check cluster join API version
Request addition of this node
Join request OK, finishing setup locally
stopping pve-cluster service
backup old database to '/var/lib/pve-cluster/backup/config-1626771346.sql.gz'
waiting for quorum...OK
after that the 2nd node hangs and I am getting hung and blocked messages on the local terminal on the 2nd node...
am I blind? did I something wrong?
Both have the same local time, both rebooted... tested many times, even via gui and cli.
Thank you very much
Dirk