Hello!
I've got a problem with HA on a 3-server cluster. Each node has 3 ports: management, a bridge for VMs, and a network with test iSCSI storage.
I manually shut down the management port for node3 on the switch to simulate network problems on node3. Fencing works fine: I can see the 2 other ports (VM bridge and iSCSI storage) disabled on the network switch.
Then I re-enable the management port and try to restart the services on node3 (the commands and their output are in the first Code block below).
Rgmanager just hangs. Nothing I do brings cman and rgmanager back; the only way out is a reboot.
In /etc/pve/cluster.conf (full file below), the first fence port listed for each node is the VM bridge and the second is the network with the test iSCSI storage.
What can I do to bring node3 back into the cluster? Is it possible without a reboot?
P.S. FENCE_JOIN="yes" is configured on all nodes.
P.P.S. Scheme
Restart attempt on node3 after re-enabling the management port:
Code:
root@proxmox3:~# /etc/init.d/pve-cluster restart
Restarting pve cluster filesystem: pve-cluster.
root@proxmox3:~# /etc/init.d/pvedaemon restart
Restarting PVE Daemon: pvedaemon.
root@proxmox3:~# /etc/init.d/cman restart
Stopping cluster:
Leaving fence domain... [ OK ]
Stopping dlm_controld... [ OK ]
Stopping fenced... [ OK ]
Stopping cman... [ OK ]
Unloading kernel modules... [ OK ]
Unmounting configfs... [ OK ]
Starting cluster:
Checking if cluster has been disabled at boot... [ OK ]
Checking Network Manager... [ OK ]
Global setup... [ OK ]
Loading kernel modules... [ OK ]
Mounting configfs... [ OK ]
Starting cman... [ OK ]
Waiting for quorum... [ OK ]
Starting fenced... [ OK ]
Starting dlm_controld... [ OK ]
Tuning DLM kernel config... [ OK ]
Unfencing self... fence_node: cannot connect to cman
[FAILED]
/etc/pve/cluster.conf
Code:
<?xml version="1.0"?>
<cluster config_version="7" name="rnet-cluster">
  <cman keyfile="/var/lib/pve-cluster/corosync.authkey"/>
  <fencedevices>
    <fencedevice agent="fence_ifmib" community="test-fencing" ipaddr="192.168.100.1" name="test-switch" snmp_version="2c"/>
  </fencedevices>
  <clusternodes>
    <clusternode name="proxmox1" nodeid="1" votes="1">
      <fence>
        <method name="1">
          <device action="off" name="test-switch" port="FastEthernet0/33"/>
          <device action="off" name="test-switch" port="FastEthernet0/36"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="proxmox2" nodeid="2" votes="1">
      <fence>
        <method name="1">
          <device action="off" name="test-switch" port="FastEthernet0/34"/>
          <device action="off" name="test-switch" port="FastEthernet0/37"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="proxmox3" nodeid="3" votes="1">
      <fence>
        <method name="1">
          <device action="off" name="test-switch" port="FastEthernet0/35"/>
          <device action="off" name="test-switch" port="FastEthernet0/38"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
  <rm>
    <pvevm autostart="1" vmid="101"/>
  </rm>
</cluster>
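To double-check which switch ports each node's fence method turns off, the config above can be read with a small script. This is only a rough sketch and not part of any Proxmox or cman tooling: the function name `fence_ports` and the trimmed `SAMPLE` string are made up for illustration, and it assumes the layout shown above (one `<device action="off" .../>` element per port).

```python
import xml.etree.ElementTree as ET


def fence_ports(conf_xml):
    """Map each cluster node name to the switch ports its fence method turns off.

    Hypothetical helper for illustration; assumes the cluster.conf layout
    shown in this post.
    """
    root = ET.fromstring(conf_xml)
    ports = {}
    for node in root.iter("clusternode"):
        ports[node.get("name")] = [
            dev.get("port")
            for dev in node.iter("device")
            if dev.get("action") == "off"
        ]
    return ports


# Trimmed copy of the proxmox3 entry from the cluster.conf in this post.
SAMPLE = """<cluster config_version="7" name="rnet-cluster">
  <clusternodes>
    <clusternode name="proxmox3" nodeid="3" votes="1">
      <fence>
        <method name="1">
          <device action="off" name="test-switch" port="FastEthernet0/35"/>
          <device action="off" name="test-switch" port="FastEthernet0/38"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
</cluster>"""

if __name__ == "__main__":
    for name, node_ports in fence_ports(SAMPLE).items():
        print(name, ", ".join(node_ports))
```

For the sample, this lists FastEthernet0/35 and FastEthernet0/38 for proxmox3, which are the two ports I see disabled on the switch after fencing. Feeding it the contents of the real file instead of `SAMPLE` should list the ports for all three nodes.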
Code:
root@proxmox3:~# fence_tool ls
root@proxmox3:~# clustat
Could not connect to CMAN: No such file or directory
root@proxmox3:~# pvecm status
cman_tool: Cannot open connection to cman, is it running ?