Cluster HA fence_ifmib

vince_122

Apr 10, 2012
Hi there,

I am trying to enable HA for my cluster with 3 nodes.
As we have no budget, the only way to "fence" my nodes is the fence_ifmib agent.

I'm trying to implement it, but I can't find where my error is.

Here is my cluster.conf:

<?xml version="1.0"?>
<cluster config_version="43" name="CLUSTER-PROXMOX">
<cman keyfile="/var/lib/pve-cluster/corosync.authkey"/>
<fencedevices>
<fencedevice agent="fence_ifmib" ipaddr="[SW_IP]" community="[MY_COM]" snmp_version="2c" name="fence"/>
</fencedevices>
<clusternodes>
<clusternode name="vxplfsrv17" nodeid="1" votes="1"/>
<fence>
<method name="FENCE">
<device name="fence" port="1002"/>
</method>
</fence>
<clusternode name="vxplfsrv18" nodeid="2" votes="1"/>
<clusternode name="vxplfsrv19" nodeid="3" votes="1"/>
</clusternodes>
<rm/>
</cluster>

Where is my error? When I save the file, I get:

[dcdb] notice: wrote new cluster config '/etc/cluster/cluster.conf'
[dcdb] crit: cman_tool version failed with exit code 1#010

If I run the agent directly from the command line:
./fence_ifmib -o status -a [SW_IP] -d 2c -c [MY_COM] -n 1002
Status: ON

Thanks for helping,
Vince.
 
Yes, I made the modification each time.

But I looked through all the syslogs, and I see the error only on my first node, not on the others:

Node 1:
[dcdb] notice: wrote new cluster config '/etc/cluster/cluster.conf'
[dcdb] crit: cman_tool version failed with exit code 1#010

Nodes 2 & 3:
pmxcfs[1495]: [dcdb] notice: wrote new cluster config '/etc/cluster/cluster.conf'

On all nodes, /etc/cluster/cluster.conf is at version 42.
But "cman_tool version" returns:
6.2.0 config 41

Is that normal?

How can I debug this cman_tool error?

Thanks for everything!
Vince.
 
Answering my own question:

I deleted a block of lines, and everything worked!

Now it looks like this:
<?xml version="1.0"?>
<cluster config_version="43" name="CLUSTER-PROXMOX">
<cman keyfile="/var/lib/pve-cluster/corosync.authkey"/>
<fencedevices>
<fencedevice agent="fence_ifmib" ipaddr="135.117.232.231" community="complf" snmp_version="2c" name="fence"/>
</fencedevices>
<clusternodes>
<clusternode name="vxplfsrv17" nodeid="1" votes="1"/>
<clusternode name="vxplfsrv18" nodeid="2" votes="1"/>
<clusternode name="vxplfsrv19" nodeid="3" votes="1"/>
</clusternodes>
<rm>
</rm>
</cluster>
But what was the error in this group?


<clusternode name="vxplfsrv17" nodeid="1" votes="1"/>
<fence>
<method name="FENCE">
<device name="fence" port="1002"/>
</method>
</fence>


Thanks again..
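
A likely explanation, offered as a sketch rather than a confirmed diagnosis: in the first config, the line <clusternode name="vxplfsrv17" ... /> is self-closing, so the <fence> block that follows it ends up as a sibling of the node directly under <clusternodes> instead of a child of that node. In cluster.conf the <fence> element belongs inside its <clusternode>. Assuming port 1002 is the switch interface for vxplfsrv17 (as in the original attempt), the nesting would look like this:

```xml
<clusternodes>
  <!-- clusternode is opened here instead of self-closed, so it can contain <fence> -->
  <clusternode name="vxplfsrv17" nodeid="1" votes="1">
    <fence>
      <method name="FENCE">
        <!-- name="fence" refers to the fencedevice defined above; port 1002 is taken from the original attempt -->
        <device name="fence" port="1002"/>
      </method>
    </fence>
  </clusternode>
  <!-- the other nodes would each need their own <fence> block with their own switch port; those ports are not given in this thread -->
  <clusternode name="vxplfsrv18" nodeid="2" votes="1"/>
  <clusternode name="vxplfsrv19" nodeid="3" votes="1"/>
</clusternodes>
```

If that is indeed the cause, a config rejected for its structure would also fit "cman_tool version" still reporting the older config number. config_version has to be increased again when the corrected file is saved.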
 
