Hi,
I've added a new node to my cluster, and I am unable to start fencing on it.
On blh02-12:
root@blh02-12#fence_tool join
fence_tool: fenced not running, no lockfile
I'd very much appreciate some tips on how to troubleshoot this and which logs to check. As far as I understand, the only thing you need to do for fencing to work is to enable it in /etc/default/redhat-cluster-pve.
Here is some more information. These commands were run on a different node:
clustat
Cluster Status for bl02-cluster01 @ Tue Nov 6 10:19:17 2012
Member Status: Quorate
Member Name   ID   Status
-----------   --   ------
blh02-14      1    Online, Local, rgmanager
blh02-13      2    Offline
blh02-10      3    Online, rgmanager
blh02-11      4    Online, rgmanager
blh02-12      5    Online
blh02-08      6    Online, rgmanager
fence_tool -n ls
fence domain
member count 4
victim count 0
victim now 0
master nodeid 1
wait state none
members 1 3 4 6
all nodes
nodeid 1 member 1 victim 0 last fence master 2 how agent
nodeid 2 member 0 victim 0 last fence master 1 how agent
nodeid 3 member 1 victim 0 last fence master 1 how agent
nodeid 4 member 1 victim 0 last fence master 2 how agent
nodeid 5 member 0 victim 0 last fence master 0 how none
nodeid 6 member 1 victim 0 last fence master 0 how none
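To make the mismatch between the two outputs above easier to see, here is a minimal sketch that compares the nodes clustat reports as online with the nodes listed on the fence domain "members" line. The node IDs are copied by hand from the output above; the helper name parse_members is just an illustration and not part of any cluster tool.

```python
# Node IDs that clustat reports as Online (copied by hand from the output above).
clustat_online = {1, 3, 4, 5, 6}

# The "members" line copied from the fence_tool output above.
members_line = "members 1 3 4 6"


def parse_members(line):
    """Return the set of node IDs listed on a 'members ...' line (hypothetical helper)."""
    fields = line.split()
    if not fields or fields[0] != "members":
        raise ValueError("not a members line: %r" % line)
    return {int(field) for field in fields[1:]}


fence_members = parse_members(members_line)

# Nodes that are online in the cluster but have not joined the fence domain.
missing = sorted(clustat_online - fence_members)
print("online but not in fence domain:", missing)
```

With the values above, the only node in the difference is nodeid 5 (blh02-12), the node where fence_tool join fails.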
cat /etc/pve/cluster.conf
<?xml version="1.0"?>
<cluster config_version="54" name="bl02-cluster01">
  <cman expected_votes="4" keyfile="/var/lib/pve-cluster/corosync.authkey"/>
  <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
  <clusternodes>
    <clusternode name="blh02-14" nodeid="1" votes="1">
      <fence>
        <method name="1">
          <device action="reboot" name="bla02-14"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="blh02-13" nodeid="2" votes="1">
      <fence>
        <method name="1">
          <device action="reboot" name="bla02-13"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="blh02-10" nodeid="3" votes="1">
      <fence>
        <method name="1">
          <device action="reboot" name="bla02-10"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="blh02-11" nodeid="4" votes="1">
      <fence>
        <method name="1">
          <device action="reboot" name="bla02-11"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="blh02-12" nodeid="5" votes="1">
      <fence>
        <method name="1">
          <device action="reboot" name="bla02-12"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="blh02-08" nodeid="6" votes="1">
      <fence>
        <method name="1">
          <device action="reboot" name="bla02-08"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
  <fencedevices>
    <fencedevice agent="fence_ilo" ipaddr="x.x.x.x" login="iloadmin" name="bla02-08" passwd="ccc"/>
    <fencedevice agent="fence_ilo" ipaddr="x.x.x.x" login="iloadmin" name="bla02-10" passwd="ccc"/>
    <fencedevice agent="fence_ilo" ipaddr="x.x.x.x" login="iloadmin" name="bla02-11" passwd="ccc"/>
    <fencedevice agent="fence_ilo" ipaddr="x.x.x.x" login="iloadmin" name="bla02-12" passwd="ccc"/>
    <fencedevice agent="fence_ilo" ipaddr="x.x.x.x" login="iloadmin" name="bla02-13" passwd="ccc"/>
    <fencedevice agent="fence_ilo" ipaddr="x.x.x.x" login="iloadmin" name="bla02-14" passwd="ccc"/>
  </fencedevices>