HA fencing error when trying to activate

jimz0r

New Member
Nov 19, 2013
22
0
1
Hi,

i'm trying to set up a test cluster with 2 nodes in HA with fencing devices.
when i'm trying to active it at the HA tab i'm getting the follow error:

config validation failed: /usr/sbin/ccs_config_validate: line 186: 7159 Segmentation fault ccs_config_dump > $tempfile (500)

does anyone know how to solve this problem, i really can't find out what the problem is.

As fencing device we are using a APC switch AP7921

etc/pve/cluster.conf.new

<?xml version="1.0"?>
<cluster config_version="18" name="clustertest">
<cman expected_votes="1" keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_apc" name="apc" lanplus="1" ipaddr="192.168.1.60" login="domeinict" passwd="test123" power_wait="5"/>
</fencedevices>
<clusternodes>
<clusternode name="proxmox01" nodeid="1" votes="1"/>
<fence>
<method name="1">
<device name="apc"/>
</method>
</fence>
<clusternode name="proxmox02" nodeid="2" votes="1"/>
</clusternodes>
<rm>
<pvevm autostart="1" vmid="100"/>
</rm>
</cluster>
 
Code:
<?xml version="1.0"?>
<cluster config_version="18" name="clustertest">
<cman expected_votes="1" keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_apc" name="apc" lanplus="1" ipaddr="192.168.1.60" login="domeinict" passwd="test123" power_wait="5"/>
</fencedevices>
<clusternodes>
<clusternode name="proxmox01" nodeid="1" votes="1"/>

do not close the clusternode tag here! It should include the fence section:

Code:
<?xml version="1.0"?>
<cluster config_version="18" name="clustertest">
<cman expected_votes="1" keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_apc" name="apc" lanplus="1" ipaddr="192.168.1.60" login="domeinict" passwd="test123" power_wait="5"/>
</fencedevices>
<clusternodes>
<clusternode name="proxmox01" nodeid="1" votes="1">
<fence>
<method name="1">
<device name="apc"/>
</method>
</fence>
</clusternode>
<clusternode name="proxmox02" nodeid="2" votes="1"/>
</clusternodes>
<rm>
<pvevm autostart="1" vmid="100"/>
</rm>
</cluster>
 
[FONT=tahoma, arial, verdana, sans-serif]Thank you for your message.

anyway it still doensn't fix the error, still getting the error:
[/FONT]config validation failed: /usr/sbin/ccs_config_validate: line 186: 26617 Segmentation fault ccs_config_dump > $tempfile (500)[FONT=tahoma, arial, verdana, sans-serif]

I have used the following wiki page:
[/FONT]http://pve.proxmox.com/wiki/Fencing

Is there a log file which shows me more details on this error, if so what is the location?

Best Regards,
Jimmy

[FONT=tahoma, arial, verdana, sans-serif]


[/FONT]
 
Please use the following command to verify the configuration:

# ccs_config_validate -f <filename>

I just did that and it comes back with:

root@Proxmox01:/etc/pve# ccs_config_validate -f cluster.conf.new
Configuration validates
 
Hi,

after using: /etc/pve# ccs_config_validate -f cluster.conf.new it says;

Configuration validates

But when I'm trying to activate it at the web interface it stil comes with the next error:
config validation failed:

config validation failed: /usr/sbin/ccs_config_validate: line 186: 2008 Segmentation fault ccs_config_dump > $tempfile (500)

Does anyone know how to solve this issue?

Thanks
Jimmy
 
It's fixed it was something in the script.
now we have a new problem.

When i go to Node > services > start RGManager it says:
Starting Cluster Service Manager: [ OK ]
TASK OK

But the status says ''stopped''

When using
/etc/init.d/rgmanager start in CLi it says the same



root@Proxmox01:~# /etc/init.d/rgmanager start
Starting Cluster Service Manager: [ OK ]
root@Proxmox01:~#



root@Proxmox01:~# /etc/init.d/rgmanager status
rgmanager is stopped




I would really appreciate it if you could help us out with this issue as it is of high importance because we are doing this for a project.


Best regards,
Jimmy
 
Last edited:
Seriously, does nobody have any ideas? Even the admins.
So i have payed €49,99 for this terrible support forum for really bad support. This is really outrageous.
 
How do you expect to get support if you choose an unsupported setup? For HA at least 3 nodes are required. A special case is two nodes and a quorum disk.
 
How do you expect to get support if you choose an unsupported setup? For HA at least 3 nodes are required. A special case is two nodes and a quorum disk.

Thank you for your reply.
I have forgotten that it needed 3 nodes, I have installed a third node. I'm now started with the configurations.

I will update the status when everything is configured.

Thanks,
Jimmy
 
Hi,

Still the same problem.

When i go to Node > services > start RGManager it says:
Starting Cluster Service Manager: [ OK ]
TASK OK

But the status says ''stopped''

When using
/etc/init.d/rgmanager start in CLi it says the same



root@Proxmox01:~# /etc/init.d/rgmanager start
Starting Cluster Service Manager: [ OK ]
root@Proxmox01:~#



root@Proxmox01:~# /etc/init.d/rgmanager status
rgmanager is stopped


Everything is configured properly now.

- 3 Proxmox ve 3.1 Nodes
- Nodes are in cluster.
- Shared storage by NFS

Configured as HA ( followed instruction from the Fencing wiki page)

http://pve.proxmox.com/wiki/Fencing#Enable_fencing_on_all_nodes

Failover configured by CLI

http://pve.proxmox.com/wiki/Fencing#General_HowTo_for_editing_the_cluster.conf

Fail over script:

<?xml version="1.0"?>
<cluster config_version="37" name="pilotfase">
<cman keyfile="/var/lib/pve-cluster/corosync.authkey"/>
<fencedevices>
<fencedevice agent="fence_apc" ipaddr="192.168.1.60" login="hpapc" name="apc" passwd="12345678" power_wait="10"/>
</fencedevices>
<clusternodes>
<clusternode name="Proxmox01" nodeid="1" votes="1">
<fence>
<method name="power">
<device name="apc" port="1" secure="on"/>
<device name="apc" port="2" secure="on"/>
</method>
</fence>
</clusternode>
<clusternode name="Proxmox02" nodeid="2" votes="1">
<fence>
<method name="power">
<device name="apc" port="3" secure="on"/>
<device name="apc" port="4" secure="on"/>
</method>
</fence>
</clusternode>
<clusternode name="Proxmox03" nodeid="3" votes="1">
<fence>
<method name="power">
<device name="apc" port="5" secure="on"/>
<device name="apc" port="6" secure="on"/>
</method>
</fence>
</clusternode>
</clusternodes>
</cluster>



Best Regards,
Jimmy
 
Last edited: