HA fencing error when trying to activate

jimz0r

New Member
Nov 19, 2013
22
0
1
Hi,

i'm trying to set up a test cluster with 2 nodes in HA with fencing devices.
when i'm trying to active it at the HA tab i'm getting the follow error:

config validation failed: /usr/sbin/ccs_config_validate: line 186: 7159 Segmentation fault ccs_config_dump > $tempfile (500)

does anyone know how to solve this problem, i really can't find out what the problem is.

As fencing device we are using a APC switch AP7921

etc/pve/cluster.conf.new

<?xml version="1.0"?>
<cluster config_version="18" name="clustertest">
<cman expected_votes="1" keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_apc" name="apc" lanplus="1" ipaddr="192.168.1.60" login="domeinict" passwd="test123" power_wait="5"/>
</fencedevices>
<clusternodes>
<clusternode name="proxmox01" nodeid="1" votes="1"/>
<fence>
<method name="1">
<device name="apc"/>
</method>
</fence>
<clusternode name="proxmox02" nodeid="2" votes="1"/>
</clusternodes>
<rm>
<pvevm autostart="1" vmid="100"/>
</rm>
</cluster>
 
Code:
<?xml version="1.0"?>
<cluster config_version="18" name="clustertest">
<cman expected_votes="1" keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_apc" name="apc" lanplus="1" ipaddr="192.168.1.60" login="domeinict" passwd="test123" power_wait="5"/>
</fencedevices>
<clusternodes>
<clusternode name="proxmox01" nodeid="1" votes="1"/>

do not close the clusternode tag here! It should include the fence section:

Code:
<?xml version="1.0"?>
<cluster config_version="18" name="clustertest">
<cman expected_votes="1" keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_apc" name="apc" lanplus="1" ipaddr="192.168.1.60" login="domeinict" passwd="test123" power_wait="5"/>
</fencedevices>
<clusternodes>
<clusternode name="proxmox01" nodeid="1" votes="1">
<fence>
<method name="1">
<device name="apc"/>
</method>
</fence>
</clusternode>
<clusternode name="proxmox02" nodeid="2" votes="1"/>
</clusternodes>
<rm>
<pvevm autostart="1" vmid="100"/>
</rm>
</cluster>
 
[FONT=tahoma, arial, verdana, sans-serif]Thank you for your message.

anyway it still doensn't fix the error, still getting the error:
[/FONT]config validation failed: /usr/sbin/ccs_config_validate: line 186: 26617 Segmentation fault ccs_config_dump > $tempfile (500)[FONT=tahoma, arial, verdana, sans-serif]

I have used the following wiki page:
[/FONT]http://pve.proxmox.com/wiki/Fencing

Is there a log file which shows me more details on this error, if so what is the location?

Best Regards,
Jimmy

[FONT=tahoma, arial, verdana, sans-serif]


[/FONT]
 
Please use the following command to verify the configuration:

# ccs_config_validate -f <filename>

I just did that and it comes back with:

root@Proxmox01:/etc/pve# ccs_config_validate -f cluster.conf.new
Configuration validates
 
Hi,

after using: /etc/pve# ccs_config_validate -f cluster.conf.new it says;

Configuration validates

But when I'm trying to activate it at the web interface it stil comes with the next error:
config validation failed:

config validation failed: /usr/sbin/ccs_config_validate: line 186: 2008 Segmentation fault ccs_config_dump > $tempfile (500)

Does anyone know how to solve this issue?

Thanks
Jimmy
 
It's fixed it was something in the script.
now we have a new problem.

When i go to Node > services > start RGManager it says:
Starting Cluster Service Manager: [ OK ]
TASK OK

But the status says ''stopped''

When using
/etc/init.d/rgmanager start in CLi it says the same



root@Proxmox01:~# /etc/init.d/rgmanager start
Starting Cluster Service Manager: [ OK ]
root@Proxmox01:~#



root@Proxmox01:~# /etc/init.d/rgmanager status
rgmanager is stopped




I would really appreciate it if you could help us out with this issue as it is of high importance because we are doing this for a project.


Best regards,
Jimmy
 
Last edited:
Seriously, does nobody have any ideas? Even the admins.
So i have payed €49,99 for this terrible support forum for really bad support. This is really outrageous.
 
How do you expect to get support if you choose an unsupported setup? For HA at least 3 nodes are required. A special case is two nodes and a quorum disk.
 
How do you expect to get support if you choose an unsupported setup? For HA at least 3 nodes are required. A special case is two nodes and a quorum disk.

Thank you for your reply.
I have forgotten that it needed 3 nodes, I have installed a third node. I'm now started with the configurations.

I will update the status when everything is configured.

Thanks,
Jimmy
 
Hi,

Still the same problem.

When i go to Node > services > start RGManager it says:
Starting Cluster Service Manager: [ OK ]
TASK OK

But the status says ''stopped''

When using
/etc/init.d/rgmanager start in CLi it says the same



root@Proxmox01:~# /etc/init.d/rgmanager start
Starting Cluster Service Manager: [ OK ]
root@Proxmox01:~#



root@Proxmox01:~# /etc/init.d/rgmanager status
rgmanager is stopped


Everything is configured properly now.

- 3 Proxmox ve 3.1 Nodes
- Nodes are in cluster.
- Shared storage by NFS

Configured as HA ( followed instruction from the Fencing wiki page)

http://pve.proxmox.com/wiki/Fencing#Enable_fencing_on_all_nodes

Failover configured by CLI

http://pve.proxmox.com/wiki/Fencing#General_HowTo_for_editing_the_cluster.conf

Fail over script:

<?xml version="1.0"?>
<cluster config_version="37" name="pilotfase">
<cman keyfile="/var/lib/pve-cluster/corosync.authkey"/>
<fencedevices>
<fencedevice agent="fence_apc" ipaddr="192.168.1.60" login="hpapc" name="apc" passwd="12345678" power_wait="10"/>
</fencedevices>
<clusternodes>
<clusternode name="Proxmox01" nodeid="1" votes="1">
<fence>
<method name="power">
<device name="apc" port="1" secure="on"/>
<device name="apc" port="2" secure="on"/>
</method>
</fence>
</clusternode>
<clusternode name="Proxmox02" nodeid="2" votes="1">
<fence>
<method name="power">
<device name="apc" port="3" secure="on"/>
<device name="apc" port="4" secure="on"/>
</method>
</fence>
</clusternode>
<clusternode name="Proxmox03" nodeid="3" votes="1">
<fence>
<method name="power">
<device name="apc" port="5" secure="on"/>
<device name="apc" port="6" secure="on"/>
</method>
</fence>
</clusternode>
</clusternodes>
</cluster>



Best Regards,
Jimmy
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!