*solved* cluster.conf /usr/sbin/ccs_config_validate: Segmentation fault

RobFantini

Famous Member
May 24, 2012
2,042
107
133
Boston,Mass
** Solved, this was caused by operator error. **

this is wrong usage:
Code:
ccs_config_validate -l /etc/pve/cluster.conf.new

this works:
Code:
ccs_config_validate  -f /etc/pve/cluster.conf.new

Hello
I get a seg fault trying to validate a new config.
Here is a working config:
Code:
<?xml version="1.0"?>                                                                                         
<cluster name="fbc201408" config_version="9">                                                                 
                                                                                                                      
     <fencedevices>                                                                                                   
       <fencedevice agent="fence_apc" ipaddr="10.1.10.11" login="fbcadmin" name="apc11" passwd="xxxxxi" power_wait="10"/>
       <fencedevice agent="fence_apc" ipaddr="10.1.10.78" login="fbcadmin" name="apc78" passwd="xxxxxi" power_wait="10"/>
       <fencedevice agent="fence_apc" ipaddr="10.1.10.88" login="fbcadmin" name="apc88" passwd="xxxxxi" power_wait="10"/>    
     </fencedevices>


  <cman keyfile="/var/lib/pve-cluster/corosync.authkey">
  </cman>

  <clusternodes>
  <clusternode name="dell1-ib" votes="1" nodeid="1"/>
  <clusternode name="ceph2-ib" votes="1" nodeid="3"/><clusternode name="ceph4-ib" votes="1" nodeid="4"/><clusternode name="dell-2-ib" votes="1" nodeid="2"/></clusternodes>

</cluster>

so I try to change to this:
Code:
<?xml version="1.0"?>
<cluster name="fbc201408" config_version="10">

     <fencedevices>
       <fencedevice agent="fence_apc" ipaddr="10.1.10.11" login="fbcadmin" name="apc11" passwd="xxxx" power_wait="10"/>
       <fencedevice agent="fence_apc" ipaddr="10.1.10.78" login="fbcadmin" name="apc78" passwd="xxxx" power_wait="10"/>
       <fencedevice agent="fence_apc" ipaddr="10.1.10.88" login="fbcadmin" name="apc88" passwd="xxxx" power_wait="10"/>
     </fencedevices>


  <cman keyfile="/var/lib/pve-cluster/corosync.authkey">
  </cman>

    <clusternodes>
     <clusternode name="dell-1-ib" votes="1" nodeid="1">
               <fence>
                       <method name="power">
                               <device name="apc88" port="1" secure="on" action="off"/>
                               <device name="apc78" port="4" secure="on" action="off"/>
                               <device name="apc88" port="1" secure="on" action="on"/>
                               <device name="apc78" port="4" secure="on" action="on"/>
                       </method>
               </fence>
     </clusternode>
                <clusternode name="ceph2-ib" votes="1" nodeid="3"/><clusternode name="ceph4-ib" votes="1" nodeid="4"/><clusternode name="dell-2-ib" votes="1" nodeid="2"/></clusternodes>

</cluster>

and get this error on validate:
Code:
ccs_config_validate -l /etc/pve/cluster.conf.new
/usr/sbin/ccs_config_validate: line 186: 60308 Segmentation fault      ccs_config_dump > $tempfile

Unable to get the configuration

pveversion -v:
Code:
proxmox-ve-2.6.32: 3.3-139 (running kernel: 2.6.32-34-pve)
pve-manager: 3.3-5 (running version: 3.3-5/bfebec03)
pve-kernel-2.6.32-32-pve: 2.6.32-136
pve-kernel-2.6.32-30-pve: 2.6.32-130
pve-kernel-2.6.32-29-pve: 2.6.32-126
pve-kernel-2.6.32-34-pve: 2.6.32-140
pve-kernel-2.6.32-31-pve: 2.6.32-132
lvm2: 2.02.98-pve4
clvm: 2.02.98-pve4
corosync-pve: 1.4.7-1
openais-pve: 1.1.4-3
libqb0: 0.11.1-2
redhat-cluster-pve: 3.2.0-2
resource-agents-pve: 3.9.2-4
fence-agents-pve: 4.0.10-1
pve-cluster: 3.0-15
qemu-server: 3.3-3
pve-firmware: 1.1-3
libpve-common-perl: 3.0-19
libpve-access-control: 3.0-15
libpve-storage-perl: 3.0-25
pve-libspice-server1: 0.12.4-3
vncterm: 1.1-8
vzctl: 4.0-1pve6
vzprocps: 2.0.11-2
vzquota: 3.1-2
pve-qemu-kvm: 2.1-10
ksm-control-daemon: 1.1-1
glusterfs-client: 3.5.2-1

any clues on how to solve?


best regards, rob
 
Last edited:
Re: cluster.conf /usr/sbin/ccs_config_validate: Segmentation fault

it looks like something changed in ccs_config_validate .

I tried running ccs_config_validate against a backup cluster.conf from 7-2014 and got the segmentation fault .

Code:
ccs_config_validate -l cluster.conf-july2014
/usr/sbin/ccs_config_validate: line 186: 406448 Segmentation fault      ccs_config_dump > $tempfile

Unable to get the configuration

ccs_config_dump does work from cli .
 
Re: cluster.conf /usr/sbin/ccs_config_validate: Segmentation fault

Ok next I tried copy and paste of fence_apc code from pve wiki, changed to our node names and got the same result.

It seems something changed in fence_apc .

I'll test again on next PVE release update, or set up IPMI .