HA Issues

adamb

Famous Member
Mar 1, 2012
1,329
77
113
Hey all new to proxmox and very impressed!!

I am in the process of setting up a 2 node cluster with a dual DRBD primary/primary setup. I am using the IMM of our IBM servers as a fence device. After setting everything up I am hitting an issue with the HA tab on the gui. Now when accessing the HA tab I am seeing the following message and unsure what exactly the issue could be. The virtual machines are still running as they should and I can migrate from NodeA to NodeB with ease. I have yet to test the HA features as I figured this issue would cause problems. Any input is greatly appreciated. I am seeing the following message when trying to access the HA tab of the gui. I am unable to get past this error as it doesn't allow me to do anything on the tab anymore.

not well-formed (invalid-token) at line 15, column 5, byte 653 at /usr/lib/perl5/XML/Parser.pm line 187 (500)
 
root@proxmox2:~# cat /etc/pve/cluster.conf
<?xml version="1.0"?>
<cluster config_version="4" name="ccs-test">
<cman keyfile="/var/lib/pve-cluster/corosync.authkey"/>
<clusternodes>
<clusternode name="proxmox1" nodeid="1" votes="1"/>
<clusternode name="proxmox2" nodeid="2" votes="1"/>
</clusternodes>
<rm>
<pvevm autostart="1" vmid="100"/>
<pvevm autostart="1" vmid="101"/>
</rm>
</cluster>

Also when creating the cluster.conf.new should this be moved to the cluster.conf itself or left as cluster.conf.new? I appreciate the input!!
 
Just tested HA functionalities and they seem to be working quite well even with this issue. I can power down either node and all virtual machines fail over properly. Must be something simple in the config I am missing!
 
you mention configured fencing but you posted a cluster.conf without it.?
 
The fencing configuration is in cluster.conf.new. I was unsure if it needed to be copied to cluster.conf because the wiki states nothing about. So I left as is. HA is working though as it should. Still getting the error though in the HA tab.
 
check your config, post cluster.conf.new
 
Here is my cluster.conf.new

root@proxmox1:~# cat /etc/pve/cluster.conf.new
<?xml version="1.0"?>
<cluster config_version="4" name="ccs-test">
<cman expected_votes="1" keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_ipmilan" name="ipmi1" lanplus="1" ipaddr="10.80.12.126" login="USERID" passwd="PASSW0RD" power_wait="5"/>
<fencedevice agent="fence_ipmilan" name="ipmi2" lanplus="1" ipaddr="10.80.12.131" login="USERID" passwd="PASSW0RD" power_wait="5"/>
</fencedevices>
<clusternodes>
<clusternode name="proxmox1" nodeid="1" votes="1"/>
<fence>
<method name="1">
<device name="ipmi1"/>
</method>
</fence>
< clusternode name="proxmox2" nodeid="2" votes="1"/>
<fence>
<method name="2">
<device name="ipmi2"/>
</method>
</fence>
</clusternodes>
<rm>
<pvevm autostart="1" vmid="100"/>
<pvevm autostart="1" vmid="101"/>
</rm>
</cluster>
root@proxmox1:~
 
I am missing the "commit change" button. All I have is revert change and activate.

When hitting activate I am getting the following error. "config validation failed: unknown error 500"
 
When hitting activate I am getting the following error. "config validation failed: unknown error 500"

Then you still have errors in the cluster.conf.

Please check with:

# ccs_config_validate -l /etc/pve/cluster.conf.new
 
Here are the outputs

root@proxmox2:/etc/pve# ccs_config_validate -l /etc/pve/cluster.conf.new
tempfile:10: element fence: Relax-NG validity error : Expecting element clusternode, got fence
tempfile:9: element clusternode: Relax-NG validity error : Element clusternodes has extra content: clusternode
Configuration fails to validate
 
Pinned it down myself. I was missing </cman> at the top. I also had some opening and ending tag mismatches. The config validated and successfully Activated in the GUI. I really appreciate all the help!!