I'm testing HA with this scenario:
- 3 x PVE 2.1, fresh install, fully upgraded (latest full-upgrade applied)
- Fencing with iLO/IPMI working OK, on a dedicated interface
- 2 x KVM Ubuntu Server 12.04 VMs
First, I configured HA for the Ubuntu VMs, and HA placed both on the same server. Second, I isolated that server by unplugging its Ethernet cables (bonding). The cluster detected the node as down (Aug 01 15:10:31 rgmanager State change: virtualnc02 DOWN), but nothing happened with the VMs: they did not start on another node. Finally, I plugged the Ethernet cables back in, and the VMs appeared as shut down in the web interface; I had to remove them from HA to power them up again.
The whole time, rgmanager shows:
Aug 01 15:11:54 rgmanager [pvevm] VM 100 is running
Aug 01 15:11:54 rgmanager [pvevm] VM 101 is running
but that is false.
And clustat says the HA VMs (100 and 101) are on node 'virtualnc03', but they are really on 'virtualnc02', which was the isolated one:
root@virtualnc02:~# clustat
Cluster Status for VIRTUALNC @ Wed Aug 1 15:48:23 2012
Member Status: Quorate
Member Name ID Status
------ ---- ---- ------
virtualnc03 1 Online, rgmanager
virtualnc04 2 Online, rgmanager
virtualnc02 4 Online, Local, rgmanager
Service Name Owner (Last) State
------- ---- ----- ------ -----
pvevm:100 virtualnc03 started
pvevm:101 virtualnc03 started
root@virtualnc02:~# ssh virtualnc03 qm list
root@virtualnc02:~# qm list
VMID NAME STATUS MEM(MB) BOOTDISK(GB) PID
100 prueba stopped 512 15.00 0
101 prueba2 stopped 512 15.00 0
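In case it helps anyone reproduce the check above, here is a rough sketch of how the comparison could be scripted. It only parses the `pvevm:` service lines from clustat output; the function name `ha_owners` is something I made up for illustration, and the commented usage assumes passwordless ssh between the nodes.

```shell
#!/bin/sh
# ha_owners (hypothetical helper): read clustat output on stdin and print
# "VMID OWNER STATE" for every pvevm service line.
ha_owners() {
    awk '$1 ~ /^pvevm:[0-9]+$/ { sub(/^pvevm:/, "", $1); print $1, $2, $3 }'
}

# Possible usage on a cluster node (assumption: passwordless ssh works):
# clustat | ha_owners | while read -r vmid owner state; do
#     echo "VM $vmid: rgmanager reports '$state' on $owner"
#     # -n keeps ssh from swallowing the rest of the loop's input
#     ssh -n "$owner" qm list | awk -v id="$vmid" '$1 == id'
# done
```

If rgmanager reports a VM as started on a node where `qm list` shows it stopped (or does not list it at all), that is the mismatch I am describing.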
So HA is behaving erratically for me. Any ideas?