Hi there,
I have 2 VM (KVM) on a shared storage on a IMS in a cluster of 3.
Everything was working fine. Then is started to play with the enabled HA and now one of the VM is stated as failed in the Datacenter -> Summary -> HA service status:
owner
I want to get pvevm:200 out of the failed state, so I tried:
0. Restart RGManager on server2
1. Restart VM
2. Migrate VM
3. Restart Server where it is on (server 2)
4. Remove VM from HA
5. Readd VM to HA
6. Readd VM to HA when stopped.
This all didn't work. And doesn't changed the state.
I have tried to let the Vm migrate trough stop RGManager, but it runs still on that server.
So HA stopped working for that VM. For the other, everything is working correctly.
I have manually test the fence_intelmodular and it all works like a charm.
So how can I get HA again working for the pvevm:200.
I think this is one of the resons, but I can't place it:
That stats it when I restart RGManager on that server.
# clustat:
Who can point me in the right direction?
I have 2 VM (KVM) on a shared storage on a IMS in a cluster of 3.
Everything was working fine. Then is started to play with the enabled HA and now one of the VM is stated as failed in the Datacenter -> Summary -> HA service status:
owner
Name | Owner | State | Restarts | Last transition | Last owner |
pvevm:200 | none | Failed | 0 | 2011-12-27 | rupsjenooitgenoeg(server2) |
pvevm:201 | timo (server3) | started | 0 | 2012-01-02 | sint (server 1) |
I want to get pvevm:200 out of the failed state, so I tried:
0. Restart RGManager on server2
1. Restart VM
2. Migrate VM
3. Restart Server where it is on (server 2)
4. Remove VM from HA
5. Readd VM to HA
6. Readd VM to HA when stopped.
This all didn't work. And doesn't changed the state.
I have tried to let the Vm migrate trough stop RGManager, but it runs still on that server.
So HA stopped working for that VM. For the other, everything is working correctly.
I have manually test the fence_intelmodular and it all works like a charm.
So how can I get HA again working for the pvevm:200.
I think this is one of the resons, but I can't place it:
Code:
[COLOR=#000000][FONT=tahoma]task started by HA resource agent[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]VM still running - terminating now with SIGTERM[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]TASK OK[/FONT][/COLOR]
# clustat:
Code:
Cluster Status for IMS-EVO @ Fri Jan 20 10:53:00 2012
Member Status: Quorate
Member Name ID Status
------ ---- ---- ------
sint 1 Online, rgmanager
rupsjenooitgenoeg 2 Online, Local, rgmanager
timo 3 Online, rgmanager
Service Name Owner (Last) State
------- ---- ----- ------ -----
pvevm:200 (rupsjenooitgenoeg) failed
pvevm:201 timo started
Last edited: