For better understanding of new HA mechanism, I have decided to try you pve-ha-simulator.
I started all nodes and enabled one vm:101.
Then I migrated vm:101 to node 2 so far so good.
Finally I disabled network on node2.
Simulator fenced the node2 and started vm:101 on node1, however it took 3 minutes to complete.
Does it sound normal? Can the timers be adjusted up or down?
Here is the log of simulator:
Thank you,
I started all nodes and enabled one vm:101.
Then I migrated vm:101 to node 2 so far so good.
Finally I disabled network on node2.
Simulator fenced the node2 and started vm:101 on node1, however it took 3 minutes to complete.
Does it sound normal? Can the timers be adjusted up or down?
Here is the log of simulator:
Code:
nfo 06:41:37 hardware: execute network node2 off
info 06:41:45 node2/lrm: status change active => lost_agent_lock
info 06:41:50 node2/crm: status change master => lost_manager_lock
info 06:41:50 node2/crm: status change lost_manager_lock => wait_for_quorum
info 06:42:31 watchdog: execute power node2 off
info 06:42:31 hardware: crm on node 'node2' killed by poweroff
info 06:42:31 hardware: lrm on node 'node2' killed by poweroff
info 06:42:31 hardware: execute power node2 off
info 06:42:31 hardware: server 'node2' stopped by poweroff (watchdog)
info 06:43:36 node1/crm: got lock 'ha_manager_lock'
info 06:43:36 node1/crm: status change slave => master
info 06:43:36 node1/crm: node 'node2': state changed from 'online' => 'unknown'
info 06:44:36 node1/crm: service 'vm:101': state changed from 'started' to 'fence'
info 06:44:36 node1/crm: node 'node2': state changed from 'unknown' => 'fence'
info 06:44:36 node1/crm: got lock 'ha_agent_node2_lock'
info 06:44:36 node1/crm: fencing: acknowleged - got agent lock for node 'node2'
info 06:44:36 node1/crm: node 'node2': state changed from 'fence' => 'unknown'
info 06:44:36 node1/crm: service 'vm:101': state changed from 'fence' to 'stopped'
info 06:44:36 node1/crm: service 'vm:101': state changed from 'stopped' to 'started' (node = node1)
Thank you,