Today I did my first crash test ;-) I will describe my results tomorrow.
So with a little delay I like to describe what I tried as my first crash test:
I populated my nodes like this:
node1: VM1 and VM2
node2: VM3 and VM4
Everything works fine, live migration is no problem.
Now I turned off node1 - no gentle shutdown, rather the hard way: Press and hold the power button via ILO.
So what happened? Of course, node1, VM1 and VM2 went off immediately.
node2, VM3 and VM4 kept on running.
Then I logged in into node2 via SSH and moved all VM configs from node1 to node2:
Code:
# mv /etc/pve/nodes/node1/qemu-server/* /etc/pve/nodes/node2/qemu-server/
That's what I wanted - business can go on and I can repair node1.
After I had finished "repairing" node1 I turned it on again. DRBD started to replicate as expected, and after a short while everything was in sync again. I migrated VM3 and VM4 back to node1 and everything looked like before.
From my point of view this was a success, what do you think?
Greets
Stephan
Last edited: