hi Cesar,
did you plug off one of the nodes?
For test of failure of DRBD, I did Plug off all NICs for DRBD, and after by CLI I did resynchronize the DRBD partitions without problems.
I try to switch from vserver (it runs verry well with active/passive DRBD setup) but i had some bad times. I have 2 Nodes and a few VM's running on each node and then i unplug one node, the desaster starts.... :-(
-For PVE, DRBD always must be in active/active mode, then you can use live migration and HA.
Note important about of Quorum:
- But if you have only two PVE nodes and no three or more in the same PVE cluster, then you will must have a line that say:
<cman keyfile="/var/lib/pve-cluster/corosync.authkey" two_node="1" expected_votes="1"/>
- With the line above, you are saying to the PVE Nodes that only 1 vote of Quorum will be enough for work. Of this manner, always you can work with only a PVE Node.
For get that HA works well with only two PVE Nodes:
-
You must have PVE in cluster and with Quorum, of other way HA don't work
- Also will be best add a configuration of "fence_manual" for the PVE Hosts in your cluster.conf file. Of this manner (talking about of fence manual), if you have a problem with a PVE Node, you must disconnect the electrical power in the PVE Node with problems, and "
only after" by CLI in the PVE Node that is alive you must run "fence_ack_manual <name_of_PVE_Node_that_was_disconnected>", then the VM(s) start in the PVE Node that is alive (This topic has been discussed several times in this forum if you want to learn how do it).
- Remember that rgmanager and "join fence" in your PVE Nodes that are in HA must are enabled (see the wiki of Proxmox about of this topic)
About of PVE Cluster and Switch:
- If the Switch of your Virtual Center supports Multicast, the PVE Cluster will works well, if not, you must configure it in unicast mode (see the wiki of Proxmox about of this topic)
Can't start the VM's on the other node because the VM's listet on the offline node and they can't manualy migrate as long the one node is offline! :-(
The network for DRBD must be in different(s) NIC(s) to the network of PVE, and the PVE Node that is alive must have quorum (i believe that it is your problem, please read "note important about of Quorum" above in this post).
After the offline node restart and come back he not found some qfiles (Discs) of some VM's and can't start some VM's on the second node either!
- What is qfiles?... qcow2 files?
- For get HA in PVE, I say that must use LVM on top of DRBD (see the wiki of Proxmox with DRBD), of this manner, you can only use the raw format and not qcow2 for the virtual hard disks of your VMs, and the virtual disks of the VMS must be in the DRBD partition.
- You can't have qcow2 files in LVM on top of DRBD with HA enabled for VMs, only raw format.
- Optionally (and not necessarily), the VMs that don't have HA enabled can have his virtual disks on other partition that aren't the DRBD partition, and of this manner, you can have qcow2 files for your virtual disks out of DRBD.
- About of you can't find the files of the virtual disks,
First: You must know that the virtual disks in the DRBD partition are "IN RAW FORMAT", and each virtual disks is a logical volume.
Second: Only the Virtual Disks that are raise for PVE, you can find it in the PVE Node that have raised.
Third: If you want find it, and the DRBD partition is in active/active mode, then you can run by CLI "vgdisplay", and will see the location
I dont know the exactly problem but i not get the point because i have a vmware cluster with 2 nodes and there is no problem about the sync. Both nodes are in the virtual datacenter and if one switched off u can start the VM on the other.?
- Only two nodes for get HA?, than strange, i believe that you don't have HA. But this isn't relevant for this thread.
I try this proxmox setup with 2 Nodes and DRBD with LVM, i not installed a quorum, so maybe with a Quorum-NFS-Mount from another Server it will work perfectly? Or need a thirt Proxmox Node?
- In a PVE cluster, Quroum "always is necessary", and when you add a PVE Node to a PVE Cluster, the quorum is added automatically.
- "Quorum-NFS-Mount" don't exist in a PVE Cluster. See above in this post my note "Note important about of Quorum".
I want to try it again because Proxmox have a better Management tool than vserver but important is the absolutly savety of the data!
- PVE is fantastic, but you will need learn many things about that.
- About of "absolutely safety of the data!", i will give a data: Did you know that DRBD have a command for verify data of replicated volumes on-Line or in-Hot (without power off anything)?
Please see this link:
http://www.drbd.org/users-guide-8.3/s-use-online-verify.html
Can somebody tell something about a NFS-Quorum from a NAS?
- This question was answered above
Best regards and good luck with your PVEs
Cesar