Thanks for your suggestions!
I know that if I clustered these nodes I could control them from any node. I'm coming from a XenServer environment where getting control of all nodes was as easy as typing in an IP into a textbox on client software. I'm reluctant to start making these nodes aware of each other. I'll outline my fears below:
1. Clustering means I'll have to move around, and rebuild 200+ VMs since their VMIDs overlap (across each node, eg, there are 7 VMs with the VMID of 101, etc)
2. Clustering means I'll have to rethink the network setup. As it stands I'm using vmbr1 with NAT masquerading to create an internal network for each node, which is where the VMs live. We structure our VMs to have the vmbr1 network with vmid as the ending subnet. e.g. VM with VMID of 101 on vhost7 has an IP of 10.5.7.101. This makes it easy to remember IPs of VMs.
Below is an example of my /etc/network/interfaces on each node.
Code:
root@vhost7:~# cat /etc/network/interfaces
auto lo
iface lo inet loopback
iface eth0 inet manual
auto vmbr0
iface vmbr0 inet static
address 10.100.2.117
netmask 255.255.255.0
gateway 10.100.2.1
bridge_ports eth0
bridge_stp off
bridge_fd 0
auto vmbr1
iface vmbr1 inet static
address 10.5.7.1
netmask 255.255.255.0
bridge_ports none
bridge_stp off
bridge_fd 0
post-up echo 1 > /proc/sys/net/ipv4/ip_forward
post-up iptables -t nat -A POSTROUTING -s '10.5.7.0/24' -o vmbr0 -j MASQUERADE
post-down iptables -t nat -D POSTROUTING -s '10.5.7.0/24' -o vmbr0 -j MASQUERADE
Then I've injected static routes into our primary gateway to get to these VMs.
You may ask why I have a vmbr0 and vmbr1. Occasionally I need to set the IP of a VM to the node's network. In that case, I bridge the VMs nic card to vmbr0, and then I can apply an IP on the 10.100.2.0/24 network.
3. Clustering means I'll have to deal with split brain problems, fencing issues, quorum problems, update issues, taking nodes into maintenance (for good, for power cycling, for moving to a different network for testing with clients). All these headaches for something as simple as managing from one page. Probably not worth it.
You may also ask why I didn't just cluster in the beginning. I have two reasons. We started out with only one machine, and the second reason is because I'm a noob when it comes to Proxmox
I'd love to hear your solutions to these problems! Let me know what you think. Thanks!