Hi All
We're currently documenting best practices and were trying to find documentation on proper steps to shutdown entire cluster for when there is any kind of maintenance taking place to the building, network, infrastructure, to the servers itself etc.
3x Node Cluster
1x Main Network
1x Corosync Network
1x Ceph Network (4 OSD's per node)
Currently what we have is:
1. Set HA status to Freeze
2. Set HA group to Stopped
2. Bulk Shutdown VM's
3. Initiate Node shutdown starting from number 3 then 2 then 1 with a minute apart from one another.
Then when booted again:
1. Bulk Start VM's
2. Set HA to migrate again
3. Set HA group to started
Any advice, comments etc will be appreciated.
We're currently documenting best practices and were trying to find documentation on proper steps to shutdown entire cluster for when there is any kind of maintenance taking place to the building, network, infrastructure, to the servers itself etc.
3x Node Cluster
1x Main Network
1x Corosync Network
1x Ceph Network (4 OSD's per node)
Currently what we have is:
1. Set HA status to Freeze
2. Set HA group to Stopped
2. Bulk Shutdown VM's
3. Initiate Node shutdown starting from number 3 then 2 then 1 with a minute apart from one another.
Then when booted again:
1. Bulk Start VM's
2. Set HA to migrate again
3. Set HA group to started
Any advice, comments etc will be appreciated.