I have read the docs on HA top to bottom and back to top:
https://pve.proxmox.com/wiki/High_Availability
I can't get my head around the resource scheduling, because:
"The cluster resource scheduler (CRS) mode controls how HA selects nodes for the recovery of a service as well as for migrations that are triggered by a shutdown policy."
Then for both modes it goes on to say:
"Non-HA-managed services are currently not counted."
There are some further details on the (tech preview) Static-Load Scheduler regarding CPU and RAM weighting, but even then the only points at which the CRS acts are recovery, a config change, and a service being started (from a stopped state).
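For context, this is how I understand the mode is selected. It is a fragment of /etc/pve/datacenter.cfg as I read the docs, not a verified setup, and the same option should also be available in the GUI under Datacenter -> Options:

```
# /etc/pve/datacenter.cfg (fragment)
# "basic" is the default mode; "static" selects the tech-preview Static-Load Scheduler
crs: ha=static
```

As far as I can tell, this only changes how a target node is picked at the points listed above. It does not make running services move.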
So do I understand this correctly that there's:
1. no config option whatsoever to have running services auto-migrated in order to avoid uneven load (say, above a set threshold) on an HA group of nodes?
2. a service would only be migrated if e.g. the OOM killer takes it down, but not before?
3. after a node-down recovery, none of the running services will be migrated back to the recovered node unless they fail on the other nodes in the same priority group?
4. the Basic Scheduler is good for nothing but a set of homogeneous services on resource-equivalent nodes?
5. none of this is reliable if there are also non-HA services on any of the nodes used for HA services?
And finally:
6. Can I prevent non-HA services from being started on an HA group of nodes? If not, shouldn't the docs advise an all-or-nothing approach, i.e. either all or none of the services on a given group of nodes being HA-managed?
Obviously I would rather not have (all of) the above confirmed, but if I am wrong, could you advise how to address these issues?
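To make question 1 concrete, here is a rough sketch of the kind of threshold check I have in mind. It is purely illustrative: the function name, the node names and the load figures are made up, and nothing here is a Proxmox API. It only flags nodes whose load sits above the group average by more than a set margin, which is the point at which I would expect a rebalancing migration to be considered.

```python
from statistics import mean


def overloaded_nodes(node_load, threshold=0.2):
    """Return (sorted) the nodes whose load exceeds the group average
    by more than `threshold`.

    node_load: hypothetical mapping of node name -> load fraction (0.0 to 1.0).
    threshold: hypothetical margin above the average that counts as "uneven".
    """
    avg = mean(node_load.values())
    return sorted(node for node, load in node_load.items() if load - avg > threshold)


# Made-up figures for a three-node HA group.
loads = {"pve1": 0.90, "pve2": 0.40, "pve3": 0.35}
print(overloaded_nodes(loads))
```

With these made-up numbers only pve1 is flagged, since it is the only node more than 0.2 above the group average of 0.55.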