feat(scheduling): per-Deployment k8s scheduling options #223

Open · wants to merge 5 commits into base: main
Conversation

@andrewazores (Member) commented Nov 29, 2024

Related to #220

To test:

  1. `helm install cryostat ./charts/cryostat` and ensure everything comes up as usual
  2. If you have a multi-node cluster for testing, try setting the different nodeSelector, tolerations, and affinity settings introduced by this PR (see the example values below). Otherwise, just verify that the k8s resource manifests look as expected.
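For example (illustrative values only: the node labels and taint keys below are placeholders, but the value keys follow the pattern in the chart's values.yaml, with chart-level defaults plus per-component overrides):

```yaml
# my-values.yaml (sketch): exercise the new scheduling options with a
# chart-level default and a per-component override.
nodeSelector:
  kubernetes.io/os: linux                     # default for all managed Pods
tolerations: []
affinity: {}

datasource:
  nodeSelector:                               # replaces the chart-level default
    topology.kubernetes.io/zone: us-east-1a   # placeholder zone label
  tolerations:
    - key: example.com/dedicated              # placeholder taint key
      operator: Exists
      effect: NoSchedule
```

Rendering with `helm template cryostat ./charts/cryostat -f my-values.yaml` is a quick way to inspect the resulting manifests without a multi-node cluster.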

@andrewazores andrewazores added the feat (New feature or request) and safe-to-test labels Nov 29, 2024
@andrewazores andrewazores requested a review from a team November 29, 2024 18:33
@andrewazores andrewazores changed the title from feat(nodeselector): per-Deployment node selectors to feat(scheduling): per-Deployment k8s scheduling options Nov 29, 2024
Comment on lines 249 to 250
## @param grafana.affinity [object] Affinity for the Grafana Pod. See: [Affinity](https://kubernetes.io/docs/reference/kubernetes-api/workload-resources/pod-v1/#scheduling)
affinity: {}
Member
Is this unused, since Grafana and Datasource run in the same Pod as Cryostat?

@@ -267,6 +277,8 @@ datasource:
nodeSelector: {}
## @param datasource.tolerations [array] Tolerations for the JFR Datasource Pod. See: [Tolerations](https://kubernetes.io/docs/reference/kubernetes-api/workload-resources/pod-v1/#scheduling)
tolerations: []
## @param datasource.affinity [object] Affinity for the JFR Datasource Pod. See: [Affinity](https://kubernetes.io/docs/reference/kubernetes-api/workload-resources/pod-v1/#scheduling)
affinity: {}
Member

Same here?

Comment on lines +407 to 408
## @param affinity [object] default Affinity for the various Pods. See: [Affinity](https://kubernetes.io/docs/reference/kubernetes-api/workload-resources/pod-v1/#scheduling)
affinity: {}
Member

Would it be better for this field to represent common affinity specs for the managed Pods? Then each Pod could override them as needed. That seems a bit more intuitive to me, by analogy with SecurityContext, where container-level specs override pod-level ones where possible.

However, that means we would need to figure out a way to do a strategic merge on this field :( Not sure how that can be done currently.

A simpler option, I suppose, is to remove this field, since we don't have such default options for tolerations or nodeSelectors. We could just remove it and let users define their own?
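(For reference, Helm has no built-in strategic merge. The closest approximation I can think of is sprig's mergeOverwrite, which deep-merges maps but replaces list values wholesale, so it still falls short of strategic-merge-patch semantics. A hypothetical template sketch:)

```yaml
# Hypothetical sketch only: deep-merge a per-component override onto the
# chart-level default. mergeOverwrite replaces list values outright, so
# this is not a true Kubernetes strategic merge patch.
{{- $affinity := mergeOverwrite (deepCopy .Values.affinity) (.Values.datasource.affinity | default (dict)) }}
{{- with $affinity }}
affinity:
  {{- toYaml . | nindent 2 }}
{{- end }}
```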

Member Author

There are also default options for tolerations and nodeSelectors now, just a few lines above this one. Each of these is only used if no pod-specific attribute is provided. There is no merge strategy: each Pod uses its own specific setting if one exists, and otherwise falls back to these global defaults.
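A sketch of the effective lookup, for illustration only (the chart's actual templates may be structured and indented differently):

```yaml
# Sketch: the per-Pod value wins outright when set; otherwise the
# chart-level default applies. `default` substitutes, it does not merge.
{{- with (.Values.datasource.nodeSelector | default .Values.nodeSelector) }}
nodeSelector:
  {{- toYaml . | nindent 2 }}
{{- end }}
{{- with (.Values.datasource.affinity | default .Values.affinity) }}
affinity:
  {{- toYaml . | nindent 2 }}
{{- end }}
```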

@tthvo (Member) commented Dec 2, 2024

Ahh right, oops. Sorry, not sure why I didn't see them :D

Also, considering #222, which defines a top-level field that behaves like a set of common annotations, I am wondering whether it would be better to have this PR behave the same way, or to adjust #222 to behave as this PR does?

Otherwise, I guess it's okay since they are different specs :D What do you think?

Member Author

I figured it made sense for annotations to get merged together, since they are really just metadata tags on the resources. It is natural for annotation keys to be shared between components, even if each component adds its own specific values on top, or applies overrides.

Things like affinities (or anything related to node scheduling) are different: if a specific configuration is required for one Pod, it is probably not intended to be merged or shared with the others. The default setting is there so you can ensure that all of Cryostat gets scheduled together onto one node, but any more specific setting is probably being applied to make the scheduler put that Pod on a different node.

So I think it makes sense either for these specs to behave differently, or else for #222 to work like this one (replace, not merge). This one should not work like #222, IMO.
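To illustrate with hypothetical values: with the settings below, the Datasource Deployment would get only its own nodeAffinity; the chart-level podAffinity default is replaced outright, not merged in:

```yaml
# Illustrative only: the default keeps all Cryostat Pods on one node,
# while the per-component override replaces that default entirely.
affinity:                          # chart-level default for all Pods
  podAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app.kubernetes.io/name: cryostat
        topologyKey: kubernetes.io/hostname

datasource:
  affinity:                        # replaces the default; no merge
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
          - matchExpressions:
              - key: example.com/jfr-node   # placeholder node label
                operator: In
                values: ["true"]
```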

@tthvo (Member) commented Dec 17, 2024

Oh right, that makes sense! In that case, the current approach fits better. I don't think #222 has to be adjusted, since merging metadata seems to be common practice. As long as it is well documented, there won't be any issue ^^

Thanks for the explanations! This sort of design question bugs me at times :D
