operator: upgrade all control plane nodes first #3444
Conversation
```yaml
maxNodeBudget:
  description: MaxNodeBudget is the maximum number of nodes that can
    be created simultaneously.
  format: int32
  type: integer
```
Is this problematic for upgrades, since we never upgrade the installed CRDs through helm?
No, I think Helm has some upgrade magic built in regarding CRDs. But this time we only add one field that is optional, so we are fine. Proof: https://github.com/edgelesssys/constellation/actions/runs/11455280858/job/31871905456.
Or do you have a concrete error in mind that I'm missing?
As far as I know, Helm simply doesn't do anything if a CRD already exists: https://helm.sh/docs/chart_best_practices/custom_resource_definitions/#method-1-let-helm-do-it-for-you
So if we change the CRDs, running `helm upgrade` won't actually apply these changes.
From what I can tell, this effectively makes the `maxNodeBudget` option in the NodeVersion CRD non-functional.
I'm assuming everything still works fine because we default to 1 if the value is not set in the CR.
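A minimal sketch of that defaulting, assuming the zero value of the optional field falls back to the previous behavior of replacing one node at a time (the helper name is illustrative, not the operator's actual code):

```go
package main

import "fmt"

// nodeBudget returns the effective node budget: an unset (zero)
// maxNodeBudget falls back to 1, matching the old behavior.
// Illustrative helper; the operator's actual code may differ.
func nodeBudget(maxNodeBudget uint32) uint32 {
	if maxNodeBudget == 0 {
		return 1
	}
	return maxNodeBudget
}

func main() {
	fmt.Println(nodeBudget(0)) // field unset in the CR
	fmt.Println(nodeBudget(5)) // field set explicitly
}
```

This is why a stale CRD is harmless here: a CR that cannot carry the new field behaves exactly like one where the field is unset.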
Oh, thanks for the hint. Yes, the behavior is (hopefully) exactly the same, so I think we are good for now (also since the e2e test passed). I didn't want to advertise this feature anyway, but even if we do, we should document this constraint.
In the future(tm), we likely want to do what others do (e.g., istio: https://istio.io/latest/docs/setup/upgrade/helm/#canary-upgrade-recommended).
operators/constellation-node-operator/controllers/nodeversion_controller.go
operators/constellation-node-operator/controllers/scalinggroup_controller.go
```go
PlaceholderControlPlaneScalingGroupName = "control-planes-id"
// PlaceholderWorkerScalingGroupName name of the worker scaling group used if upgrades are not yet supported.
PlaceholderWorkerScalingGroupName = "workers-id"
// ControlPlaneRoleLabel label used to identify control plane nodes.
```
```suggestion
// ControlPlaneRoleLabel label used to identify control plane nodes.
// https://kubernetes.io/docs/reference/labels-annotations-taints/#node-role-kubernetes-io-control-plane
```
Nit, just to document that this is canonical.
```go
r.RLock()
defer r.RUnlock()
```
Does this need a write lock now?
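The distinction behind the question, as a generic sketch (the `registry` type and its fields are illustrative, not the operator's actual code): a read lock is only sufficient while the method reads shared state; any mutation needs the exclusive write lock.

```go
package main

import (
	"fmt"
	"sync"
)

// registry is an illustrative type guarding a map with sync.RWMutex.
type registry struct {
	mu     sync.RWMutex
	groups map[string]string
}

// get only reads shared state, so the shared read lock suffices.
func (r *registry) get(id string) string {
	r.mu.RLock()
	defer r.mu.RUnlock()
	return r.groups[id]
}

// set mutates the map and therefore needs the exclusive write lock;
// an RLock here would race with concurrent writers.
func (r *registry) set(id, name string) {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.groups[id] = name
}

func main() {
	r := &registry{groups: make(map[string]string)}
	r.set("workers-id", "workers")
	fmt.Println(r.get("workers-id"))
}
```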
```go
// KubernetesClusterVersion is the advertised Kubernetes version of the cluster.
KubernetesClusterVersion string `json:"kubernetesClusterVersion,omitempty"`
// MaxNodeBudget is the maximum number of nodes that can be created simultaneously.
MaxNodeBudget uint32 `json:"maxNodeBudget,omitempty"`
```
I'm against making this a user-facing feature for now, because we did not discuss its semantics sufficiently. What if the user sets it to 1000 - do we replace all control-plane nodes at once? Should we have different budgets for control planes and for workers? Could this be relative? Should we rather implement a different upgrade algorithm (3533 etc)? I'm also reminded of the evolution of https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.30/#rollingupdatedeployment-v1-apps.
Afaict this PR would be much smaller if we removed this feature and tried to find a different way to test it.
> do we replace all control-plane nodes at once?

Yes, I also tested it by setting the budget to 5 when I had 3:2 nodes. But "at once" only applies to the world as the operator sees it. The join service still forbids multiple control-plane nodes joining at the same time. Note that this is the scenario right after initializing a Constellation with >=3 control planes. Also, "replace" means adding the new nodes first, so in theory you could go from 3 control planes to 6, since the operator only removes a node once the handover is finished.
> Should we have different budgets for control planes and for workers?

We might do that in the future, if a customer requires it or we think we need it.
> Could this be relative?

I assume you mean a percentage value. Sure, but this is more difficult/complex than setting the number of nodes.
> Should we rather implement a different upgrade algorithm

I think the bug is orthogonal to this proposal, since I'm not reworking the operator's replacement algorithm, which would be a large undertaking in my opinion.
> I'm also reminded of the evolution of https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.30/#rollingupdatedeployment-v1-apps

I don't know the evolution; is there a summary somewhere of what happened?
Note that also the operator API is on version v1alpha1, so in my opinion, we don't have to provide any API stability guarantees between Constellation versions and we can completely change the whole upgrade process and APIs between Constellation versions.
Some of the current fields are not really user-facing in the sense that the user should use them directly. Changing the image reference requires (1) the image reference to be one of our image references and (2) the measurements in the join config to match the image. Changing the Kubernetes version requires a config map under that name, and for the Constellation to upgrade correctly it has to contain the right set of components and patches.
> Afaict this PR would be much smaller if we removed this feature and tried to find a different way to test it.

Then I'll have another try at the test for this, but it might take a bit of time before I get back to it.
Co-Authored-By: Leonard Cohnen <[email protected]>
Before we call out to the cloud provider, we check whether there are still control-plane nodes that are outdated (or donors). If there are, we don't create any worker nodes, even if we have the budget to do so.
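The gating described above can be sketched as follows (an illustrative helper under assumed inputs, not the operator's actual code):

```go
package main

import "fmt"

// workerBudget returns how much of the node budget may be spent on
// worker nodes: zero while any control-plane node is still outdated
// or acting as a donor, the full budget otherwise.
// Illustrative sketch; the operator's real reconciliation differs.
func workerBudget(budget, outdatedControlPlanes, donorControlPlanes int) int {
	if outdatedControlPlanes > 0 || donorControlPlanes > 0 {
		return 0
	}
	return budget
}

func main() {
	fmt.Println(workerBudget(2, 1, 0)) // control plane still upgrading: no workers created
	fmt.Println(workerBudget(2, 0, 0)) // control plane done: full budget available
}
```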
Do you remember what the issue was there? We might want to make a ticket for it.
Fwiw, this part was already merged as #3653.
This was merged as #3663.
Context
Allows #3396. Since kubelets must not communicate with a kube-apiserver that is older than the kubelet itself, we need to upgrade all control-plane nodes before upgrading the worker nodes. Control-plane nodes are configured so that they only talk to the local kube-apiserver, which matches their kubelet version.
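The ordering constraint above can be sketched as a simple version check (an illustrative helper with assumed inputs, not the operator's actual code): worker kubelets may only move to the target version once every control-plane node already runs it.

```go
package main

import "fmt"

// canUpgradeWorkers reports whether the worker nodes may be upgraded:
// only once every control-plane node runs the target version, so no
// kubelet ever ends up newer than the kube-apiserver it talks to.
// Illustrative sketch; real version handling would compare semver.
func canUpgradeWorkers(controlPlaneVersions []string, target string) bool {
	for _, v := range controlPlaneVersions {
		if v != target {
			return false
		}
	}
	return true
}

func main() {
	// Mid-upgrade: one control plane still on the old version.
	fmt.Println(canUpgradeWorkers([]string{"v1.31.1", "v1.30.4"}, "v1.31.1")) // false
	// Control plane fully upgraded: workers may follow.
	fmt.Println(canUpgradeWorkers([]string{"v1.31.1", "v1.31.1"}, "v1.31.1")) // true
}
```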
Proposed change(s)
How to test:
Related issue
kubernetes/kubernetes#127316
Checklist
gcp-snp, 3:2, v1.30.4 -> v1.31.1 K8s, v1.18.0 -> head of this PR: https://github.com/edgelesssys/constellation/actions/runs/11431369883