
Conversation

elmiko
Contributor

@elmiko elmiko commented Sep 12, 2025

What type of PR is this?

/kind bug

What this PR does / why we need it:

Which issue(s) this PR fixes:

Fixes #8494

Special notes for your reviewer:

this is a challenging scenario to debug; please see the related issue.

Does this PR introduce a user-facing change?

The ClusterAPI provider will not scale down a MachineDeployment that is undergoing an update.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:


@k8s-ci-robot added the release-note, do-not-merge/work-in-progress, kind/bug, cncf-cla: yes, and do-not-merge/needs-area labels on Sep 12, 2025
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: elmiko
Once this PR has been reviewed and has the lgtm label, please assign feiskyer for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot added the area/cluster-autoscaler, area/provider/alicloud, area/provider/aws, area/provider/azure, area/provider/cluster-api, and area/provider/coreweave labels and removed the do-not-merge/needs-area label on Sep 12, 2025
@k8s-ci-robot added the size/L, area/provider/digitalocean, area/provider/equinixmetal, area/provider/externalgrpc, area/provider/gce, area/provider/hetzner, area/provider/huaweicloud, area/provider/ionoscloud, area/provider/kwok, area/provider/linode, area/provider/magnum, area/provider/oci, area/provider/rancher, and area/provider/utho labels on Sep 12, 2025
@elmiko
Contributor Author

elmiko commented Sep 12, 2025

i'm still working on some clusterapi-specific unit tests, but they are quite challenging given the mocks that are needed.

@elmiko
Contributor Author

elmiko commented Sep 12, 2025

cc @sbueringer @fabriziopandini this isn't quite done yet, but the business logic seems to be working as expected.

    } else if err != nil && err != cloudprovider.ErrNotImplemented {
        klog.Warningf("Error while checking if node is a candidate for deletion %s: %v", node.Name, err)
        continue
    }
    nodeGroup, err := ctx.CloudProvider.NodeGroupForNode(node)
Contributor

According to NodeGroupForNode()'s interface comment ("nil if the node should not be processed by cluster autoscaler"), it looks like we could also put the capi rollout logic into this interface's implementation?
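
Concretely, that suggestion would amount to something roughly like the following (a hypothetical sketch, not code from this PR; the lookup helpers are placeholders):

    package example

    import (
        corev1 "k8s.io/api/core/v1"

        "k8s.io/autoscaler/cluster-autoscaler/cloudprovider"
    )

    // provider is a stand-in for the clusterapi cloud provider; the two function
    // fields are placeholders for the existing node-group lookup and for a rollout
    // check on the MachineDeployment that owns the node.
    type provider struct {
        lookupNodeGroup               func(*corev1.Node) (cloudprovider.NodeGroup, error)
        machineDeploymentIsRollingOut func(*corev1.Node) (bool, error)
    }

    // NodeGroupForNode sketches the suggestion above: return nil while the node's
    // MachineDeployment is rolling out, so the autoscaler skips the node entirely.
    func (p *provider) NodeGroupForNode(node *corev1.Node) (cloudprovider.NodeGroup, error) {
        rolling, err := p.machineDeploymentIsRollingOut(node)
        if err != nil {
            return nil, err
        }
        if rolling {
            // Per the interface comment, nil means "do not process this node".
            return nil, nil
        }
        return p.lookupNodeGroup(node)
    }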

Contributor Author

@elmiko elmiko Sep 15, 2025

ah, good suggestion. that might be a much simpler way to solve this. i'll research that approach, thank you!

Member

Wouldn't this also break scale up's and potentially a lot of other places where this func is used?

Contributor Author

Wouldn't this also break scale up's and potentially a lot of other places where this func is used?

yes, unfortunately after doing more research i don't think this would work for us, for a couple of reasons:

  1. there are other uses of NodeGroupForNode that should always return accurate information
  2. clusterapi, and potentially other providers that perform node updates, need to know that the autoscaler will be deleting a node in order to make decisions about the deletion process. NodeGroupForNode does not pass this context.

this would only work if we were to assume that any node undergoing an update should be ignored completely by the autoscaler. i'm not sure we can make that assertion.

my hope is that in the future, once something like the Declarative Node Maintenance api has been accepted, we will be able to coordinate using that api.

Member

@sbueringer sbueringer Sep 16, 2025

2. clusterapi, and potentially other providers that perform node updates, need to know that the autoscaler will be deleting a node in order to make decisions about the deletion process. NodeGroupForNode does not pass this context.

Honestly, I don't know. But NodeGroupForNode is called in 21 places, so I don't know what other impact this would have or whether it would produce new issues elsewhere. I think this would require extensive research and testing.

Our idea so far was to make a surgical change to ensure GetScaleDownCandidates does not return Nodes/Machines of an MD in rollout as scale-down candidates. This means that Nodes/Machines of such an MD are simply not considered for scale down, while everything else keeps working as it does today.

Modifying NodeGroupForNode would significantly increase the blast radius of this fix.
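
To make the intent concrete, a minimal sketch of that filtering step could look like this (the helper name is hypothetical and this is not the code in this PR):

    package example

    import (
        corev1 "k8s.io/api/core/v1"
    )

    // filterScaleDownCandidates sketches the surgical change described above: drop
    // Nodes whose owning MachineDeployment is mid-rollout before they ever become
    // scale-down candidates, and leave every other code path untouched. The
    // isOwnedByRollingMachineDeployment helper is a placeholder.
    func filterScaleDownCandidates(nodes []*corev1.Node, isOwnedByRollingMachineDeployment func(*corev1.Node) (bool, error)) []*corev1.Node {
        candidates := make([]*corev1.Node, 0, len(nodes))
        for _, node := range nodes {
            rolling, err := isOwnedByRollingMachineDeployment(node)
            if err != nil {
                // On lookup errors, keep today's behavior and leave the node in the list.
                candidates = append(candidates, node)
                continue
            }
            if rolling {
                // Deleting a Machine mid-rollout can remove the wrong Node (see #8494),
                // so exclude it from scale-down consideration.
                continue
            }
            candidates = append(candidates, node)
        }
        return candidates
    }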

    })
    objs, err := c.machineSetInformer.Lister().ByNamespace(r.GetNamespace()).List(selector)
    if err != nil {
        return nil, err
Member

Maybe wrap the error here to provide a bit of context
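
For example, something along these lines (an illustrative suggestion, not the PR's actual change):

    objs, err := c.machineSetInformer.Lister().ByNamespace(r.GetNamespace()).List(selector)
    if err != nil {
        // Wrap the error with the namespace and resource being listed so failures
        // are easier to trace back to this lookup (assumes "fmt" is imported).
        return nil, fmt.Errorf("failed to list MachineSets in namespace %q: %w", r.GetNamespace(), err)
    }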

Member

@elmiko I think you missed this one (but up to you of course :))

@aleksandra-malinowska
Contributor

If I understand #8494 correctly, this change is meant to prevent CA from attempting to scale down a node that has already been cordoned and drained by Cluster API (and it's up to Cluster API to remove it).

Can Cluster API apply the scale-down disabled annotation when draining? https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/core/scaledown/eligibility/eligibility.go#L39
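
For reference, applying that annotation from the Cluster API side could look roughly like this (a hypothetical sketch; the helper and client names are placeholders):

    package example

    import (
        "context"
        "fmt"

        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
        "k8s.io/client-go/kubernetes"
    )

    // ScaleDownDisabledAnnotation is the annotation checked by the autoscaler's
    // scale-down eligibility code linked above.
    const ScaleDownDisabledAnnotation = "cluster-autoscaler.kubernetes.io/scale-down-disabled"

    // markNodeScaleDownDisabled is a hypothetical helper showing what "apply the
    // annotation when draining" could look like.
    func markNodeScaleDownDisabled(ctx context.Context, client kubernetes.Interface, nodeName string) error {
        node, err := client.CoreV1().Nodes().Get(ctx, nodeName, metav1.GetOptions{})
        if err != nil {
            return fmt.Errorf("getting node %q: %w", nodeName, err)
        }
        if node.Annotations == nil {
            node.Annotations = map[string]string{}
        }
        node.Annotations[ScaleDownDisabledAnnotation] = "true"
        _, err = client.CoreV1().Nodes().Update(ctx, node, metav1.UpdateOptions{})
        return err
    }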

@sbueringer
Member

sbueringer commented Sep 16, 2025

If I understand #8494 correctly, this change is meant to prevent CA from attempting to scale down a node that has already been cordoned and drained by Cluster API (and it's up to Cluster API to remove it).

Can Cluster API apply the scale-down disabled annotation when draining? master/cluster-autoscaler/core/scaledown/eligibility/eligibility.go#L39

No, this is about preventing the cluster autoscaler from deleting / scaling down a Node altogether while Cluster API is doing a rollout.
The problem is that if the autoscaler tries to delete / scale down a Node during a rollout, there is a high chance it will end up deleting the wrong Node (and that then repeats until there are no Nodes left for the node group).

@elmiko
Contributor Author

elmiko commented Sep 16, 2025

+1 to what @sbueringer is saying. also, this problem is currently confined to the clusterapi provider, but it could affect any provider that performs node updates in a similar fashion. i think we need to make the autoscaler smarter in these scenarios where a cloud provider needs more control over which nodes are being marked for removal during a maintenance window.

This function allows cloud providers to specify when a node is not a
good candidate for scaling down. This will occur before the autoscaler has
begun to cordon, drain, and taint any node for scale down.

Also adds a unit test for the prefiltering node processor.
The initial implementation of this function for clusterapi will return
that a node is not a good candidate for scale down when it belongs to a
MachineDeployment that is currently rolling out an upgrade.
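
A rough sketch of what such a hook could look like (interface and method names here are illustrative, not necessarily the ones used in this PR):

    package example

    import (
        corev1 "k8s.io/api/core/v1"
    )

    // ScaleDownCandidateChecker is an illustrative shape for the hook described in
    // the commit messages above: cloud providers get a chance to veto a node as a
    // scale-down candidate before the autoscaler cordons, drains, or taints it.
    type ScaleDownCandidateChecker interface {
        IsGoodScaleDownCandidate(node *corev1.Node) (bool, error)
    }

    // clusterAPIChecker sketches the clusterapi behavior described above: a node is
    // not a good candidate while its MachineDeployment is rolling out an upgrade.
    type clusterAPIChecker struct {
        // machineDeploymentIsRollingOut is a placeholder for the provider's lookup
        // of the rollout status of the MachineDeployment that owns the node.
        machineDeploymentIsRollingOut func(node *corev1.Node) (bool, error)
    }

    func (c *clusterAPIChecker) IsGoodScaleDownCandidate(node *corev1.Node) (bool, error) {
        rolling, err := c.machineDeploymentIsRollingOut(node)
        if err != nil {
            return false, err
        }
        return !rolling, nil
    }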
@elmiko elmiko force-pushed the add-good-candidate-node-interface branch from cae87c7 to 150cf78 on September 16, 2025 at 19:16
@elmiko
Contributor Author

elmiko commented Sep 16, 2025

updated with @sbueringer's suggestions.

@sbueringer
Member

Answered above

Successfully merging this pull request may close these issues.

CA ClusterAPI provider can delete wrong node when scale-down occurs during MachineDeployment upgrade