CORS-3741: Nutanix enhancement: allow multiple NICs
yanhua121 committed Nov 6, 2024
1 parent 4cc0d6f commit 76ca31d
Showing 1 changed file with 128 additions and 0 deletions.
128 changes: 128 additions & 0 deletions enhancements/machine-api/nutanix-multi-nics.md
@@ -0,0 +1,128 @@
---
title: nutanix-multi-nics
authors:
reviewers:
approvers:
api-approvers:
creation-date: 2024-11-05
last-updated: 2024-11-05
tracking-link:
- https://issues.redhat.com/browse/CORS-3741
---

# Nutanix: Multi-NICs for OCP Cluster Nodes

## Summary

This enhancement adds the ability to install OpenShift on Nutanix with nodes that have multiple NICs (multiple subnets), both at IPI installation time and when autoscaling with MachineSets.

## Motivation

Requested by customers:
- Everest Digital
- Unacle B.V

### Goals

- Allow users to configure multiple subnets for the Nutanix platform in the install-config.yaml file at cluster installation using IPI or UPI.
- Allow users to configure multiple subnets via Machine/MachineSet CRs' Nutanix providerSpec to add/scale worker nodes.
- Allow smooth cluster upgrade from older OCP versions.

### Non-Goals

## Proposal

### User Stories

As an OpenShift user, I wish to deploy clusters whose infrastructure and worker nodes have multiple NICs. This may be to support secondary storage networking, such as Nutanix CSI, or to support other applications with segmented network requirements.

### API Extensions

Currently, the “subnets” fields in both the Machine/MachineSet Nutanix providerSpec and the Nutanix FailureDomain are already arrays. The only API change is to relax the validation rule on the “subnets” fields to allow multiple values and to ensure that no duplicate values are configured.

We will add a feature gate "NutanixMultiSubnets" (DevPreviewNoUpgrade, TechPreviewNoUpgrade) for this feature. After QE testing is complete, we will add the feature gate to the "Default" feature set.

```go
// NutanixPlatformSpec holds the desired state of the Nutanix infrastructure provider.
// This only includes fields that can be modified in the cluster.
type NutanixPlatformSpec struct {
...

// failureDomains configures failure domains information for the Nutanix platform.
// When set, the failure domains defined here may be used to spread Machines across
// prism element clusters to improve fault tolerance of the cluster.
// +openshift:validation:FeatureGateAwareMaxItems:featureGate=NutanixMultiSubnets,maxItems=32
// +listType=map
// +listMapKey=name
// +optional
FailureDomains []NutanixFailureDomain `json:"failureDomains"`
}

// NutanixFailureDomain configures failure domain information for the Nutanix platform.
type NutanixFailureDomain struct {
...

// subnets holds a list of identifiers (one or more) of the cluster's network subnets
// for the Machine's VM to connect to. The subnet identifiers (uuid or name) can be
// obtained from the Prism Central console or using the prism_central API.
// If the feature gate NutanixMultiSubnets is enabled, up to 32 subnets may be configured.
// +kubebuilder:validation:Required
// +kubebuilder:validation:MinItems=1
// +openshift:validation:FeatureGateAwareMaxItems:featureGate="",maxItems=1
// +openshift:validation:FeatureGateAwareMaxItems:featureGate=NutanixMultiSubnets,maxItems=32
// +openshift:validation:FeatureGateAwareXValidation:featureGate=NutanixMultiSubnets,rule="self.all(x, self.exists_one(y, x == y))",message="each subnet must be unique"
// +listType=atomic
Subnets []NutanixResourceIdentifier `json:"subnets"`
}
```
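
The uniqueness rule `self.all(x, self.exists_one(y, x == y))` requires every element of the list to equal exactly one element of that list, namely itself. The following is a minimal Go sketch of the same check, for illustration only. `ResourceIdentifier` is a simplified, hypothetical stand-in for `NutanixResourceIdentifier` that uses plain string fields so that `==` compares by value, and `allUnique` is not part of any OpenShift API.

```go
package main

import "fmt"

// ResourceIdentifier is a simplified, hypothetical stand-in for
// NutanixResourceIdentifier. Plain string fields are an assumption made
// here so that == compares identifiers by value.
type ResourceIdentifier struct {
	Type string // "uuid" or "name"
	UUID string
	Name string
}

// allUnique mirrors the CEL rule self.all(x, self.exists_one(y, x == y)):
// each element must match exactly one element of the list (itself).
func allUnique(subnets []ResourceIdentifier) bool {
	for _, x := range subnets {
		matches := 0
		for _, y := range subnets {
			if x == y {
				matches++
			}
		}
		if matches != 1 {
			return false
		}
	}
	return true
}

func main() {
	distinct := []ResourceIdentifier{
		{Type: "uuid", UUID: "subnet-uuid-1"},
		{Type: "name", Name: "storage-net"},
	}
	repeated := []ResourceIdentifier{
		{Type: "name", Name: "storage-net"},
		{Type: "name", Name: "storage-net"},
	}
	fmt.Println("distinct list passes:", allUnique(distinct))
	fmt.Println("repeated list passes:", allUnique(repeated))
}
```

Note that under this rule a subnet referenced once by name and once by uuid counts as two different entries, because the comparison is on the identifier values rather than on the subnet they resolve to.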

### Implementation Details/Notes/Constraints

The installer should allow more than one subnet to be configured in the install-config.yaml and pass that configuration to the installer-generated Machine/MachineSet manifests when creating an OCP cluster.

The Machine validation webhook should validate the Nutanix providerSpec’s “subnets” field, allowing more than one item and rejecting duplicates.
The Nutanix machine controller should accept more than one item in the NutanixMachineProviderConfig’s “subnets” field and use the configured subnets when creating a new VM for the Machine node.
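
As a rough sketch of the webhook and controller behavior described above, the Go snippet below validates a subnet list against a feature-gate-dependent limit, rejects duplicates, and produces one NIC entry per subnet. The names `nicSpec` and `buildNICs` are hypothetical, the subnets are reduced to plain strings, and the one-NIC-per-subnet mapping is an assumption about how the controller would consume the field, not the actual implementation.

```go
package main

import (
	"errors"
	"fmt"
)

// nicSpec is a hypothetical, simplified stand-in for the NIC structure the
// machine controller would send to Prism Central; it is not a real API type.
type nicSpec struct {
	SubnetID string
}

const (
	maxSubnetsGateOff = 1  // limit when NutanixMultiSubnets is disabled
	maxSubnetsGateOn  = 32 // limit when NutanixMultiSubnets is enabled
)

// buildNICs checks the subnet list the way the validation markers describe
// (at least one item, gate-dependent maximum, no duplicates) and returns one
// NIC per subnet, preserving the configured order.
func buildNICs(subnets []string, multiSubnetsEnabled bool) ([]nicSpec, error) {
	limit := maxSubnetsGateOff
	if multiSubnetsEnabled {
		limit = maxSubnetsGateOn
	}
	if len(subnets) == 0 {
		return nil, errors.New("at least one subnet is required")
	}
	if len(subnets) > limit {
		return nil, fmt.Errorf("got %d subnets, at most %d allowed", len(subnets), limit)
	}

	seen := make(map[string]struct{}, len(subnets))
	nics := make([]nicSpec, 0, len(subnets))
	for _, s := range subnets {
		if _, dup := seen[s]; dup {
			return nil, fmt.Errorf("duplicate subnet %q", s)
		}
		seen[s] = struct{}{}
		nics = append(nics, nicSpec{SubnetID: s})
	}
	return nics, nil
}

func main() {
	subnets := []string{"primary-net", "storage-net"}

	nics, err := buildNICs(subnets, true)
	fmt.Println("gate enabled:", len(nics), "NICs, err =", err)

	_, err = buildNICs(subnets, false)
	fmt.Println("gate disabled: err =", err)
}
```

Keeping the limit check and the duplicate check in one place means the webhook and the controller cannot disagree about what counts as a valid list.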

### Workflow Description

### Topology Considerations

#### Hypershift / Hosted Control Planes

#### Standalone Clusters

#### Single-node Deployments or MicroShift

### Risks and Mitigations

### Drawbacks

## Test Plan

- QE will test this feature.
- An e2e test case will be added for this feature.

## Graduation Criteria

### Dev Preview -> Tech Preview

### Tech Preview -> GA

After QE testing is done, we can add the feature gate "NutanixMultiSubnets" to the Default feature set for the 4.18 GA.

### Removing a deprecated feature

## Upgrade / Downgrade Strategy

Upgrading an existing Nutanix OCP cluster (prior to 4.18) to 4.18 requires no action for this feature. Prior to 4.18, the “subnets” field of the Nutanix providerSpec in the Machine/MachineSet/ControlPlaneMachineSet CRs, and in each Nutanix FailureDomain of the Infrastructure CR, must contain exactly one item, and that configuration remains valid in 4.18.

To downgrade an existing 4.18 Nutanix OCP cluster to a prior version, every Machine/MachineSet/ControlPlaneMachineSet CR and every Nutanix FailureDomain of the Infrastructure CR must configure exactly one subnet. If any of them has more than one item in “subnets”, the downgrade will fail with validation errors.
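
The downgrade constraint can be checked ahead of time by listing every resource that configures more than one subnet. The sketch below is illustrative only: `machineResource` is a hypothetical flattened view of the relevant CRs and failure domains, and `downgradeBlockers` is not an existing tool or API.

```go
package main

import "fmt"

// machineResource is a hypothetical, flattened view of a Machine, MachineSet,
// ControlPlaneMachineSet, or Infrastructure failure domain, reduced to the
// fields needed for this check.
type machineResource struct {
	Kind    string
	Name    string
	Subnets []string
}

// downgradeBlockers returns a description of every resource that configures
// more than one subnet and would therefore fail validation on a version
// prior to 4.18.
func downgradeBlockers(resources []machineResource) []string {
	var blockers []string
	for _, r := range resources {
		if len(r.Subnets) > 1 {
			blockers = append(blockers,
				fmt.Sprintf("%s/%s (%d subnets)", r.Kind, r.Name, len(r.Subnets)))
		}
	}
	return blockers
}

func main() {
	resources := []machineResource{
		{Kind: "MachineSet", Name: "workers-a", Subnets: []string{"primary-net"}},
		{Kind: "MachineSet", Name: "workers-b", Subnets: []string{"primary-net", "storage-net"}},
	}
	for _, b := range downgradeBlockers(resources) {
		fmt.Println("must be reduced to one subnet before downgrade:", b)
	}
}
```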

## Version Skew Strategy

## Operational Aspects of API Extensions

## Support Procedures

## Alternatives
