OCPEDGE-2280: Add Adaptable Topology, reorganize topology enhancements #1905
base: master
Conversation
@jaypoulz: This pull request explicitly references no jira issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by:
The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
3aa08a3 to 40d4ff0 (Compare)
/retitle OCPEDGE-2280: Add Adaptable Topology, reorganize topology enhancements
@jaypoulz: This pull request references OCPEDGE-2280 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the epic to target either version "4.21." or "openshift-4.21.", but it targets "openshift-4.22" instead. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
/retest
Introduce Adaptable topology, a cluster-topology mode that adjusts control-plane and infrastructure behavior based on node count. Clusters can install with Adaptable topology or transition from SingleReplica as a Day 2 operation.

Key components:
- Infrastructure API adds an Adaptable enum value
- Operator compatibility annotations for in-payload and OLM operators
- Library-go utilities for node-count awareness
- cluster-etcd-operator enhanced scaling logic for 2↔3 nodes
- oc adm topology transition CLI command with compatibility checks
- Console marketplace filtering and compatibility display
- Initial Topology Audit covering 39 operators

The initial implementation supports SingleReplica-to-Adaptable transitions. Future stages will add AutomaticQuorumRecovery (AQR) for DualReplica behavior on two-node configurations.

Reviewers assigned across 18 teams including control plane, OLM, API, installer, console, monitoring, and core operator teams.

This also reorganizes edge topology enhancements under a new edge-topologies/ directory:
- edge-topologies/adaptable-topology.md (new)
- edge-topologies/single-node/ (moved)
- edge-topologies/two-node/ (moved and reorganized)
  - two-node-fencing.md (renamed from tnf.md)
  - two-node-arbiter.md (renamed from arbiter-clusters.md)
40d4ff0 to 6e81004 (Compare)
@jaypoulz: The following test failed, say /retest to rerun all failed tests:
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.
| Adaptable topology is enabled | ||
| - Provide a subscription function for topology changes that operators invoke | ||
| to enable Adaptable topology behavior | ||
| - Subscribe to node count changes and provide notifications when |
It seems like a more natural way of triggering operators to react to cluster changes might be to use the controller-runtime events library for notifications (and, as a result, allow operators to reconcile the changes).
We might be able to utilize the GenericEvent, but I think ideally it would make sense to create a TopologyTransitionEvent type specific to these transitions.
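To make the suggestion concrete, here is a minimal sketch of what that could look like. The TopologyTransitionEvent type and the notifyTransition helper are hypothetical names invented for illustration, not part of the enhancement or of controller-runtime; event.GenericEvent is the existing controller-runtime piece.

```go
// Hypothetical sketch of surfacing topology transitions to controllers via
// controller-runtime events. TopologyTransitionEvent and notifyTransition are
// invented for illustration; event.GenericEvent is what controller-runtime
// provides today, and the channel would be wired into a controller with its
// channel-based source (whose exact constructor varies across versions).
package topologyevents

import (
	configv1 "github.com/openshift/api/config/v1"
	"sigs.k8s.io/controller-runtime/pkg/event"
)

// TopologyTransitionEvent is the dedicated event type the comment proposes,
// recording control-plane node counts before and after a transition.
type TopologyTransitionEvent struct {
	Infrastructure *configv1.Infrastructure
	OldNodeCount   int
	NewNodeCount   int
}

// notifyTransition pushes a GenericEvent carrying the Infrastructure object onto
// the channel a controller watches, so the change is handled by its normal
// Reconcile loop rather than by an out-of-band callback.
func notifyTransition(ch chan<- event.GenericEvent, t TopologyTransitionEvent) {
	ch <- event.GenericEvent{Object: t.Infrastructure}
}
```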
I think having events to log changes in the cluster is reasonable, but I think that operators responding to node count changes is more direct and more in line with the vision of Adaptable topology. We want to avoid giving the impression that AdaptableTopology is transitioning between topologies. Its behavior is defined by the number of CP nodes in the cluster, and it expects each operator to know what to do no matter how many CP nodes there are.
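As a rough illustration of that node-count-driven vision, a sketch along these lines; the helper names, the label selector, and the replica thresholds are all assumptions for the example, not the enhancement's API.

```go
// Illustrative only: an operator derives its posture directly from how many
// control-plane nodes exist, rather than from a declared topology transition.
// The label selector and thresholds below are assumptions for the sketch.
package adaptable

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// controlPlaneNodeCount counts nodes carrying the control-plane role label
// (OpenShift control-plane nodes may also carry the legacy master label).
func controlPlaneNodeCount(ctx context.Context, c kubernetes.Interface) (int, error) {
	nodes, err := c.CoreV1().Nodes().List(ctx, metav1.ListOptions{
		LabelSelector: "node-role.kubernetes.io/control-plane",
	})
	if err != nil {
		return 0, err
	}
	return len(nodes.Items), nil
}

// desiredReplicas maps the node count to an operand replica count; each operator
// decides what is appropriate for 1, 2, 3, or more control-plane nodes.
func desiredReplicas(controlPlaneNodes int) int32 {
	switch {
	case controlPlaneNodes <= 1:
		return 1
	case controlPlaneNodes == 2:
		return 2
	default:
		return 3
	}
}
```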
| 1. The cluster administrator adds a new control-plane node to the cluster | ||
| 2. The node joins the cluster and receives the control-plane labels | ||
| 3. Operators watching the Infrastructure config detect the node count change | ||
| 4. When crossing the 2→3 control-plane node threshold: |
We also support >3 control plane nodes, e.g. 4 or 5 etcd members. I think this etcd scaling process needs to be done then, too.
We allow 1, 2, 3, and 5. Etcd scaling beyond 3 (from 3->4 and 4->5) is technically equivalent to 1->2 because it's just adding a node. This is an implementation detail that does need to be decided upon, because this would either be the first time we explicitly allow non-transient 4-node control planes, or we'd use the double-scale-up behavior of 2->3 to go from 3->5 etcd members.
| 4. When crossing the 2→3 control-plane node threshold: | ||
| - CEO initiates the etcd scaling process, starting learner instances on the two control-plane nodes not running etcd | ||
| - CEO coordinates an atomic transition to promote both learner instances together | ||
| - Other operators adjust their behavior to match HighlyAvailable control-plane topology |
Would that happen concurrently with CEO, or after? As CEO / etcd scaling is a high-risk operation, I would feel better about first doing that, and then somehow triggering the other operators.
Concurrently with CEO. We already do this with TNF every time we do a graceful or non-graceful disruption event, and it's up to the operators to define a dependency on etcd if it is present. The only operator with such a dependency is apiserver, which responds to CEO changes as well as CP node changes.
| - CEO initiates the etcd scaling process, starting learner instances on the two control-plane nodes not running etcd | ||
| - CEO coordinates an atomic transition to promote both learner instances together | ||
| - Other operators adjust their behavior to match HighlyAvailable control-plane topology | ||
| 5. When scaling down and crossing the 3→2 control-plane node threshold: |
What is triggering this scaling down? Is it a simple "oc delete node/cp-1"? How do we then ensure atomicity?
I think we need an "oc adm topology" command to do that.
We also need to cross-check with the procedure we have in place to replace a failed control plane node. If that involves first removing the node, we probably do not want to trigger the etcd scale-down.
You can delete nodes now, and this modifies behavior for the cluster.
For UPI clusters, you won't get a replacement node - regardless of the topology set.
I agree that we will need additional atomicity coordination for scale-up and scale-down across threshold levels (i.e. 2->3 or 3->2) to ensure coordination at the etcd level, but my proposal is that CEO handles that, since it already has safe-scaling protection built in.
I don't think a separate oc adm topology command should exist for scale-up or scale-down. It could provide insight into your effective topology, but I'd like to avoid introducing new commands that perform behavior that is already part of the product.
I think what you're getting at is that a user who is accustomed to our current topologies might ask: how do I tell the cluster how I want it to behave with regard to topology? That paradigm is strictly incompatible with AdaptableTopology. You don't get to tell the cluster how to behave - you just control the number of CP nodes, and the cluster operators are responsible for reconciling that into appropriate behaviors.
| This prevents invalid configurations where control-plane and infrastructure | ||
| topologies are out of sync. | ||
| #### Shared Utilities in library-go |
This still feels like an awkward way of getting and watching the status. Why not have a new attribute "effectiveTopology" which can change and shows the current topology?
Also, for cluster admins, it's not easy to see what the current effective topology is, e.g. during the control-plane scaling process. We need to have a status field that shows e.g. "etcd cluster extension in progress, adding learners, ...".
This is an implementation detail. We can have library-go functions for effective topology, but that's more for monitoring / user-facing behavior. For operator authors, it's better to write these in terms of what behavior is appropriate for each node count, since topologies are just an abstraction layer on top of that.
Would you say that a 2-node topology is not two-node topology because etcd is in the process of scaling up from 1 to 2 nodes?
I would say that it's two-node topology with cluster-etcd-operator in "In Progress" state.
AdaptableTopology enacts that paradigm. The number of control-plane nodes (+ a potential future extension for fencing) determines your effective topology. If your cluster admin wants to see the state of the cluster adapting to a node addition, they just look at the cluster operators the same way they've always done.
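For illustration, a display-oriented helper along these lines could live in library-go; the function name and the count-to-topology mapping are assumptions, and the Adaptable-specific details (e.g. how a fenced two-node cluster would be reported) are intentionally left out.

```go
// Sketch of a possible library-go "effective topology" helper for monitoring and
// user-facing display. The name and mapping are assumptions; operators would
// normally key off the node count directly rather than this derived value.
package adaptable

import (
	configv1 "github.com/openshift/api/config/v1"
)

// EffectiveControlPlaneTopology reports what the cluster is behaving like right
// now, based purely on how many control-plane nodes exist.
func EffectiveControlPlaneTopology(controlPlaneNodes int) configv1.TopologyMode {
	switch {
	case controlPlaneNodes <= 1:
		return configv1.SingleReplicaTopologyMode
	default:
		// Two nodes and up behave as a (possibly restricted) multi-node cluster;
		// a future AQR/fencing stage would refine the two-node mapping.
		return configv1.HighlyAvailableTopologyMode
	}
}
```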
| **Topology Mismatch**: The cluster configuration would claim HighlyAvailable | ||
| while still operating as SingleReplica until nodes are provisioned. | ||
| The declared topology differs from actual behavior. |
Adaptable will have this intermediate state, too, e.g. during the etcd learner phase when transitioning from SNO to MNO.
The point is that it's not a mismatch. It's expected behavior for Adaptable topology, whereas it's an invalid state for HighlyAvailable outside of the installation phase.
| The declared topology differs from actual behavior. | ||
| **Complex Support Matrix**: Supporting arbitrary topology transitions creates | ||
| a complex matrix. |
But doesn't Adaptable also allow arbitrary transitions, by adding/removing nodes? This will create a complex matrix, too.
I don't think this is as complex as you'd imagine.
There are exactly 6 scenarios, which could be combined into a single lane for verification:
- 1->2
- 2->3
- 3->4
- 4->3
- 3->2
- 2->1
Every other transition or combination is technically equivalent to some combination of these atomic transitions.
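To make that concrete, a single ordered scale path covers all six atomic transitions; the sketch below is purely illustrative, not an agreed test plan.

```go
// Illustrative only: one ordered control-plane scale path exercises each of the
// six atomic transitions listed above exactly once, so a single test lane could
// cover them all.
package adaptabletest

// atomicTransitionPath walks 1->2->3->4 and back down to 1.
var atomicTransitionPath = []int{1, 2, 3, 4, 3, 2, 1}

// transitions expands the path into (from, to) pairs for per-step assertions.
func transitions(path []int) [][2]int {
	out := make([][2]int, 0, len(path)-1)
	for i := 1; i < len(path); i++ {
		out = append(out, [2]int{path[i-1], path[i]})
	}
	return out
}
```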
| The one-way transition model simplifies testing. | ||
| We only test transitions into Adaptable topology and can stagger support | ||
| for different source topologies. |
But we also need to test scale-out/scale-down with Adaptable, which creates tons of possible paths we need to validate. Or not?
This is somewhat true. This bullet refers to testing the topology transition, not testing the functionality of the topology definition.
Adaptable topology will need scale-out / scale-in testing, but those could be dedicated lanes, and I think there is less there to test than you might think.
1 to 2 and 2 to 3 have nuances, but 3 to 4 and beyond behave the same for any number of nodes.
Scale-down is the same in reverse.
Inactive enhancement proposals go stale after 28d of inactivity. See https://github.com/openshift/enhancements#life-cycle for details. Mark the proposal as fresh by commenting /remove-lifecycle stale. If this proposal is safe to close now please do so with /close. /lifecycle stale
Stale enhancement proposals rot after 7d of inactivity. See https://github.com/openshift/enhancements#life-cycle for details. Mark the proposal as fresh by commenting /remove-lifecycle rotten. If this proposal is safe to close now please do so with /close. /lifecycle rotten
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
/remove lifecycle-rotten
/remove-lifecycle rotten
This enhancement introduces Adaptable topology, a new cluster-topology mode that enables clusters to dynamically adjust their behavior based on node count. This allows SingleReplica clusters to scale to multi-node configurations without redeployment.
Key features:
The proposal includes complete workflow descriptions, API extensions, test plans, and version skew strategy. Future stages will add AutomaticQuorumRecovery (AQR) to enable DualReplica-based resiliency for two-node configurations.