Add `definition_origin` field to type definitions #1009

duckki · 2025-10-23T05:00:30Z

Motivation

This PR fixes an issue where iter_origins() methods fail to return the Definition origin, even if the element has a (non-extension) definition. This can happen with schema definition and most type definitions.

Example

            type T # empty type definition

            extend type T { # an extension with a field
                field: Boolean
            }

iter_origin() method on the type T only returns an Extension, not Definition. That's because type T does not have fields nor directive applications and the current implementation does not record origins at all in this case.

Fix

This PR adds definition_origin field to SchemaDefinition and other type definition structs like Scalar and ObjectType. The definition_origin field is expected to have Some((ComponentOrigin::Definition) value if a schema element has a non-extension definition. Otherwise, it is expected to have None value.

The field actually hold a ComponentOrigin value, instead of being bool type, because iter_origins() method is expected to return a reference to a ComponentOrigin value and the field allows the method to return a reference to an object held by self.

Note: Unfortunately, this PR would be a breaking change.

Downstream fix PR: apollographql/router#8475

tninesling · 2025-10-24T14:45:50Z

crates/apollo-compiler/src/schema/mod.rs

    pub directives: DirectiveList,
+    /// Non-extension definition origin, if exists.
+    /// - We hold the origin here, so its reference can be returned from `iter_origins()`.
+    pub definition_origin: Option<ComponentOrigin>,


As you already called out, this will be a breaking change to the API. One thing I'd like to explore is if this is the right pattern for including this. Typically, we encode a Node plus a ComponentOrigin by wrapping a type in Component<T>, which is why we can get the origin for most of the children in the AST. Since the concept of origin is somewhat orthogonal to the data stored in each type, I think we should consider changing this to the following:

Schema::schema_definition should be a Component<SchemaDefinition> instead of Node<SchemaDefinition>

Similarly, we should update ExtendedType to hold Component<T> instead of Node<T> for each type

I think that would be better aligned with the current implementation of this library. I also have a couple other notes that I'm curious if you have an opinion on:

Knowing how we're consuming this downstream, I think the actual thing we want to check is if the ExtendedType we get came from an extension or not (i.e. I think it's a logical bug, or at least a wasted check, to check the origins of the children in the AST). This may or may not inform the final solution here, but I just wanted to call that out.

One unexpected thing I came across yesterday was that ExtensionId is actually an Arc-wrapped source span. I naively assumed it would be something like an AtomicUsize. This doesn't affect the current changes, but it does have bearing on my next comment.

Since we're adding ComponentOrigin to more of the places where we are holding Node<T>, and whatever solution we come to will likely be a breaking API change, I'm wondering if this is the right time to consolidate Component<T> and Node<T>. I don't think naively adding the origin as-is to the current node header would be performant enough, since historically we've cared about how many bytes of overhead we add to each Node. However, I could see some AtomicUsize or even a simple extension/non-extension boolean flag giving us enough information for our use case.

I'll stop there. Curious to get your thoughts.

There's also some context on a similar issue regarding DirectiveList in #851

Thanks for bringing up the #851.

Schema elements combine implicit def, explicit def and multiple extensions, thus naturally can have multiple origins or nothing at all (if it's all implicit). We could elect one origin to represent the Component, but unfortunately Component has no option to be "implicit" at this time.

If we used Component across the board, then the origin should be optional or has a new variant like Builtin and/or Implicit.

BTW, the origin election would be on this order: builtin/implicit < extension < definition. So, the highest origin will represent the element. That's enough info to fix this PR's issue without adding definition_origin field.

I'm not entirely sure what you mean by implicit in this case. At least in terms of how ComponentOrigin is currently set up, the only options are Definition or Extension. In the usual case, a type's origin should always be Definition, but we run into the case where it may only have extension definitions when we use the adopt_orphan_extensions option.

What I'm proposing is that we use ComponentOrigin::ExtensionId(_) to capture that case instead of definition_origin: None. I think that's more idiomatic with the current patterns in this crate.

A schema element like built-in types may not have definition nor extensions. We won't even have an extension id to store.

apollo-rs/crates/apollo-compiler/src/schema/from_ast.rs

Line 40 in ddb4ded

schema_definition: Node::new(SchemaDefinition {

Alternatively, we can treat implicit definitions to be the "Definition", but distinguished by Node::is_built-in(). But, we need to check that in iter_origins().

Thinking more about it, it will work without adding built-in/implicit variants.

Previously, I thought a built-in type like String could be extended with or without a base definition and only the latter is EXTENSION_WITH_NO_BASE error. That would mean that we need to tell built-in definition from explicit base definition.

However, I realized that situation won't happen:

Schema definitions are always allowed to be extended without a base

Thus, (built-in def + extension) and (explicit def + extension) behave the same.

Built-in types are not allowed to be extended at all (at least JS federation prevents it).

So, built-in types won't cause EXTENSION_WITH_NO_BASE errors anyways.

Rust composition allows to extend built-in scalar types at the moment, but built-in types are excluded from EXTENSION_WITH_NO_BASE checks.

duckki · 2025-10-29T02:24:53Z

#1012 is an alternative proposal that does not break API.

Add definition_origin fields to type definitions

bb9e3f3

duckki requested a review from a team as a code owner October 23, 2025 05:00

duckki mentioned this pull request Oct 23, 2025

fix(composition): fixed unexpected EXTENSION_WITH_NO_BASE errors from upgrading/merging apollographql/router#8475

Closed

10 tasks

tninesling self-assigned this Oct 23, 2025

duckki added the apollo-compiler-2.0 Potential breaking API changes label Oct 23, 2025

tninesling reviewed Oct 24, 2025

View reviewed changes

This was referenced Oct 27, 2025

Empty type definitions do not record origins #1010

Open

Add iter_orphan_extension_types method on SchemaBuilder #1012

Merged

duckki marked this pull request as draft October 29, 2025 02:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `definition_origin` field to type definitions #1009

Add `definition_origin` field to type definitions #1009

Uh oh!

duckki commented Oct 23, 2025 •

edited

Loading

Uh oh!

tninesling Oct 24, 2025

Uh oh!

tninesling Oct 24, 2025

Uh oh!

duckki Oct 25, 2025 •

edited

Loading

Uh oh!

tninesling Oct 27, 2025

Uh oh!

duckki Oct 27, 2025 •

edited

Loading

Uh oh!

duckki Oct 28, 2025

Uh oh!

duckki commented Oct 29, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add definition_origin field to type definitions #1009

Are you sure you want to change the base?

Add definition_origin field to type definitions #1009

Uh oh!

Conversation

duckki commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Example

Fix

Uh oh!

tninesling Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

tninesling Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

duckki Oct 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tninesling Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

duckki Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

duckki Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

duckki commented Oct 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add `definition_origin` field to type definitions #1009

Add `definition_origin` field to type definitions #1009

duckki commented Oct 23, 2025 •

edited

Loading

duckki Oct 25, 2025 •

edited

Loading

duckki Oct 27, 2025 •

edited

Loading

duckki commented Oct 29, 2025 •

edited

Loading