You Shook Me Alloc Night Long — finally sorted like we mean it.#125

Merged
dkcumming merged 2 commits into runtimeverification:master from cds-rs:feat/determinism
Mar 4, 2026

Conversation


@cds-amal cds-amal commented Feb 22, 2026

This PR replaces all index-based sort keys with content-derived ones, making the ordering of output vectors deterministic across runs. The one remaining source of cross-run non-determinism is adt_def (see below for why that can't be fixed here).

Context

For those unfamiliar: rustc stores types, allocations, and other values in central tables and refers to them by integer index. These indices are assigned in the order the compiler happens to intern items, which is not guaranteed to be the same between invocations. There are two independent reasons for this:

  1. Hash-map iteration order: Rustc uses hash maps heavily (often FxHashMap for speed rather than the standard library's RandomState-backed HashMap). Hash-map iteration order is not specified and depends on hash/table layout and insertion history. If the order of insertions varies (because upstream work completed in a different order, or because of platform or compiler version differences), then iterating the map and interning items in iteration order produces different interned indices.
  2. Parallel query evaluation: Rustc uses rayon for parallel compilation. Query evaluation, monomorphization collection, and codegen unit partitioning can all run work items concurrently. The order in which parallel tasks complete depends on OS thread scheduling, which can affect insertion order into those hash maps. So if type A gets interned before type B in one run because its query finished first, their indices may swap in the next run.

Either source alone is sufficient to produce different indices across runs.
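The effect can be seen with a toy example using the standard library's `HashMap` (nothing rustc-specific): the same entries inserted in different orders may iterate differently, but sorting by content erases the difference — which is exactly the move this PR makes.

```rust
use std::collections::HashMap;

// Collect keys and sort them by content, erasing iteration-order effects.
fn sorted_keys(m: &HashMap<&'static str, usize>) -> Vec<&'static str> {
    let mut keys: Vec<&str> = m.keys().copied().collect();
    keys.sort();
    keys
}

fn main() {
    // Same entries, inserted in different orders (as parallel queries might).
    let run_a = HashMap::from([("u8", 0), ("i32", 1), ("bool", 2)]);
    let run_b = HashMap::from([("bool", 0), ("u8", 1), ("i32", 2)]);
    // Iteration order is unspecified, but content-sorted output is identical.
    assert_eq!(sorted_keys(&run_a), sorted_keys(&run_b));
    println!("{:?}", sorted_keys(&run_a)); // ["bool", "i32", "u8"]
}
```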

The output vectors in collect_smir() were sorted like this:

```rust
allocs.sort_by(|a, b| a.alloc_id.to_index().cmp(&b.alloc_id.to_index()));
functions.sort_by(|a, b| a.0 .0.to_index().cmp(&b.0 .0.to_index()));
types.sort_by(|a, b| a.0.to_index().cmp(&b.0.to_index()));
spans.sort();
```

These `.to_index()` calls return the interned IDs described above. The original comment said "stabilise output (a bit)" — an honest admission that the sort only partially worked. And `uneval_consts`, built from `HashMap::into_iter()`, wasn't sorted at all.

The integration test harness worked around all of this with a jq normalization filter (normalise-filter.jq) that strips unstable IDs and re-sorts by content before diffing. But the raw JSON output itself was not reproducible.

What changed

All changes are in src/printer.rs, in the sorting section of collect_smir(). Here's the new sort strategy for each vector:

| Vector | Old sort key | New sort key | Tiebreaker |
| --- | --- | --- | --- |
| `functions` | `Ty.to_index()` | `Ty` display string (via `ty_pretty`) | interned index |
| `types` | `Ty.to_index()` | `Ty` display string (via `ty_pretty`) | interned index |
| `allocs` | `AllocId.to_index()` | content-derived string (see below) | none needed |
| `spans` | span index (opaque) | location tuple: `(filename, lo_line, lo_col, hi_line, hi_col)` | none needed |
| `uneval_consts` | unsorted | item name string | none needed |
| `items` | unchanged | already uses a content-based `Ord` impl | N/A |
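As a sketch of the spans strategy, here is a sort by the location tuple over a hypothetical `SpanLoc` struct — the field names are illustrative, not the actual types used in src/printer.rs:

```rust
// Hypothetical stand-in for span data; the real code sorts the actual
// span entries by the same (filename, lo, hi) location tuple.
#[derive(Debug, PartialEq)]
struct SpanLoc {
    filename: String,
    lo_line: usize,
    lo_col: usize,
    hi_line: usize,
    hi_col: usize,
}

// Sort by the content-derived location tuple instead of the opaque span index.
fn sort_spans(spans: &mut Vec<SpanLoc>) {
    spans.sort_by(|a, b| {
        (&a.filename, a.lo_line, a.lo_col, a.hi_line, a.hi_col)
            .cmp(&(&b.filename, b.lo_line, b.lo_col, b.hi_line, b.hi_col))
    });
}

fn main() {
    let mut spans = vec![
        SpanLoc { filename: "b.rs".into(), lo_line: 1, lo_col: 1, hi_line: 1, hi_col: 5 },
        SpanLoc { filename: "a.rs".into(), lo_line: 9, lo_col: 1, hi_line: 9, hi_col: 2 },
        SpanLoc { filename: "a.rs".into(), lo_line: 3, lo_col: 4, hi_line: 3, hi_col: 8 },
    ];
    sort_spans(&mut spans);
    // "a.rs" entries come first, ordered by line/column.
    assert_eq!(spans[0].filename, "a.rs");
    assert_eq!(spans[0].lo_line, 3);
}
```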

Allocation sort keys deserve some detail. The new alloc_sort_key() helper produces a content-derived string from each AllocInfo by matching on the GlobalAlloc variant:

| Variant | Sort key format | Example |
| --- | --- | --- |
| `Memory` | `0_Memory_` + zero-padded byte length | `"0_Memory_00000000000000000032"` |
| `Static` | `1_Static_` + def name | `"1_Static_MY_STATIC"` |
| `VTable` | `2_VTable_` + type string | `"2_VTable_dyn Trait"` |
| `Function` | `3_Function_` + instance name | `"3_Function_foo::bar"` |
The numeric prefix groups entries by variant kind. Byte length is zero-padded to 20 digits so that 32 sorts before 128 lexicographically. (Span locations are usize values that compare numerically, so no padding is needed there.)
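A quick sketch of why the padding matters (the `memory_key` helper here is illustrative, not the actual `alloc_sort_key` code): plain string comparison puts "128" before "32" because '1' < '3', and zero-padding to a fixed width restores numeric order.

```rust
// Zero-padding makes lexicographic order agree with numeric order.
fn memory_key(len: usize) -> String {
    format!("0_Memory_{:020}", len)
}

fn main() {
    // Without padding, "128" sorts before "32" ('1' < '3').
    assert!("128" < "32");
    // With 20-digit padding, 32 correctly sorts before 128.
    assert!(memory_key(32) < memory_key(128));
    println!("{}", memory_key(32)); // 0_Memory_00000000000000000032
}
```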

Three golden test files were regenerated because their normalized output changed due to the new ordering. The jq normalization filter doesn't fully sort TupleType and FunType entries, so those were sensitive to input order.

Remaining non-determinism: adt_def

We looked into stabilizing the adt_def field on EnumType/StructType/UnionType as well; it's another interned index that the jq filter strips for test normalization. Turns out it can't be dropped or replaced.

The reason: downstream consumers need adt_def as a cross-reference key to match AggregateKind::Adt(adt_def, ...) in MIR bodies with the corresponding type metadata entry. AggregateKind serialization comes from stable_mir (we don't control it), so both sides of the join have to use the same key format. The index is consistent within a single JSON file; it's just not stable across runs.

So adt_def remains the one known source of cross-run non-determinism in the output, and the jq filter still needs to strip it for golden test comparison. A comment was added on the field explaining this constraint. If we ever get the ability to customize AggregateKind serialization upstream, we could replace these indices with names, but that's a stable_mir change, not something we can do on our end. (See PR #64 for the full discussion.)

On the Ty display string tiebreaker

Ty's display impl goes through ty_pretty(), which calls to_string() on rustc's internal Ty. For monomorphized types (all generics resolved, no lifetime parameters), this should be injective: distinct types produce distinct strings. But there's a theoretical concern: could two distinct Ty values display identically — say, through some obscure lifetime or where-clause difference that gets elided in the display output?

We weren't confident enough to rule this out entirely, so the interned index is used as a tiebreaker: content-based ordering for the common case, with the index preserving a consistent (within-run) ordering for any hypothetical ties. If a tie does occur, the ordering at the tie point would be non-deterministic across runs, but we haven't observed this in practice.
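The string-plus-index tiebreak amounts to an ordinary chained comparison. A minimal sketch, assuming `(display string, interned index)` pairs — the strings stand in for `ty_pretty` output and the indices are made up:

```rust
// Content first; the interned index only breaks (hypothetical) display ties.
fn sort_type_entries(mut types: Vec<(&'static str, u32)>) -> Vec<(&'static str, u32)> {
    types.sort_by(|a, b| a.0.cmp(b.0).then(a.1.cmp(&b.1)));
    types
}

fn main() {
    let sorted = sort_type_entries(vec![
        ("u8", 7),
        ("alloc::string::String", 2),
        ("u8", 3), // hypothetical display-string tie
    ]);
    assert_eq!(sorted, vec![
        ("alloc::string::String", 2),
        ("u8", 3), // tie resolved by index, consistent within a run
        ("u8", 7),
    ]);
}
```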

What this means in practice

If you run stable-mir-json twice on the same input file and diff the raw JSON (no jq filter), the only differences will be the interned index values themselves (adt_def, alloc_id, Ty keys, etc.). The structural ordering of every output vector is now identical across runs. Before this change, even the order of types, functions, allocs, and spans could shuffle around.

The jq normalization filter is still needed for golden tests (to strip those interned IDs), but the sort operations in it are now redundant; they just confirm what the source already guarantees.

Performance

The format!("{}", ty) calls for sorting functions and types allocate strings. This is fine; the vectors are small (one entry per unique type or function in the program), and this code runs once at the end after all collection is done.

Test plan

  • cargo build compiles cleanly
  • make integration-test passes (all 28 tests)
  • Ran stable-mir-json twice on assert_eq.rs and diffed the raw JSON: identical modulo interned index values (adt_def only); all vector ordering matched exactly

automergerpr-permission-manager bot pushed a commit that referenced this pull request Mar 3, 2026
This is a follow-up to #131. That PR fixed a real panic
(non-builtin-deref Static/VTable allocations), but the fix landed as
three nearly identical match arms: Static, VTable, and Function, each
repeating the same "try `get_prov_ty`, fall back to opaque placeholder"
logic. The only meaningful difference between them was a single
predicate ("is this type usable directly?") and the debug log string.
This PR collapses them back into one arm, makes the predicate explicit,
and documents a previously unexplained jq filter.

## What changed

**`src/printer/mir_visitor.rs`**

The three arms now share a single code path. The predicate that actually
differs between them is captured in a `needs_recovery` variable:

- Static/VTable: `builtin_deref(true).is_none()` (not a reference, raw
pointer, or Box)
- Function: `!kind.is_fn_ptr()` (outer type isn't already a function
pointer)

Worth noting: these two predicates are *not* equivalent (a function
pointer passes `builtin_deref` but fails `is_fn_ptr`), which is why a
naive "just combine the arms" without the inner match would have changed
behavior for Function allocs. The inner match on `global_alloc` makes
this asymmetry visible rather than hiding it across separate arms.

The rest of the logic (try `get_prov_ty`, fall back to opaque
placeholder) is written once. Also dropped an unnecessary `.clone()` on
the `builtin_deref` call; it takes `&self`.

**`tests/integration/normalise-filter.jq`**

The `def_id` filter had a terse comment ("unrelated to this regression")
that didn't explain *why* it exists or why it's safe to strip globally.
Replaced it with a proper explanation: `def_id` values are interned
compiler indices (same class as `alloc_id` and `adt_def`) that are
consistent within a single rustc invocation but non-deterministic across
runs. Downstream consumers need them as cross-reference keys to join
`AggregateKind::Adt` in MIR bodies with type metadata entries, so they
can't be dropped from the output itself; we only strip them here for
golden-file comparison. (See #125 for the full picture on interned-index
non-determinism.)

## Test plan

- [ ] `cargo build` compiles cleanly
- [ ] `make integration-test` passes (no behavioral change; this is a
pure refactor)
…tput

Replace interned-index-based sorting with content-derived sort keys so
that JSON output is stable across compiler invocations (interned indices
are non-deterministic).

- allocs: sort by kind prefix + content (memory size, static name, etc.)
- functions: sort by type display string, tiebreak on index
- types: sort by type display string, tiebreak on index
- spans: sort by source location data
- uneval_consts: sort by name

Add -Dwarnings to clippy in style-check to match CI.
Update 3 golden files to reflect new sort order.
@cds-amal changed the title from "Smaller epsilon towards determinism" to "You shook me alloc night long and it's all sorted now" Mar 3, 2026
@cds-amal cds-amal marked this pull request as ready for review March 3, 2026 14:33
@cds-amal cds-amal requested a review from a team March 3, 2026 14:33
@cds-amal changed the title from "You shook me alloc night long and it's all sorted now" to "You Shook Me Alloc Night Long — finally sorted like we mean it." Mar 3, 2026
@cds-amal cds-amal requested review from Stevengre and dkcumming March 3, 2026 15:28
The previous alloc sort key for Memory variants used only the byte
length, so two allocations with the same size (e.g. "hello" and "world")
produced identical keys and fell back to non-deterministic AllocId order.

Replace the single alloc_sort_key with a three-tier comparison:

1. alloc_sort_tag: cheap &'static str for variant ordering
2. alloc_content_key: name/Display string for Static/VTable/Function
3. alloc_bytes: direct &[Option<u8>] slice comparison for Memory

This eliminates the collision without any intermediate string allocation
for the byte tiebreaker.
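A minimal sketch of that three-tier comparison, with a stand-in `AllocInfo` enum — the real variants come from stable_mir, and `VTable` is elided here for brevity:

```rust
use std::cmp::Ordering;

// Illustrative stand-in for the real allocation variants.
enum AllocInfo {
    Memory { bytes: Vec<Option<u8>> },
    Static { name: String },
    Function { name: String },
}

// Tier 1: cheap static tag for variant ordering.
fn tag(a: &AllocInfo) -> &'static str {
    match a {
        AllocInfo::Memory { .. } => "0_Memory",
        AllocInfo::Static { .. } => "1_Static",
        AllocInfo::Function { .. } => "3_Function",
    }
}

// Tier 2: name/display string for Static/Function (empty for Memory).
fn content_key(a: &AllocInfo) -> &str {
    match a {
        AllocInfo::Memory { .. } => "",
        AllocInfo::Static { name } | AllocInfo::Function { name } => name,
    }
}

// Tier 3: direct byte-slice comparison for Memory, no string allocation.
fn bytes(a: &AllocInfo) -> &[Option<u8>] {
    match a {
        AllocInfo::Memory { bytes } => bytes,
        _ => &[],
    }
}

fn cmp_allocs(a: &AllocInfo, b: &AllocInfo) -> Ordering {
    tag(a)
        .cmp(tag(b))
        .then_with(|| content_key(a).cmp(content_key(b)))
        .then_with(|| bytes(a).cmp(bytes(b)))
}

fn main() {
    // Same length, different content: the byte tier breaks the tie.
    let h = AllocInfo::Memory { bytes: b"hello".iter().map(|&x| Some(x)).collect() };
    let w = AllocInfo::Memory { bytes: b"world".iter().map(|&x| Some(x)).collect() };
    assert_eq!(cmp_allocs(&h, &w), Ordering::Less); // 'h' < 'w'
    // Variant tags keep Memory entries ahead of Static entries.
    let s = AllocInfo::Static { name: "MY_STATIC".into() };
    assert_eq!(cmp_allocs(&h, &s), Ordering::Less);
}
```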

@dkcumming dkcumming left a comment


Nice, love the tiebreakers! Things are Back In Black!

@dkcumming dkcumming merged commit 7da8e3b into runtimeverification:master Mar 4, 2026
5 checks passed
@cds-amal cds-amal deleted the feat/determinism branch March 4, 2026 16:36