`struct Av1Block`: narrow field types #1460

kkysen · 2025-10-12T07:06:04Z

The benchmark script found that the merge commit 8eb6932 (#951) was one of the largest perf regressions (+4.7% on 80 threads, +4.3% on 150 threads). While it touched a bunch of things, one significant part was f.frame_thread.b, a DisjointMut<Vec<Av1Block>>. Av1Block is currently 32 bytes, but could be 24 bytes. This PR narrows all of the field types (recursively) as much as possible so we know their more precise range. This will help us then re-arrange the layout (Av1Block is not passed to asm) and get it down to 24 bytes (this PR itself is perf neutral, although it should remove a bunch of bounds checks). Plus, this PR just makes things more idiomatic. Moreover, I think it may be possible to switch from DisjointMut to Relaxed atomics for Av1Block. It should also help make Av1Block: FromZeroes, which will let me optimize the .resize with new_zeroed_slice.

…MODES` table with a `const fn` w/ a `match`

…f a `match` since it optimizes better

Note that `rav1d_msac_decode_symbol_adapt16` already truncates its return to `< 16`, so the check is also a no-op here.

… handle negatives

randomPoison

Looks fine, though I'm a bit worried about all of the unwraps added thanks to using InRange in various places. Are you sure that adding all those checks doesn't degrade performance? Can you confirm that they're optimized out?

kkysen

Looks fine, though I'm a bit worried about all of the unwraps added thanks to using InRange in various places. Are you sure that adding all those checks doesn't degrade performance? Can you confirm that they're optimized out?

They generally should be. I'll go add the // Elided comments. Ones like Av1BlockInterRefIndex::new(N).unwrap() or Av1BlockInterRefIndex::new(N + rav1d_msac_decode_bool_adapt(...) as i8).unwrap() are elided since it knows N and the range of a bool. Some others using rav1d_msac_decode_symbol_adapt8 or rav1d_msac_decode_symbol_adapt16 are elided because they truncate to 8/16.

I think Av1BlockInterRefIndex::new(frame_hdr.skip_mode.refs[0]).unwrap(), doesn't, but we can also make frame_hdr.skip_mode.refs use an InRange type (didn't want to do too much just in this PR).

I also measured performance, and there's basically no difference (just noise, < 0.1% different).

kkysen added 13 commits October 12, 2025 01:44

struct Av1Block::skip: make a bool

066534b

struct Av1Block::skip_mode: make a bool

6ea3105

struct Av1Block::bs: make a BlockSize

718912a

struct Av1BlockInter: #[derive(Default)]

8084db3

fn decode_b: make is_globalmv a bool

db0f4ae

enum CompInterPredMode: make a real enum

83a1f2c

enum InterPredMode: make a real enum

372fd1d

fn CompInterPredMode::split: replace `static DAV1D_COMP_INTER_PRED_…

667390b

…MODES` table with a `const fn` w/ a `match`

trait DefaultValue: impl for [T; N]

9c22fba

fn CompInterPredMode::split: switch to use an enum_map! instead o…

d87c418

…f a `match` since it optimizes better

type WedgeIdx: make an InRange<u8, 0, 15>

a19d479

Note that `rav1d_msac_decode_symbol_adapt16` already truncates its return to `< 16`, so the check is also a no-op here.

struct InRange: use i128 instead of u128 of "catch-all" type to…

d940eea

… handle negatives

struct Av1BlockInter::r#ref: make indices InRange<i8, -1, 6>s

bbe67a2

kkysen requested a review from randomPoison October 12, 2025 07:06

randomPoison approved these changes Oct 13, 2025

View reviewed changes

kkysen commented Oct 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`struct Av1Block`: narrow field types #1460

`struct Av1Block`: narrow field types #1460

Uh oh!

kkysen commented Oct 12, 2025

Uh oh!

randomPoison left a comment

Uh oh!

kkysen left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

struct Av1Block: narrow field types #1460

Are you sure you want to change the base?

struct Av1Block: narrow field types #1460

Uh oh!

Conversation

kkysen commented Oct 12, 2025

Uh oh!

randomPoison left a comment

Choose a reason for hiding this comment

Uh oh!

kkysen left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

`struct Av1Block`: narrow field types #1460

`struct Av1Block`: narrow field types #1460