DNM: Reapply "Incremental compaction (#32381)" #32654

Draft
wants to merge 18 commits into main from pr-test-test
Conversation

@def- (Contributor) commented Jun 4, 2025

This reverts commit 4cad141.

Checklist

  • This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
  • This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
  • If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
  • If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
  • If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.

DAlperin and others added 15 commits May 28, 2025 15:39
Towards MaterializeInc/database-issues#9191

Today, we have no good way to split the work of compaction into smaller
parts. This becomes a problem as datasets and clusters continue to grow:
if a compaction takes a significant amount of time, there is a risk that
the process running it will not live long enough to finish (for whatever
reason: failure, shutdown, scheduling, etc.).

This PR aims to improve the situation when compacting many shorter
runs. We already split the work into "chunks" based on the size of the
runs, but we don't write any of the results back into state until all
chunks are complete. This is suboptimal: imagine a large compaction is
chugging along, 99 of its 100 chunks of work are done, and the cluster
shuts down before the last one can finish. All of that work is wasted.

This PR "checkpoints" its work into state after each chunk is done, so
in the example above only the partially finished 100th chunk is lost.
(Incremental work within a chunk will be the subject of future work.)

There is a tradeoff here, though: checkpointing means writing to state
more often, which risks putting CRDB under additional load. We currently
execute 650-750 writes per second to each of our CRDB nodes in us-east-1
on average, and on the order of 200 chunks per second are queued up
there, so there is real potential risk. If every chunk completed
immediately and concurrently, that would add roughly 200 state writes
per second on top of today's baseline, a 25-30% increase in the worst
case, significantly pushing the QPS of our CRDB cluster (I believe the
cluster can handle it based on the resource usage I'm seeing, but
setting that aside...). In practice, not every chunk across every
environment will complete immediately and concurrently, so the impact
on QPS is likely to be well below 200/s. That said, we don't have a
sense of _per chunk_ timing, so it's hard to estimate precisely. An
anecdotal test in staging didn't reveal any undue load.

If this remains a concern, some form of backpressure could be
implemented to batch the applies.
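If batching ever becomes necessary, one possible shape for it (not
implemented in this PR; all names here are hypothetical) is to buffer
finished chunk results and apply them to state in groups, bounding the
extra write rate:

```rust
/// Illustrative only: buffer finished chunk results and flush them to
/// state in groups, so N finished chunks cost one state write instead
/// of N.
struct BatchedApplier {
    pending: Vec<String>, // stand-in for finished chunk results
    max_pending: usize,   // flush once this many results have accumulated
}

impl BatchedApplier {
    fn new(max_pending: usize) -> Self {
        BatchedApplier { pending: Vec::new(), max_pending }
    }

    /// Record a finished chunk; only write to state when the buffer is full.
    fn record(&mut self, result: String) {
        self.pending.push(result);
        if self.pending.len() >= self.max_pending {
            self.flush();
        }
    }

    /// One state write (e.g. one CRDB round trip) for the whole group.
    fn flush(&mut self) {
        if self.pending.is_empty() {
            return;
        }
        println!("applying {} chunk results in one write", self.pending.len());
        self.pending.clear();
    }
}

fn main() {
    let mut applier = BatchedApplier::new(10);
    for chunk_id in 0..25 {
        applier.record(format!("chunk {chunk_id}"));
    }
    // Flush the tail that never filled a full group.
    applier.flush();
}
```

In practice a count threshold like this would likely be paired with a
time-based flush so that finished work is never held back indefinitely.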
@def- force-pushed the pr-test-test branch 2 times, most recently from 9b9a6b1 to 935abd9 on June 5, 2025 at 17:48