fix: batch recovery jobs to avoid 16MB read limit#172

Closed
sethconvex wants to merge 5 commits into main from fix/recovery-batch-size

Conversation

sethconvex (Contributor) commented Mar 3, 2026

Summary

  • Recovery was sending all stale running jobs to a single recover mutation, which could exceed Convex's 16MB read limit when many jobs needed recovery simultaneously (e.g. high maxParallelism + server restart)
  • Batch recovery jobs into chunks of 50, following the existing batching pattern used for cancellations (CANCELLATION_BATCH_SIZE = 64)
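The batching described above can be sketched as a simple chunking step. This is an illustrative sketch, not the PR's actual code: the `chunk` helper is hypothetical, and only `RECOVERY_BATCH_SIZE` and the batch size of 50 come from the PR description.

```typescript
// Batch size taken from the PR description; kept conservative because
// each job's work document can be large.
const RECOVERY_BATCH_SIZE = 50;

// Hypothetical helper: split a list of stale job ids into fixed-size
// batches, so each `recover` mutation reads a bounded amount of data
// instead of all stale jobs at once.
function chunk<T>(items: T[], size: number): T[][] {
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    batches.push(items.slice(i, i + size));
  }
  return batches;
}
```

Each batch would then be passed to its own scheduled `recover` call, so no single mutation's reads scale with the total number of stale jobs.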

Test plan

  • Existing loop and recovery tests pass (31 tests)
  • Verify with a high maxParallelism deployment that recovery no longer hits the 16MB limit

🤖 Generated with Claude Code

Summary by CodeRabbit

  • Refactor
    • Recovery now processes jobs in discrete batches for more efficient, incremental recovery.
    • Recovery flow now correctly continues when more old jobs remain, preventing premature completion of recovery.

Recovery was sending all stale running jobs to a single `recover`
mutation, which could exceed Convex's 16MB read limit when many jobs
needed recovery at once (e.g. high maxParallelism + server restart).
Batch into chunks of 50, matching the pattern used for cancellations.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
coderabbitai bot commented Mar 3, 2026

Warning

Rate limit exceeded

@sethconvex has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 8 minutes and 15 seconds before requesting another review.


📥 Commits

Reviewing files that changed from the base of the PR and between 96380fe and 639c510.

📒 Files selected for processing (1)
  • src/component/loop.ts
📝 Walkthrough


Updates recovery to process old jobs in sequential batches of 50. The recovery handler now returns whether more candidates remain; the main flow sets state.lastRecovery to 0n when additional batches exist, otherwise to the current segment.

Changes

  • src/component/loop.ts (Recovery Batch Processing): Introduce RECOVERY_BATCH_SIZE = 50; change handleRecovery to process up to 50 recovery jobs per invocation and return a boolean indicating whether more candidates remain; update main to set state.lastRecovery to 0n when more batches remain, otherwise to the current segment.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~8 minutes

Poem

🐰 Fifty hops in tidy rows,
Old tasks find new paths where wind blows,
Quiet batches, one by one,
Recovery’s work now neatly done,
I nibble carrots and hum a fun tune. 🥕

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

  • Docstring Coverage: ⚠️ Warning. Docstring coverage is 0.00%, below the required 80.00% threshold. Resolution: write docstrings for the functions missing them.

✅ Passed checks (2 passed)

  • Description Check: ✅ Passed. Check skipped; CodeRabbit's high-level summary is enabled.
  • Title Check: ✅ Passed. The title 'fix: batch recovery jobs to avoid 16MB read limit' directly and clearly describes the main change: batching recovery jobs to prevent exceeding Convex's 16MB read limit. It is concise, specific, and accurately represents the changeset.


pkg-pr-new bot commented Mar 3, 2026


npm i https://pkg.pr.new/get-convex/workpool/@convex-dev/workpool@172

commit: 639c510

@sethconvex sethconvex requested a review from ianmacartney March 3, 2026 17:52
sethconvex and others added 2 commits March 3, 2026 09:56
The initial batch fix only batched the scheduled `recover` calls, but
`handleRecovery` inside `main` still read the work document for every
old running job, unbounded. It now processes at most RECOVERY_BATCH_SIZE
candidates per iteration and signals `main` to re-run recovery
immediately if more remain.
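The bounded per-iteration processing this commit message describes might look roughly like the following. It is a sketch, not the component's code: `processOne` and the in-memory candidate list are hypothetical stand-ins for the real database reads, and only `RECOVERY_BATCH_SIZE` comes from the PR.

```typescript
const RECOVERY_BATCH_SIZE = 50;

// Process at most one batch of recovery candidates, then report whether
// more work remains so the caller (main) can schedule another recovery
// pass right away rather than declaring recovery complete.
function recoverBatch<T>(
  candidates: T[],
  processOne: (candidate: T) => void
): boolean {
  const batch = candidates.slice(0, RECOVERY_BATCH_SIZE);
  for (const candidate of batch) {
    processOne(candidate);
  }
  return candidates.length > RECOVERY_BATCH_SIZE;
}
```

The key property is that the per-invocation read volume is bounded by the batch size, regardless of how many jobs went stale.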

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
```ts
if (r.started >= oldEnoughToConsider) {
  return null;
}
const work = await ctx.db.get(r.workId);
```
Member

this is still loading the old work - which may be big

sethconvex and others added 2 commits March 3, 2026 10:11
Work documents can store arbitrarily large fnArgs, so use a
conservative batch size to stay well under the 16MB read limit.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@sethconvex sethconvex closed this Mar 3, 2026

2 participants