Token truncation handling #2003

jxnl · 2026-01-16T14:41:24Z

feat: add truncation-aware retry handling with failfast and auto-ramp

Describe your changes

This enhancement introduces two new parameters to improve how Instructor handles model response truncation:

failfast_on_truncation=True: When enabled, Instructor immediately raises an IncompleteOutputException if the model's response was truncated by the max_tokens limit. This avoids wasting validation retries when the problem is insufficient output length rather than malformed JSON.
max_tokens_auto_ramp: When truncation is detected, Instructor automatically increases max_tokens (or the provider-specific equivalent, such as max_output_tokens or maxTokens) for subsequent retries. The ramp is configurable with a multiplier, an optional cap, and max_attempts. The token budget grows only when needed, which is intended to help with rate limits and concurrency.

These features are integrated into the instructor.patch function and the Instructor client's create methods, and are supported by updated documentation and new unit tests.
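
As a rough illustration of the two behaviours described above, here is a minimal, self-contained sketch. It does not use Instructor and is not the code in this PR: the names AutoRamp, next_max_tokens, call_with_ramp, IncompleteOutputError, and fake_model are hypothetical, and the configuration shape is an assumption based only on the multiplier, cap, and max_attempts options named in the description.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


class IncompleteOutputError(Exception):
    """Local stand-in for Instructor's IncompleteOutputException (hypothetical)."""


@dataclass
class AutoRamp:
    # Assumed config shape: the PR only names these three options.
    multiplier: float = 2.0
    cap: Optional[int] = None
    max_attempts: int = 3


def next_max_tokens(current: int, ramp: AutoRamp) -> int:
    """Grow the budget by the multiplier, never exceeding the optional cap."""
    grown = max(current + 1, int(current * ramp.multiplier))
    return grown if ramp.cap is None else min(grown, ramp.cap)


def call_with_ramp(
    call_model: Callable[[int], Tuple[str, bool]],
    max_tokens: int,
    ramp: Optional[AutoRamp] = None,
) -> str:
    """Call the model; on truncation either fail fast or retry with more tokens.

    call_model(max_tokens) must return (text, truncated).
    With ramp=None this behaves like fail-fast: the first truncation raises.
    """
    ramps_used = 0
    while True:
        text, truncated = call_model(max_tokens)
        if not truncated:
            return text
        if ramp is None or ramps_used >= ramp.max_attempts:
            raise IncompleteOutputError(f"truncated at max_tokens={max_tokens}")
        new_budget = next_max_tokens(max_tokens, ramp)
        if new_budget <= max_tokens:
            # The cap has been reached, so another retry cannot help.
            raise IncompleteOutputError(f"truncated at cap max_tokens={max_tokens}")
        max_tokens = new_budget
        ramps_used += 1


def fake_model(max_tokens: int) -> Tuple[str, bool]:
    """Toy model whose full answer is 300 characters long (one char per 'token')."""
    full = "x" * 300
    return full[:max_tokens], max_tokens < len(full)
```

With these assumptions, call_with_ramp(fake_model, 100, AutoRamp(cap=512)) retries at budgets of 200 and then 400 before returning the full answer, while call_with_ramp(fake_model, 100) raises on the first truncated response instead of spending a validation retry.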

Issue ticket number and link

567-252, #1962

Checklist before requesting a review

I have performed a self-review of my code
If it is a core feature, I have added thorough tests.
If it is a core feature, I have added documentation.

Linear Issue: 567-252

Co-authored-by: jason <[email protected]>

cursor · 2026-01-16T14:41:26Z

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
Learn more about Cursor Agents

cloudflare-workers-and-pages · 2026-01-16T14:41:30Z

Deploying with Cloudflare Workers

The latest updates on your project. Learn more about integrating Git with Workers.

Status	Name	Latest Commit	Preview URL	Updated (UTC)
✅ Deployment successful! View logs	instructor	`c807efa`	Commit Preview URL Branch Preview URL	Jan 16 2026, 02:41 PM

feat(retry): handle truncation retries

c807efa

Co-authored-by: jason <[email protected]>

github-actions bot added the documentation, enhancement, and size:M labels · Jan 16, 2026
