Skip to content

Commit 4e10b08

Browse files
chore/thinking-update
1 parent aa5cde5 commit 4e10b08

File tree

1 file changed

+21
-0
lines changed

1 file changed

+21
-0
lines changed

openapi.yaml

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -16557,6 +16557,27 @@ components:
1655716557
default: false
1655816558
stream_options:
1655916559
$ref: "#/components/schemas/ChatCompletionStreamOptions"
16560+
thinking:
16561+
type: object
16562+
nullable: true
16563+
description: |
16564+
View the thinking/reasoning tokens as part of your response. Thinking models produce a long internal chain of thought before generating a response. Supported only for specific Claude models on Anthropic, Google Vertex AI, and AWS Bedrock. Requires setting `strict_openai_compliance = false` in your API call.
16565+
properties:
16566+
type:
16567+
type: string
16568+
enum: ["enabled", "disabled"]
16569+
description: Enables or disables the thinking mode capability.
16570+
default: "disabled"
16571+
budget_tokens:
16572+
type: integer
16573+
description: |
16574+
The maximum number of tokens to allocate for the thinking process.
16575+
A higher token budget allows for more thorough reasoning but may increase overall response time.
16576+
minimum: 1
16577+
example: 2030
16578+
required:
16579+
- type
16580+
example: { "type": "enabled", "budget_tokens": 2030 }
1656016581
temperature:
1656116582
type: number
1656216583
minimum: 0

0 commit comments

Comments
 (0)