
feat: use openrouter's builtin metric #8314


Open
wants to merge 1 commit into main

Conversation

happyZYM
Contributor

What this PR does

Before this PR: model prices had to be configured manually, and spend was metered using locally estimated token counts.

After this PR: for the OpenRouter provider, its builtin usage accounting is used directly (https://openrouter.ai/docs/use-cases/usage-accounting).
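For reference, usage accounting is enabled by adding a `usage` object to the chat completion request, per the linked documentation. The sketch below is illustrative only: the endpoint URL and `usage.include` flag come from the docs, while the helper names and the exact response fields read back (`prompt_tokens`, `completion_tokens`, `cost`) should be treated as assumptions.

```python
import json
import urllib.request

OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions"


def build_payload(model: str, messages: list) -> dict:
    """Build a chat completion request with usage accounting enabled."""
    return {
        "model": model,
        "messages": messages,
        # Per https://openrouter.ai/docs/use-cases/usage-accounting, this
        # asks OpenRouter to return native token counts and cost in credits.
        "usage": {"include": True},
    }


def extract_usage(response: dict) -> dict:
    """Pull the usage block out of a non-streaming completion response."""
    usage = response.get("usage", {})
    return {
        "prompt_tokens": usage.get("prompt_tokens"),
        "completion_tokens": usage.get("completion_tokens"),
        "cost": usage.get("cost"),  # credits spent, as reported by OpenRouter
    }


def send(payload: dict, api_key: str) -> dict:
    """POST the request to OpenRouter (network call; untested sketch)."""
    req = urllib.request.Request(
        OPENROUTER_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

With this shape, the provider no longer needs a local price table: the `cost` field can be recorded directly instead of multiplying estimated token counts by configured prices.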

Checklist

This checklist is not enforced, but it is a reminder of items that could be relevant to every PR.
Approvers are expected to review this list.

@vaayne
Collaborator

vaayne commented Jul 20, 2025

https://openrouter.ai/docs/api-reference/get-a-generation
To get OpenRouter's exact token usage information, you need to call this endpoint. The token counts returned by the Chat API are estimated using the gpt-4 tiktoken tokenizer and are not exact figures.
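A minimal sketch of querying the generation stats endpoint linked above. The path and `id` query parameter follow the linked API reference; the response field names used here (`native_tokens_prompt`, `native_tokens_completion`, `total_cost`) are assumptions based on that reference and should be verified against the live API.

```python
import json
import urllib.request


def fetch_generation_stats(generation_id: str, api_key: str) -> dict:
    """GET /api/v1/generation?id=... (network call; untested sketch)."""
    req = urllib.request.Request(
        f"https://openrouter.ai/api/v1/generation?id={generation_id}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def native_token_counts(stats: dict) -> tuple:
    """Extract the native (per-model-tokenizer) counts and total cost.

    Field names are assumed from the API reference, not verified here.
    """
    data = stats.get("data", {})
    return (
        data.get("native_tokens_prompt"),
        data.get("native_tokens_completion"),
        data.get("total_cost"),
    )
```

The drawback of this route is the extra round trip per completion, which is what motivates using the in-response usage accounting instead.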

@happyZYM
Contributor Author

https://openrouter.ai/docs/api-reference/get-a-generation To get OpenRouter's exact token usage information, you need to call this endpoint. The token counts returned by the Chat API are estimated using the gpt-4 tiktoken tokenizer and are not exact figures.

The documentation at https://openrouter.ai/docs/api-reference/overview#querying-cost-and-stats says:

The token counts that are returned in the completions API response are not counted via the model’s native tokenizer. Instead it uses a normalized, model-agnostic count (accomplished via the GPT4o tokenizer). This is because some providers do not reliably return native token counts. This behavior is becoming more rare, however, and we may add native token counts to the response object in the future.

whereas the documentation at https://openrouter.ai/docs/use-cases/usage-accounting says:

When enabled, the API will return detailed usage information including:

    Prompt and completion token counts using the model’s native tokenizer
    Cost in credits
    Reasoning token counts (if applicable)
    Cached token counts (if available)

This information is included in the last SSE message for streaming responses, or in the complete response for non-streaming requests.

From my actual testing so far (I tested mainstream models including gemini, claude, gpt, and qwen), behavior has indeed switched to what the second document describes: with usage accounting enabled in the chat completion request, the real token counts are returned.
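For the streaming case the quoted documentation mentions, the usage object arrives in the last SSE message, so a client can scan the `data:` lines and keep the last non-empty usage it sees. A minimal sketch, assuming standard `text/event-stream` framing and the OpenAI-style `[DONE]` sentinel; the exact chunk shape is taken from the documentation excerpt above:

```python
import json


def usage_from_sse(lines):
    """Return the usage object from the last SSE chunk that carries one.

    Per the usage-accounting docs, this is the final message of the
    stream when `usage: {"include": true}` is set on the request.
    """
    usage = None
    for line in lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip comments, event names, and blank keep-alives
        body = line[len("data:"):].strip()
        if body == "[DONE]":
            break
        chunk = json.loads(body)
        if chunk.get("usage"):
            usage = chunk["usage"]
    return usage
```

This keeps the metering logic identical for streaming and non-streaming requests: both end with a single authoritative usage object rather than a locally estimated count.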
