v1.67.1-nightly
·
4 commits
to 44264ab6d696782f7d372a90c638c07f64cd5e19
since this release
What's Changed
- [UI] Bug Fix, team model selector by @ishaan-jaff in #10171
- [Bug Fix] Auth Check, Fix typing to ensure case where model is None is handled by @ishaan-jaff in #10170
- [Docs] Responses API by @ishaan-jaff in #10172
- Litellm release notes 04 19 2025 by @krrishdholakia in #10169
- fix(transformation.py): pass back in gemini thinking content to api by @krrishdholakia in #10173
- Litellm docs SCIM by @ishaan-jaff in #10174
- fix(common_daily_activity.py): support empty entity id field by @krrishdholakia in #10175
- fix(proxy_server.py): pass llm router to get complete model list by @krrishdholakia in #10176
- Model pricing updates for Azure & VertexAI by @marty-sullivan in #10178
- fix(bedrock): wrong system prompt transformation by @hewliyang in #10120
- Fix: Potential SQLi in spend_management_endpoints.py by @n1lanjan in #9878
- Handle edge case where user sets model_group inside model_info + Return hashed_token in
token
field on/key/generate
by @krrishdholakia in #10191 - Remove user_id from url by @krrishdholakia in #10192
- [Feat] Pass through endpoints - ensure
PassthroughStandardLoggingPayload
is logged and contains method, url, request/response body by @ishaan-jaff in #10194 - [Feat] Add Responses API - Routing Affinity logic for sessions by @ishaan-jaff in #10193
- [Feat] Add infinity embedding support (contributor pr) by @ishaan-jaff in #10196
- [Bug Fix] caching does not account for thinking or reasoning_effort config by @ishaan-jaff in #10140
- Gemini-2.5-flash improvements by @krrishdholakia in #10198
Full Changelog: v1.67.0-nightly...v1.67.1-nightly
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.67.1-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 220.0 | 263.64700999835935 | 6.1132795166960605 | 0.0 | 1829 | 0 | 199.11094299999377 | 4358.182531000011 |
Aggregated | Passed ✅ | 220.0 | 263.64700999835935 | 6.1132795166960605 | 0.0 | 1829 | 0 | 199.11094299999377 | 4358.182531000011 |