Release v0.2.17 · meta-llama/llama-stack

Highlights

feat(tests): introduce inference record/replay to increase test reliability by @ashwinb in #2941
fix(library_client): improve initialization error handling and prevent AttributeError by @mattf in #2944
fix: use OLLAMA_URL to activate Ollama provider in starter by @ashwinb in #2963
feat(UI): adding MVP playground UI by @franciscojavierarceo in #2828
Standardization of errors (@nathan-weinberg)
feat: Enable DPO training with HuggingFace inline provider by @Nehanth in #2825
chore: rename templates to distributions by @ashwinb in #3035

What's Changed

docs: update using llama stack as library docs by @Bobbins228 in #2931
feat: add base64 encoded PDF support for OpenAI Chat Completions by @nathan-weinberg in #2881
fix: error on failed job, do not wait for timeout by @mattf in #2945
fix: Update SFTConfig parameter to fix CI and Post Training Workflow by @Nehanth in #2948
feat(tests): introduce inference record/replay to increase test reliability by @ashwinb in #2941
chore: Delete coverage-badge by @franciscojavierarceo in #2950
fix: use same image_name logic for build & run config by @mattf in #2949
feat(test): record agents, safety and vector_io integration tests by @ashwinb in #2952
chore(ci): disable post training tests by @ashwinb in #2953
feat(tests): record responses for evals and telemetry tests by @ashwinb in #2954
feat(ci): use replay mode, setup ollama if specific label exists on PR by @ashwinb in #2955
feat(ci): keep only one re-recording job because independent recordings will conflict by @ashwinb in #2956
fix(library_client): improve initialization error handling and prevent AttributeError by @mattf in #2944
fix: use OLLAMA_URL to activate Ollama provider in starter by @ashwinb in #2963
chore: fix k8s config by @ehhuang in #2959
docs: part 1 - fix warnings in documentation generation by @kelbrown20 in #2861
fix(ci): slightly update workflow trigger by @ashwinb in #2966
chore: standardize model not found error by @nathan-weinberg in #2964
fix: fix No module named 'ollama' in test_inference_recordings.py by @r3v5 in #2967
chore: standardize dataset not found error by @nathan-weinberg in #2962
chore: standardize vector store not found error by @nathan-weinberg in #2968
docs: Add detailed docstrings to API models and update OpenAPI spec by @saiprashanths in #2889
fix: remove redundant code from unregister_dataset by @nathan-weinberg in #2971
fix(ci): more correct concurrency key for workflows by @ashwinb in #2973
feat(UI): adding MVP playground UI by @franciscojavierarceo in #2828
chore(rename): move llama_stack.distribution to llama_stack.core by @ashwinb in #2975
feat: Enable DPO training with HuggingFace inline provider by @Nehanth in #2825
fix: post_training ci by @cdoern in #2984
fix: remove unused DPO parameters from schema and tests by @Nehanth in #2988
docs: Reformat external provider documentation by @kelbrown20 in #2982
chore: Remove coverage badge from README.md by @ChristianZaccaria in #2976
fix: remove redundant code from unregister_vector_db by @nathan-weinberg in #2983
fix: Post Training Model change in Tests in order to make it less intensive by @Nehanth in #2991
fix: integration tests not triggering on PR open by @cdoern in #2985
feat(ci): add support for running vision inference tests by @ashwinb in #2972
feat: Add clear error message when API key is missing by @mattf in #2992
fix(ci): simplify integration tests replay mode by @ashwinb in #2997
feat(ci): only run on 3.12, run on both 3.12 and 3.13 nightly by @ashwinb in #3000
chore: Enabling Integration tests for Weaviate by @franciscojavierarceo in #2882
feat(ci): introduce workflow for re-recording inference outputs by @ashwinb in #3002
test: use llama stack build when starting server by @ehhuang in #2999
feat: Add openAI compatible APIs to Qdrant by @varshaprasad96 in #2465
chore: Update README for supported DBs by @franciscojavierarceo in #3005
fix: sambanova inference provider by @mattf in #2996
fix(recording): endpoint resolution by @ehhuang in #3013
chore: create integration-tests script by @ehhuang in #3016
refactor: remove Conda support from Llama Stack by @r3v5 in #2969
chore(python-deps): bump huggingface-hub from 0.34.2 to 0.34.3 by @dependabot[bot] in #3028
chore: CI action names by @ehhuang in #3014
refactor: Remove double filtering based on score threshold by @varshaprasad96 in #3019
test: Implement vector store search test by @varshaprasad96 in #3001
feat: switch to async completion in LiteLLM OpenAI mixin by @eranco74 in #3029
chore(ci): remove reportlab dep by @mattf in #3033
chore(python-deps): bump weaviate-client from 4.16.4 to 4.16.5 by @dependabot[bot] in #3026
chore(python-deps): bump opentelemetry-exporter-otlp-proto-http from 1.35.0 to 1.36.0 by @dependabot[bot] in #3027
chore: rename templates to distributions by @ashwinb in #3035
chore(python-deps): bump openai from 1.97.1 to 1.98.0 by @dependabot[bot] in #3025
chore: standardize tool group not found error by @nathan-weinberg in #2986
chore: standardize session not found error by @nathan-weinberg in #3031
chore: Fixing Markdown renderer by @franciscojavierarceo in #3038
chore: fix: integration tests failures marked as successful by @ehhuang in #3039

New Contributors

@saiprashanths made their first contribution in #2889

Full Changelog: v0.2.16...v0.2.17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.2.17

Highlights

What's Changed

New Contributors

Contributors

Uh oh!