You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
LiteLLM supports embedding models via HTTP (OpenAI-compatible APIs), including Mistralai embeddings.
However, in some production setups—particularly with the latest Mistralai-embed model (mistral-embed-2312)—embeddings are served over non-HTTP backends (e.g., NATS) with binary serialization (pickle) and pre-tokenized inputs. LiteLLM currently assumes JSON over HTTP, which makes integration difficult without a separate adapter.
Proposal
Add a pluggable transport + serialization interface for embeddings, allowing:
Custom backends (NATS, gRPC, RPC, etc.)
Binary payloads and pre-tokenized inputs
Reuse of LiteLLM’s model abstraction, routing, and observability
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
LiteLLM supports embedding models via HTTP (OpenAI-compatible APIs), including Mistralai embeddings.
However, in some production setups—particularly with the latest Mistralai-embed model (mistral-embed-2312)—embeddings are served over non-HTTP backends (e.g., NATS) with binary serialization (pickle) and pre-tokenized inputs. LiteLLM currently assumes JSON over HTTP, which makes integration difficult without a separate adapter.
Proposal
Add a pluggable transport + serialization interface for embeddings, allowing:
Custom backends (NATS, gRPC, RPC, etc.)
Binary payloads and pre-tokenized inputs
Reuse of LiteLLM’s model abstraction, routing, and observability
Example config:
This would simplify integration for high-performance internal pipelines using mistral-embed-2312.
Willingness to Contribute
Happy to help with design discussion.
Beta Was this translation helpful? Give feedback.
All reactions