Skip to content

[FEATURE] Support Vertex AI for generation and embedding #691

@badmonster0

Description

@badmonster0

We already have Google Gemini integration, which is based on Google AI Studio API.

Google also have Vertex API (part of Google Cloud), which is recommended for production usage based on here:

Google AI Studio

Experiment, prototype, and deploy. Google AI Studio is the fast path for developers, students, and researchers who want to try Gemini models and get started building with the Gemini Developer API.

Vertex AI

Build AI agents and integrate generative AI into your applications, Google Cloud offers Vertex AI, a single, fully-managed, unified development platform for using Gemini models and other third party models at scale.

Vertex AI APIs:

We can use the LlmApiType.VERTEX_AI enum for Vertex AI API (versus LlmApiType.GEMINI for AI studio API. This follows similar names as LiteLLM, which uses vertex_ai/ vs gemini/).

Code locations:


❤️ Contributors, please refer to 📙Contributing Guide.
Unless the PR can be sent immediately (e.g. just a few lines of code), we recommend you to leave a comment on the issue like I'm working on it or Can I work on this issue? to avoid duplicating work. Our Discord server is always open and friendly.

Metadata

Metadata

Assignees

Labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions