Support prompting with media files #201

longseespace · 2024-08-09T03:49:49Z

Description of the feature request:

The Gemini API supports uploading media files separately from the prompt input, allowing your media to be reused across multiple requests and multiple prompts.

https://ai.google.dev/gemini-api/docs/prompting_with_media?lang=python
https://ai.google.dev/api/files

What problem are you trying to solve with this feature?

Add the ability to prompt a document from a client

Any other information you'd like to share?

No response

andrewheard · 2024-08-09T22:24:52Z

Hi @longseespace, it's possible to use media files that have already been uploaded with the server-side SDKs (Python, Go, Node.js) or REST APIs using fileData in the Swift SDK, e.g.:

let content = try await model.generateContent(
  ModelContent.Part.fileData(
    mimetype: "image/jpeg",
    uri: "https://generativelanguage.googleapis.com/v1beta/files/some-hash"
  ),
  "What is in this image?"
)

Unfortunately, based on our current engineering plan and product backlog, there is no plan to support uploading files using the Swift SDK in the near term. As a potential alternative, the similar product Vertex AI for Firebase SDK supports media uploaded with the Cloud Storage for Firebase SDK. This guide shows how to use the two SDKs together: https://firebase.google.com/docs/vertex-ai/solutions/cloud-storage

andrewheard added status:wontfix This will not be worked on type:feature request New feature/request/enhancement labels Aug 9, 2024

andrewheard mentioned this issue Aug 21, 2024

Prompts with video failing with error 500 #203

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support prompting with media files #201

Support prompting with media files #201

longseespace commented Aug 9, 2024

andrewheard commented Aug 9, 2024

Support prompting with media files #201

Support prompting with media files #201

Comments

longseespace commented Aug 9, 2024

Description of the feature request:

What problem are you trying to solve with this feature?

Any other information you'd like to share?

andrewheard commented Aug 9, 2024