Skip to content

Update default voices for Gemini and Geminimulti TTS configurations #272

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

gianpaj
Copy link

@gianpaj gianpaj commented Jun 26, 2025

@aidivas
Copy link

aidivas commented Jul 5, 2025

Hey tested this and it failed for geminimulit and gemini as tts - using a simple shortform podast command both in CLI and API

What commands are you using to create the podcast
Do you have multi-speaker voice allowed already?

Tested these config changes and using geminimulti as the tts on the CLI

{
"detail": "Failed to generate audio: 403 Multi-speaker voices are only available to allowlisted projects."
}

Testing gemini with the new 2.5 preview TTS in conversation_config.yaml and 2.5 pro in config.yaml

ERROR:podcastfy:Error generating podcast: Failed to generate audio: 400 Either input.text or input.ssml is longer than the limit of 5000 bytes. This limit is different from quotas. To fix, reduce the byte length of the characters in this request, or consider using the Long Audio API: https://cloud.google.com/text-to-speech/docs/create-audio-text-long-audio-synthesis.⁠

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants