Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEAT]: Enhanced OpenAI Compatible TTS Functionality #2879

Open
lewismacnow opened this issue Dec 19, 2024 · 0 comments
Open

[FEAT]: Enhanced OpenAI Compatible TTS Functionality #2879

lewismacnow opened this issue Dec 19, 2024 · 0 comments
Labels
enhancement New feature or request feature request

Comments

@lewismacnow
Copy link
Contributor

lewismacnow commented Dec 19, 2024

What would you like to see?

Currently, the TTS options are limited especially for OpenAI Compatible endpoints.

The model defaults to tts-1 and there is no option to change this. Additionally there is no response splitting, the result is an unnatural conversation flow.

This is compared to Open-WebUI where these options are available and the result when using the same endpoint offers a superior TTS experience. In fact the 'call' feature paired with matatonic/openedai-speech's repo and a GPU is a fantastic.

image

This feature suggestion proposes adding a method to control the model used for "openAiGeneric" TTS (perhaps still default to tts-1?)
And suggests replicating the 'splitting' feature offered by Open-WebUI.

@lewismacnow lewismacnow added enhancement New feature or request feature request labels Dec 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request feature request
Projects
None yet
Development

No branches or pull requests

1 participant