[FEAT]: Enhanced OpenAI Compatible TTS Functionality #2879

lewismacnow · 2024-12-19T12:26:18Z

What would you like to see?

Currently, the TTS options are limited especially for OpenAI Compatible endpoints.

The model defaults to tts-1 and there is no option to change this. Additionally there is no response splitting, the result is an unnatural conversation flow.

This is compared to Open-WebUI where these options are available and the result when using the same endpoint offers a superior TTS experience. In fact the 'call' feature paired with matatonic/openedai-speech's repo and a GPU is a fantastic.

This feature suggestion proposes adding a method to control the model used for "openAiGeneric" TTS (perhaps still default to tts-1?)
And suggests replicating the 'splitting' feature offered by Open-WebUI.

lewismacnow added enhancement New feature or request feature request labels Dec 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEAT]: Enhanced OpenAI Compatible TTS Functionality #2879

[FEAT]: Enhanced OpenAI Compatible TTS Functionality #2879

lewismacnow commented Dec 19, 2024 •

edited

Loading

[FEAT]: Enhanced OpenAI Compatible TTS Functionality #2879

[FEAT]: Enhanced OpenAI Compatible TTS Functionality #2879

Comments

lewismacnow commented Dec 19, 2024 • edited Loading

What would you like to see?

lewismacnow commented Dec 19, 2024 •

edited

Loading