feat(server): add OpenAI-compatible endpoint #421
Conversation
- Add OpenAI-compatible v1/audio/speech endpoint to server.py (an example request is sketched after this list)
- Add lowvram command line argument to server.py (if running on CUDA, switches the model to CPU when idle)
- Allow the OpenAI server to use built-in speaker voices, a cloning wav, or a directory of wavs as the voice
- Allow language_id as a server.py command line argument as an alternative to the request parameter
- Split long input text using the built-in segmenter
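For orientation, a request against the new endpoint could look roughly like the sketch below; the port (coqui's usual 5002 default) and the voice value are assumptions for illustration, not something this PR pins down.

```python
import requests

# Rough sketch of a request to the new OpenAI-compatible endpoint.
# The port and the voice value are illustrative assumptions.
resp = requests.post(
    "http://localhost:5002/v1/audio/speech",
    json={
        "model": "tts-1",  # part of the OpenAI spec; the local server presumably uses its loaded model
        "input": "Hello from the OpenAI-compatible endpoint.",
        "voice": "Claribel Dervla",  # built-in speaker name, a wav to clone, or a directory of wavs
    },
)
resp.raise_for_status()
with open("speech.wav", "wb") as f:
    f.write(resp.content)
```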
Cool, thank you! Can you share a link to the relevant OpenAI API spec so I can check everything works as expected? Could you also add some information and examples for this to https://github.com/idiap/coqui-ai-TTS/blob/dev/docs/source/server.md?
Yes, here you go: https://platform.openai.com/docs/guides/text-to-speech. I'm going to try to do extensive testing, though I can confirm the code already works with a project that uses the OpenAI TTS spec extensively (WingmanAI). I think the most testing is needed for models other than xtts2, as I have no idea how to use the other models coqui-tts supports.
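For anyone checking against that spec, a client written for the official API can be pointed at the local server roughly like this; the base_url port and the dummy api_key are assumptions, and the voice value is just one of OpenAI's built-in names.

```python
from openai import OpenAI

# Sketch of a spec-compliant client aimed at the local server; the port and the
# placeholder api_key are assumptions (the local server would not check the key).
client = OpenAI(base_url="http://localhost:5002/v1", api_key="not-needed")

with client.audio.speech.with_streaming_response.create(
    model="tts-1",
    voice="alloy",
    input="Testing the OpenAI-compatible text-to-speech endpoint.",
) as response:
    response.stream_to_file("speech.wav")
```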
- Eliminate low VRAM mode (this results in approx. 1.5 GB more VRAM use, which creeps up over time, versus the low VRAM variant)
- Eliminate the models and voices endpoints, since they are not currently part of the OpenAI spec
- Eliminate the split-sentences code, relying on coqui-tts' already-used split_sentences functionality in the API (a sketch follows below)
Since low VRAM mode is no longer used, the gc import is now unnecessary.
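To illustrate what the endpoint now relies on instead of custom splitting, here is a minimal sketch using the Python API's split_sentences flag; the model name and inputs are examples, not anything fixed by this PR.

```python
from TTS.api import TTS

# Minimal sketch: rely on the API's built-in sentence splitting instead of
# custom server-side splitting code. Model name and inputs are examples.
tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2")
tts.tts_to_file(
    text="A long input text that would otherwise exceed the model's per-call limit.",
    speaker="Claribel Dervla",
    language="en",
    split_sentences=True,  # let coqui-tts segment the text into sentences
    file_path="out.wav",
)
```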
- Add usage examples to server.md
- Fix a bug with an elif statement in the last changes
- Default to using speaker_idx, if one was specified at server launch, when no voice parameter is passed to the OpenAI server (roughly the fallback sketched below)
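The fallback behaves roughly like this sketch; the function and argument names are hypothetical and do not mirror server.py.

```python
# Hypothetical sketch of the voice fallback described above; the names here are
# made up for illustration and are not the ones used in server.py.
def resolve_voice(request_voice: str | None, launch_speaker_idx: str | None) -> str | None:
    """Prefer the request's voice; otherwise fall back to --speaker_idx from launch."""
    if request_voice:  # voice supplied in the OpenAI-style request body
        return request_voice
    return launch_speaker_idx  # may be None if neither was provided
```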
OK, I think this is ready for your testing.
Thank you again! I just fixed up a few small things and checked that it works as expected. Feel free to open any follow-up PRs/issues if you spot anything else!
Thank you for merging! I'll try to get to that underlying synthesizer issue in the coming days, to make sure the proper segmenter is always used for each language.
- Add OpenAI-compatible v1/audio/speech endpoint to server.py
- Add lowvram command line argument to server.py (if running on CUDA, switches the model to CPU when idle; sketched after this list)
- Allow the OpenAI server to use built-in speaker voices, a cloning wav, or a directory of wavs as the voice
- Allow language_id as a server.py command line argument as an alternative to the request parameter
- Split long input text using the built-in segmenter (e.g. to get around hard input limits in xtts2)
- Tested on Windows only (I do not have Mac or Linux)
- Tested with xtts2, but the changes in theory should not impact other models
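For context on the lowvram option (later dropped in this PR), the idea is roughly the sketch below: keep the model on the CPU while idle and move it to CUDA only for synthesis. The function and the .to() call are assumptions for illustration, not the actual server.py code.

```python
import torch

# Hedged sketch of the lowvram idea: park the model on the CPU while idle and
# move it to CUDA only for the duration of a request. Names and the .to() call
# are illustrative assumptions, not the exact code from this PR.
def synthesize_lowvram(synthesizer, text, **kwargs):
    if torch.cuda.is_available():
        synthesizer.to("cuda")  # bring the model onto the GPU for this request
    try:
        return synthesizer.tts(text, **kwargs)
    finally:
        if torch.cuda.is_available():
            synthesizer.to("cpu")  # free VRAM while idle
            torch.cuda.empty_cache()
```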