[Feature]: MaskGCT long-form audio #290

fakerybakery · 2024-10-23T01:36:56Z

Hi,
Thanks for releasing MaskGCT! Are there any plans to support long-form speech synthesis besides using chunking?
Thanks!

HeCheng0625 · 2024-10-25T11:59:03Z

Hi, thank you for your attention. In the future, we will expand the training data to the minute level and use a codec with a higher compression rate to generate longer audio.

JonathanFly · 2024-11-01T03:04:40Z

Hi, thank you for your attention. In the future, we will expand the training data to the minute level and use a codec with a higher compression rate to generate longer audio.

In the current version, what was the duration of the training data "chunk"?

fakerybakery added the enhancement New feature or request label Oct 23, 2024

Tybost mentioned this issue Nov 2, 2024

[Help]: Gradio demo isn't working correctly on either Windows or Ubuntu. I'm experiencing the same issue on both operating systems. #327

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: MaskGCT long-form audio #290

[Feature]: MaskGCT long-form audio #290

fakerybakery commented Oct 23, 2024

HeCheng0625 commented Oct 25, 2024

JonathanFly commented Nov 1, 2024

[Feature]: MaskGCT long-form audio #290

[Feature]: MaskGCT long-form audio #290

Comments

fakerybakery commented Oct 23, 2024

HeCheng0625 commented Oct 25, 2024

JonathanFly commented Nov 1, 2024