Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: MaskGCT long-form audio #290

Open
fakerybakery opened this issue Oct 23, 2024 · 2 comments
Open

[Feature]: MaskGCT long-form audio #290

fakerybakery opened this issue Oct 23, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@fakerybakery
Copy link

Hi,
Thanks for releasing MaskGCT! Are there any plans to support long-form speech synthesis besides using chunking?
Thanks!

@fakerybakery fakerybakery added the enhancement New feature or request label Oct 23, 2024
@HeCheng0625
Copy link
Collaborator

Hi, thank you for your attention. In the future, we will expand the training data to the minute level and use a codec with a higher compression rate to generate longer audio.

@JonathanFly
Copy link

Hi, thank you for your attention. In the future, we will expand the training data to the minute level and use a codec with a higher compression rate to generate longer audio.

In the current version, what was the duration of the training data "chunk"?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants