13.3 Hours - Chinese Mandarin Synthesis Corpus-Female, Emotional. The corpus is recorded by a native Chinese speaker reading emotional text, with balanced coverage of syllables, phonemes, and tones. A professional phonetician participates in the annotation. It is designed to meet the research and development needs of speech synthesis.
For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1141?source=Github
48,000 Hz, 16-bit, uncompressed WAV, mono channel;
professional recording studio;
six emotions (happiness, anger, sadness, surprise, fear, disgust);
female, 20-30 years old, soft and friendly voice;
microphone;
Mandarin;
word and pinyin transcription, prosodic boundary annotation;
speech synthesis.
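
The audio format listed above (48,000 Hz, 16-bit, uncompressed WAV, mono) can be checked on a downloaded file before it goes into a synthesis pipeline. The following is a minimal sketch using only the Python standard library. The function name `check_wav_format` and the file name `sample.wav` are hypothetical and not part of the dataset; the actual file layout of the corpus is not described here.

```python
import wave

# Expected values, taken from the format specification in this description.
EXPECTED_SAMPLE_RATE = 48000  # Hz
EXPECTED_SAMPLE_WIDTH = 2     # bytes per sample, i.e. 16-bit
EXPECTED_CHANNELS = 1         # mono


def check_wav_format(path):
    """Return a list of mismatches against the expected format.

    An empty list means the file matches. The name and return shape are
    illustrative assumptions, not an API shipped with the corpus.
    """
    problems = []
    try:
        # The wave module only reads uncompressed PCM; other encodings
        # raise wave.Error when the header is parsed.
        with wave.open(path, "rb") as wav:
            if wav.getframerate() != EXPECTED_SAMPLE_RATE:
                problems.append(
                    f"sample rate is {wav.getframerate()} Hz, "
                    f"expected {EXPECTED_SAMPLE_RATE} Hz"
                )
            if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
                problems.append(
                    f"sample width is {wav.getsampwidth() * 8} bit, "
                    f"expected {EXPECTED_SAMPLE_WIDTH * 8} bit"
                )
            if wav.getnchannels() != EXPECTED_CHANNELS:
                problems.append(
                    f"channel count is {wav.getnchannels()}, "
                    f"expected {EXPECTED_CHANNELS}"
                )
    except wave.Error as exc:
        problems.append(f"not a readable uncompressed WAV file: {exc}")
    return problems


if __name__ == "__main__":
    import sys

    # Usage: python check_wav.py sample.wav
    for wav_path in sys.argv[1:]:
        issues = check_wav_format(wav_path)
        print(wav_path, "OK" if not issues else "; ".join(issues))
```

Returning a list of mismatches rather than a single boolean makes it easier to see which property of a file deviates when checking many recordings at once.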
Commercial License