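Looking at the data processing script aishell.py, it does not appear to split long utterances into shorter segments with their own timestamps. Instead, each sample gets a single timestamp spanning from zero to the end of the audio. Is that why the Belle-whisper-large-v2-zh model does not segment sentences well, because the training data was never split into sentences?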
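Data processing: the training data is assumed by default to consist of short utterances (mostly under 10 seconds), so no fine-grained splitting is done. During recognition, sentence segmentation relies mainly on the VAD module. Because fine-tuning did not include timestamp training, the accuracy of the timestamps in the recognition output may be affected. If you need accurate timestamps, you can add timestamp training to the fine-tuning process.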
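For anyone who wants to try timestamp fine-tuning, here is a minimal sketch of what a segment-level training target could look like. It assumes you already have per-segment boundaries for each utterance (for example from manual annotation or forced alignment), which aishell.py does not produce. The helper names `timestamp_token` and `build_target` are hypothetical and not part of this repository. The sketch relies only on Whisper's timestamp token format: tokens of the form `<|t.tt|>` in 0.02-second steps within a 30-second window.

```python
from typing import List, Tuple

TIME_PRECISION = 0.02  # Whisper timestamp tokens advance in 20 ms steps
MAX_TIME = 30.0        # Whisper decodes audio in 30-second windows


def timestamp_token(seconds: float) -> str:
    """Render a time offset as a Whisper-style timestamp token, e.g. <|1.50|>."""
    # Clip into the valid window, then snap to the 20 ms grid.
    clipped = min(max(seconds, 0.0), MAX_TIME)
    quantized = round(clipped / TIME_PRECISION) * TIME_PRECISION
    return f"<|{quantized:.2f}|>"


def build_target(segments: List[Tuple[float, float, str]]) -> str:
    """Join (start, end, text) segments into one timestamped target string.

    Hypothetical helper: the segment boundaries must come from your own
    annotation or alignment step.
    """
    return "".join(
        f"{timestamp_token(start)}{text}{timestamp_token(end)}"
        for start, end, text in segments
    )


if __name__ == "__main__":
    # Two short segments inside one utterance (made-up boundaries).
    print(build_target([(0.0, 1.5, "你好"), (1.5, 4.0, "世界")]))
```

With a single segment running from zero to the end of the audio, this reduces to the whole-utterance timestamp described in the question. Passing several segments gives the model per-sentence boundaries to learn from.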
Hello, and thank you for your excellent work. How can I get the specific timestamps for each sentence at test time? Thanks!
For timestamps, you can refer to faster whisper: https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file#word-level-timestamps
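A rough sketch of how that might look with faster-whisper, following the `transcribe` usage from its README. It assumes the `faster-whisper` package is installed and that `model_path` points to a model in CTranslate2 format. Whether and how the Belle checkpoint has been converted is not covered in this thread, so treat that path as a placeholder. The helpers `format_segment` and `transcribe_with_timestamps` are illustrative names only.

```python
def format_segment(start: float, end: float, text: str) -> str:
    """Format one span as '[start -> end] text' with times in seconds."""
    return f"[{start:.2f}s -> {end:.2f}s] {text.strip()}"


def transcribe_with_timestamps(audio_path: str, model_path: str) -> list:
    """Return one line per segment, each followed by indented per-word lines."""
    # Imported lazily so format_segment can be used without the package.
    from faster_whisper import WhisperModel

    # Assumption: model_path is a CTranslate2-converted model directory.
    model = WhisperModel(model_path)
    segments, _info = model.transcribe(
        audio_path,
        vad_filter=True,       # use VAD to skip non-speech audio
        word_timestamps=True,  # attach per-word start/end times
    )

    lines = []
    for segment in segments:  # a generator: decoding runs as we iterate
        lines.append(format_segment(segment.start, segment.end, segment.text))
        for word in segment.words or []:
            lines.append("  " + format_segment(word.start, word.end, word.word))
    return lines
```

Each segment's `start` and `end` give sentence-level times, while `segment.words` gives the word-level times described at the link above.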