GigaSpeech

GigaSpeech is an evolving, multi-domain English speech recognition corpus with 10,000 hours of high-quality labeled audio collected from audiobooks, podcasts, and YouTube. It covers both read and spontaneous speaking styles and a variety of topics, such as arts, science, and sports. More details can be found at https://github.com/SpeechColab/GigaSpeech

Download

Apply for download credentials and download the dataset by following the instructions at https://github.com/SpeechColab/GigaSpeech#download. Then create a symlink to the dataset directory:

ln -sfv /path/to/GigaSpeech download/GigaSpeech
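
The `ln` command above only succeeds if the `download/` directory already exists, so run `mkdir -p download` first if needed. As a minimal sketch for confirming that the link resolves before starting data preparation, the hypothetical helper below (`check_dataset_link` is an illustrative name, not part of this recipe) checks that a path is a symlink pointing to an existing directory:

```shell
# Hypothetical helper, not part of the recipe: verify that a path is a
# symlink whose target is an existing directory.
check_dataset_link() {
  link="$1"
  if [ -L "$link" ] && [ -d "$link" ]; then
    # readlink prints the target exactly as it was stored in the link
    echo "ok: $link -> $(readlink "$link")"
  else
    echo "missing or broken: $link" >&2
    return 1
  fi
}
```

Running `check_dataset_link download/GigaSpeech` from the recipe directory should print the link target, or return a non-zero status if the link is absent or dangling.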

Performance Record

Recipe	Dev WER (%)	Test WER (%)
`conformer_ctc`	10.47	10.58
`pruned_transducer_stateless2`	10.40	10.51

See RESULTS.md for details.
