Issues applying fine-tuning to existing public model #2121
-
(Question orignally asked on gitter, putting it here together with answer for the sake of future documentation) I'm trying to fine-tune an existing STT model to my own voice for better recognition but I'm struggling to "import" it. The model I'm trying to fine tune is As described in the docs, to fine-tune an existing model one simply has to python -m coqui_stt_training.train --auto_input_dataset /mnt/mydata/data.csv --checkpoint_dir /mnt/mydata/German\ STT\ v0.9.0\ \(Aashish\ Agarwal\)/ So running this inside the Docker container for training I get
where
are the lines of interest. So as far as I understand, it fails to detect the existing model, as it was exported and as such no longer contains any checkpoint information.
vs starting from scratch and running for 1-2 epochs
So with all that said, how would I go about fine-tuning the TL;DR: How do I fine-tune an exported model that contains no checkpoint information? Coqui STT fails to detect it. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
(Answer from You can't. For the Coqui English models we release the checkpoints in the STT release page on GitHub, and many model creators who contributed their models to the Model Zoo also have checkpoints available somewhere, but unfortunately we don't host the checkpoints or link to them consistently yet. In this case the best bet is to reach out to Aashish (the author of the model described above). I think the German STT model might have its own repo as well: https://github.com/AASHISHAG/deepspeech-german Looks like there's a link to the checkpoint on GDrive there. |
Beta Was this translation helpful? Give feedback.
(Answer from
reuben
on gitter, slightly modified by me)You can't. For the Coqui English models we release the checkpoints in the STT release page on GitHub, and many model creators who contributed their models to the Model Zoo also have checkpoints available somewhere, but unfortunately we don't host the checkpoints or link to them consistently yet.
In this case the best bet is to reach out to Aashish (the author of the model described above). I think the German STT model might have its own repo as well: https://github.com/AASHISHAG/deepspeech-german
Looks like there's a link to the checkpoint on GDrive there.