forked from rhdunn/tts-dev-studio
-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Milestone
Description
This is automatically adding words from a source text (e.g. a Project GutenbergeText) that are not in the dictionary. It includes automatically generating the pronunciations from a source (e.g. a Text-to-Speech language), and reviewing those pronunciations.
This covers the following command-line script using the cainteoir-engine dictionary command:
cat FILENAME | sed -e 's/-/ /g' | dictionary -d DICTIONARY -np
NOTE: The sed -e 's/-/ /g' part ignores hyphenated words. This should be optional, and possibly be more sophisticated.