Skip to content

New Words From Text Files #2

@rhdunn

Description

@rhdunn

This is automatically adding words from a source text (e.g. a Project GutenbergeText) that are not in the dictionary. It includes automatically generating the pronunciations from a source (e.g. a Text-to-Speech language), and reviewing those pronunciations.

This covers the following command-line script using the cainteoir-engine dictionary command:

cat FILENAME | sed -e 's/-/ /g' | dictionary -d DICTIONARY -np

NOTE: The sed -e 's/-/ /g' part ignores hyphenated words. This should be optional, and possibly be more sophisticated.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions