You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
vocabulary.txt contains the words in the dataset in order to use them for a language model(n-grams for example). the file is created in the utils.py. but if you have enough data the neural network also can learn the language models end-to-end
What does
vocabulary.txt
contain? Do they contain encoded words? Do they contain encoded characters? How was that generated?The text was updated successfully, but these errors were encountered: