Replies: 1 comment 2 replies
-
Thanks for the comment. I thought the |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Thanks for a great review and clear code. I have a question: Shouldn't the training in BPE be applied at the word level? Your implementation can generate merged tokens that include multiple words.
Beta Was this translation helpful? Give feedback.
All reactions