Skip to content

SentencePiece model training #442

Answered by adarob
versae asked this question in Q&A
Apr 21, 2022 · 2 comments · 2 replies
Discussion options

You must be logged in to vote

These question really apply to the original T5 models (https://arxiv.org/abs/1910.10683), as you are not required to use any specific vocabulary with T5X.

  1. <s> was not used in the original T5 paper.
  2. Neither. See https://blog.floydhub.com/tokenization-nlp/#sentencepiece for details.
  3. Yes.
  4. This is recommended for efficiency reasons.

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
1 reply
@versae
Comment options

Answer selected by versae
Comment options

You must be logged in to vote
1 reply
@versae
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants