[`sentence-transformers`] Add sentencepiece dependency for running models with slow tokenizers only #432

tomaarsen · 2024-06-25T09:41:56Z

Hello!

Pull Request overview

Add sentencepiece dependency for running models with slow tokenizers only

Details

See e.g. https://huggingface.co/rokn/slovlo-v1:

sentencepiece was removed as a required dependency for sentence-transformers, because most models don't require it anymore and it doesn't work well with Python 3.12 on Windows 11. Nowadays, we simply throw an error if it's required but not installed. However, the API Inference should have it installed to ensure that we can also run these models with slow tokenizers. I've verified locally that sentencepiece==0.2.0 works for e.g. that slovlo-v1 model.

Tom Aarsen

…only

tomaarsen · 2024-07-06T09:24:24Z

The same issue exists for some Camembert models, e.g.: https://huggingface.co/Photon-BR/sentence-camembert-large

Tom Aarsen

osanseviero

Makes sense!

Add sentencepiece dependency for running models with slow tokenizers …

272c792

…only

tomaarsen requested a review from Narsil June 25, 2024 09:41

tomaarsen requested a review from osanseviero July 6, 2024 09:22

osanseviero approved these changes Jul 8, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[`sentence-transformers`] Add sentencepiece dependency for running models with slow tokenizers only #432

[`sentence-transformers`] Add sentencepiece dependency for running models with slow tokenizers only #432

tomaarsen commented Jun 25, 2024 •

edited

Loading

tomaarsen commented Jul 6, 2024

osanseviero left a comment

[sentence-transformers] Add sentencepiece dependency for running models with slow tokenizers only #432

Are you sure you want to change the base?

[sentence-transformers] Add sentencepiece dependency for running models with slow tokenizers only #432

Conversation

tomaarsen commented Jun 25, 2024 • edited Loading

Pull Request overview

Details

tomaarsen commented Jul 6, 2024

osanseviero left a comment

Choose a reason for hiding this comment

[`sentence-transformers`] Add sentencepiece dependency for running models with slow tokenizers only #432

[`sentence-transformers`] Add sentencepiece dependency for running models with slow tokenizers only #432

tomaarsen commented Jun 25, 2024 •

edited

Loading