Skip to content
This repository has been archived by the owner on Mar 19, 2024. It is now read-only.

Improved LangID model? #1345

Open
loretoparisi opened this issue Aug 18, 2023 · 0 comments
Open

Improved LangID model? #1345

loretoparisi opened this issue Aug 18, 2023 · 0 comments

Comments

@loretoparisi
Copy link

In the "not too much old" 2020 post related to M2M-100 MMT called "The first AI model that translates 100 languages without relying on English data" it has been allegedly reported that

As part of this effort, we created a new LASER 2.0 and improved fastText language identification, which improves the quality of mining and includes open sourced training and evaluation scripts

While LASER 2.0 (93 languages) and even LASER 3.0 have been released, which includes a new Encoder supporting over 200 languages, I'm not aware of the release of a newer version of the 176 languages FastText LangID model here.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant