Skip to content

Issues: mozilla/translations

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Linters needs to ignore node_modules bug Something is broken or not correct inference
#932 opened Nov 15, 2024 by gregtatum
Experiment with distillation data inference experiment A training experiment with hypothesis and results
#931 opened Nov 15, 2024 by gregtatum
Use PyMarian for COMET evaluations cost & perf Speeding up and lowering cost for the pipeline
#929 opened Nov 13, 2024 by marco-c
Single-side deduplication quality Improving robustness and translation quality
#928 opened Nov 13, 2024 by ZJaume
Create an analyze-datasets step in the pipeline quality Improving robustness and translation quality
#924 opened Nov 6, 2024 by gregtatum
Investigate merging document sentences in HPLT quality Improving robustness and translation quality
#923 opened Nov 6, 2024 by eu9ene
Reduce monolingual data for en-lt to investigate distillation performance experiment A training experiment with hypothesis and results
#915 opened Oct 31, 2024 by gregtatum
Allow for split vocabs language-coverage Issues related to covering specific languages quality Improving robustness and translation quality
#913 opened Oct 30, 2024 by gregtatum
[meta] Kick off a 2024-H2 training run meta A collection of sub-issues that uses a tasklist
#912 opened Oct 30, 2024 by gregtatum
More corpora specific fixes quality Improving robustness and translation quality
#910 opened Oct 30, 2024 by ZJaume
Limit the amount of data used for distillation cost & perf Speeding up and lowering cost for the pipeline
#905 opened Oct 29, 2024 by gregtatum
Check if issues with short sentences were caused by bicleaner hard rules quality Improving robustness and translation quality
#903 opened Oct 24, 2024 by eu9ene
Investigate word-based filtering for CJK language-coverage Issues related to covering specific languages
#899 opened Oct 23, 2024 by eu9ene
Add support for Chinese Traditional language-coverage Issues related to covering specific languages
#896 opened Oct 22, 2024 by eu9ene
Experiment with student model parameters experiment A training experiment with hypothesis and results quality Improving robustness and translation quality
#894 opened Oct 22, 2024 by gregtatum
[meta] Retrain older models meta A collection of sub-issues that uses a tasklist quality Improving robustness and translation quality
#891 opened Oct 21, 2024 by eu9ene
Consider adding NTREX-128 for evaluation data sources Data importer support evals Issues related to model evaluations
#889 opened Oct 21, 2024 by ZJaume
Vocabulary construction quality Improving robustness and translation quality
#887 opened Oct 21, 2024 by ZJaume
Use HPLT 2.0 data sources Data importer support
#884 opened Oct 17, 2024 by eu9ene
Use our localization data for training data sources Data importer support
#882 opened Oct 16, 2024 by marco-c
Consider statistically translating short sentences from monolingual datasets. data sources Data importer support quality Improving robustness and translation quality
#880 opened Oct 15, 2024 by gregtatum
Consider harvesting short sentences from parallel data data sources Data importer support quality Improving robustness and translation quality
#879 opened Oct 15, 2024 by gregtatum
Consider using data augmentation to synthesize one word translations data sources Data importer support quality Improving robustness and translation quality
#878 opened Oct 15, 2024 by gregtatum
ProTip! What’s not been updated in a month: updated:<2024-10-15.