This repository was archived by the owner on Nov 1, 2024. It is now read-only.
This repository was archived by the owner on Nov 1, 2024. It is now read-only.
Possible feature and bugfix contributions from Microsoft research team's fork of Metaseq #726
Open
Description
We are a team at @microsoft Research that has a fork Metaseq repo with these additional features:
- New pipeline task to perform Knowledge Distillation via Log Probabilities using a modified Cross Entropy implementation.
- Improved inference script with added functionality such as ability to output logprobs/logits.
- Improvements to Training Stop Conditions
- Scripts to support Teacher data generation using Open AI Service
- Documentation system using Sphinx
- Documentation of Co-Teaching training process (https://arxiv.org/pdf/2305.02031.pdf)
- Improved evaluation configuration to evaluate with different metrics depending on dataset
- Miscellaneous Bug Fixes
jsonl_dataset.py#_build_index
properly accounts for multi-byte characters.
Questions
- Which of the features above would you be interested in us contributing back to Metaseq?
- Would you be able to offer assistance with the merge process?
- For example, testing and verification of functionality for a feature PR.
We would be happy to answer any questions you have about the above components.