Skip to content
This repository was archived by the owner on Nov 1, 2024. It is now read-only.
This repository was archived by the owner on Nov 1, 2024. It is now read-only.

Possible feature and bugfix contributions from Microsoft research team's fork of Metaseq #726

Open
@mattmazzola

Description

@mattmazzola

We are a team at @microsoft Research that has a fork Metaseq repo with these additional features:

  1. New pipeline task to perform Knowledge Distillation via Log Probabilities using a modified Cross Entropy implementation.
  2. Improved inference script with added functionality such as ability to output logprobs/logits.
  3. Improvements to Training Stop Conditions
  4. Scripts to support Teacher data generation using Open AI Service
  5. Documentation system using Sphinx
    1. Documentation of Co-Teaching training process (https://arxiv.org/pdf/2305.02031.pdf)
  6. Improved evaluation configuration to evaluate with different metrics depending on dataset
  7. Miscellaneous Bug Fixes
    1. jsonl_dataset.py#_build_index properly accounts for multi-byte characters.

Questions

  • Which of the features above would you be interested in us contributing back to Metaseq?
  • Would you be able to offer assistance with the merge process?
    • For example, testing and verification of functionality for a feature PR.

We would be happy to answer any questions you have about the above components.

@tupini07

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions