This repository was archived by the owner on Nov 1, 2024. It is now read-only.
Question about continuing training from a checkpoint #757
Open
Description
❓ Questions and Help
Before asking:
- search the issues.
- search the docs.
What is your question?
- Is it possible to share the checkpoints of the OPT-1.3b model from around training steps 10K - 20K?
- Also, if we want to continue training from those checkpoints or from the official checkpoint, is it possible to get all of the training data that Meta used for the OPT models?
Code
What have you tried?
What's your environment?
- metaseq Version (e.g., 1.0 or master):
- PyTorch Version (e.g., 1.0):
- OS (e.g., Linux):
- How you installed metaseq (pip, source):
- Build command you used (if compiling from source):
- Python version:
- CUDA/cuDNN version:
- GPU models and configuration:
- Any other relevant information: