Skip to content
This repository was archived by the owner on Nov 1, 2024. It is now read-only.
This repository was archived by the owner on Nov 1, 2024. It is now read-only.

QA about continue training on checkpoint #757

Open
@robinzixuan

Description

@robinzixuan

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

  1. Is it possible to help the checkpoints of OPT-1.3b model around 10K - 20K training step?
  2. By the way, if we want continue training on those checkpoint/ official checkpoint, is it possible to get the all training dataset meta used in OPT models?

Code

What have you tried?

What's your environment?

  • metaseq Version (e.g., 1.0 or master):
  • PyTorch Version (e.g., 1.0)
  • OS (e.g., Linux):
  • How you installed metaseq (pip, source):
  • Build command you used (if compiling from source):
  • Python version:
  • CUDA/cuDNN version:
  • GPU models and configuration:
  • Any other relevant information:

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions