How to convert and use a Hugging Face checkpoint in LitGPT? #1850

Open
manshmaurya opened this issue Dec 3, 2024 · 1 comment
Labels
question Further information is requested

Comments


manshmaurya commented Dec 3, 2024

How can I convert and use a Hugging Face checkpoint in LitGPT?
For example, I want to fine-tune the https://huggingface.co/LingoIITGN/ganga-1b checkpoint for a specific task.

manshmaurya added the "question" label Dec 3, 2024
rasbt (Contributor) commented Jan 7, 2025

Hi there,

If it is a new model that LitGPT does not already support, the general procedure is outlined here (it's a bit involved): https://github.com/Lightning-AI/litgpt/blob/main/tutorials/developer-docs/adding-models.md

However, if the model has the same architecture as a model LitGPT already supports, you can download it and pass the name of the matching supported model:

litgpt download NousResearch/Hermes-2-Pro-Mistral-7B \
 --model_name Mistral-7B-v0.1

That is just an example. If your model is based on Llama 3.2, the following would work:

litgpt download LingoIITGN/ganga-1b \
 --model_name Llama-3.2-1B

The supported models (the valid values for --model_name) can be listed via

litgpt download list
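
Since the question is about fine-tuning, here is a rough sketch of how the download step might be chained with a LoRA fine-tuning run. It rests on several assumptions that are not confirmed in this thread: that ganga-1b really is architecture-compatible with Llama-3.2-1B, that the download lands in LitGPT's default `checkpoints/<repo_id>` directory, and that `litgpt finetune_lora` with the `JSON` data module behaves as described in the LitGPT tutorials (check `litgpt finetune_lora --help` for your installed version). The file `my_task.json` and the output directory are hypothetical placeholders. The script only prints the commands unless `DRY_RUN` is set to 0.

```shell
set -eu

# Assumption: the Hugging Face repo shares its architecture with Llama-3.2-1B.
REPO_ID="LingoIITGN/ganga-1b"
MODEL_NAME="Llama-3.2-1B"

# Assumption: litgpt download writes to checkpoints/<repo_id> by default.
CKPT_DIR="checkpoints/${REPO_ID}"

# 1 = only print the commands, 0 = actually run them.
DRY_RUN=1

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Step 1: download and convert the checkpoint, reusing an existing model config.
run litgpt download "$REPO_ID" --model_name "$MODEL_NAME"

# Step 2: LoRA fine-tuning on a custom dataset.
# my_task.json and out/ganga-1b-lora are hypothetical names for illustration.
run litgpt finetune_lora "$CKPT_DIR" \
  --data JSON \
  --data.json_path my_task.json \
  --data.val_split_fraction 0.1 \
  --out_dir out/ganga-1b-lora
```

If the download step fails with a weight or shape mismatch, the architectures probably differ, and the adding-models guide linked above is the path to follow instead.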
