Lightning Accelerate aims to provide a simple, easy-to-use framework for training deep learning models on GPUs, TPUs, etc. with 🤗 Huggingface's Accelerate and the style of ⚡️ PyTorch Lightning.
To install Lightning Accelerate, run this command:
```bash
pip install git+https://github.com/hoang1007/lightning-accelerate.git
```

Features:

- Support training with multiple GPUs, TPUs, etc.
- Support finetuning models efficiently with LoRA (see the sketch after this list)
- Support several optimization techniques such as mixed precision, DeepSpeed, bitsandbytes, etc.
- Support experiment tracking with Wandb and TensorBoard
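For the LoRA feature above, here is a minimal sketch of what LoRA finetuning typically looks like, using 🤗 PEFT directly. This framework's own LoRA integration may expose a different API, so treat the names below (`bert-base-uncased`, `target_modules`) as illustrative assumptions:

```python
from transformers import AutoModelForSequenceClassification
from peft import LoraConfig, get_peft_model

# Hypothetical example: wrap a pretrained model with LoRA adapters via 🤗 PEFT.
model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")

lora_config = LoraConfig(
    r=8,                    # rank of the low-rank update matrices
    lora_alpha=16,          # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["query", "value"],  # attention projections to adapt (model-specific)
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA weights remain trainable
```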
To train a model, you need to define a `TrainingModule` and a `DataModule`. Here is a simple example of training a digit classifier on the MNIST dataset:
```python
import torch.nn as nn
from torch.utils.data import Dataset
from torchvision import transforms
from torchvision.datasets import MNIST

# Adjust this import to match the package's actual module path.
from lightning_accelerate import (
    TrainingModule, DataModule, TrainingArguments, Trainer
)

# -------------------
# Step 1: Define a TrainingModule.
# This module contains the model plus the training and evaluation logic,
# so the `Trainer` can run training easily later.
# -------------------
class MnistTrainingModule(TrainingModule):
    def __init__(self):
        super().__init__()
        self.model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))

    def training_step(self, batch, batch_idx: int, optimizer_idx: int):
        x, y = batch
        logits = self.model(x)
        loss = nn.functional.cross_entropy(logits, y)
        return loss

    def get_optim_params(self):
        return self.model.parameters()

# -------------------
# Step 2: Define a DataModule. This module contains the data preparation
# logic (downloading, preprocessing, etc.) and feeds data to the
# `TrainingModule` for training and evaluation.
# -------------------
class MnistDataModule(DataModule):
    def prepare_data(self):
        # Download the data here so it is not downloaded again in every process.
        MNIST("root", train=True, download=True)
        MNIST("root", train=False, download=True)

    def get_training_dataset(self) -> Dataset:
        return MNIST(
            "root",
            train=True,
            transform=transforms.Compose([
                transforms.RandomAffine(15), transforms.ToTensor()
            ]),
        )

    def get_validation_dataset(self) -> Dataset:
        return MNIST("root", train=False, transform=transforms.ToTensor())

# -------------------
# Step 3: Configure parameters with `TrainingArguments` and start training!
# -------------------
args = TrainingArguments("mnist", train_batch_size=32, num_epochs=10)
training_module = MnistTrainingModule()
data_module = MnistDataModule()

Trainer(
    training_module=training_module,
    training_args=args,
    data_module=data_module,
).fit()
```

You can accelerate the training process with several techniques such as mixed precision, DeepSpeed, etc., which are supported by Accelerate. For details, please refer to Accelerate's documentation.
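If you have not set up Accelerate on your machine yet, you can create a configuration interactively with Accelerate's standard setup command (this is the stock 🤗 Accelerate CLI, not something specific to this framework):

```bash
accelerate config
```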
For example, to train your models on multiple GPUs, you can run:

```bash
accelerate launch --multi_gpu my_script.py
```
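In the same spirit, mixed precision and DeepSpeed can be enabled through Accelerate's launcher. The flags below are stock 🤗 Accelerate CLI options, shown here as a sketch:

```bash
# Train with fp16 mixed precision
accelerate launch --mixed_precision fp16 my_script.py

# Train with DeepSpeed (requires the deepspeed package)
accelerate launch --use_deepspeed my_script.py
```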
To evaluate a pretrained model, you can use the `Trainer.evaluate` method:

```python
args = TrainingArguments(
    "mnist",
    eval_batch_size=32,
    # Set `resume_from_checkpoint` to the path of the checkpoint you want
    # to evaluate, or to `latest` to evaluate the latest checkpoint.
    resume_from_checkpoint="latest",
)
training_module = MnistTrainingModule()
data_module = MnistDataModule()

# The Trainer automatically loads the checkpoint and evaluates the model.
Trainer(
    training_module=training_module,
    training_args=args,
    data_module=data_module,
).evaluate()
```

I built the framework on top of Huggingface's Accelerate with minimal requirements while keeping the code style as close as possible to PyTorch Lightning 😊.
I am an inexperienced developer, so I am very happy to receive contributions that improve the framework's code quality and features. Please feel free to open an issue or pull request 🥰.
Special thanks to Huggingface's Accelerate and PyTorch Lightning for providing great frameworks for training deep learning models.