Mini_LLama is a lightweight implementation of the LLama (Large Language Model) architecture, optimized for efficient training and inference on limited hardware. This project is designed for research and experimentation in Natural Language Processing (NLP) and deep learning.
- Efficient transformer-based architecture (see the sketch after this list)
- Customizable model size and training configurations
- Support for fine-tuning on custom datasets
- Lightweight inference for deployment on resource-constrained devices
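For context, a LLaMA-style model is built by stacking decoder blocks that combine RMSNorm pre-normalization, multi-head causal self-attention, and a SwiGLU feed-forward network. The sketch below is a minimal PyTorch illustration of one such block, not the repository's actual modules; the names `DecoderBlock`, `dim`, and `n_heads` are placeholders, and rotary position embeddings and KV caching are omitted for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RMSNorm(nn.Module):
    """Root-mean-square layer normalization, as used in LLaMA-style models."""

    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x * torch.rsqrt(x.pow(2).mean(-1, keepdim=True) + self.eps) * self.weight

class DecoderBlock(nn.Module):
    """One pre-norm decoder block: causal self-attention + SwiGLU feed-forward."""

    def __init__(self, dim: int = 512, n_heads: int = 8, hidden_dim: int = 1376):
        super().__init__()
        self.n_heads = n_heads
        self.attn_norm = RMSNorm(dim)
        self.qkv = nn.Linear(dim, 3 * dim, bias=False)
        self.attn_out = nn.Linear(dim, dim, bias=False)
        self.ffn_norm = RMSNorm(dim)
        self.w_gate = nn.Linear(dim, hidden_dim, bias=False)
        self.w_up = nn.Linear(dim, hidden_dim, bias=False)
        self.w_down = nn.Linear(hidden_dim, dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, d = x.shape
        # Causal multi-head self-attention with a pre-norm residual connection.
        h = self.attn_norm(x)
        q, k, v = self.qkv(h).chunk(3, dim=-1)
        q, k, v = (z.view(b, t, self.n_heads, d // self.n_heads).transpose(1, 2)
                   for z in (q, k, v))
        attn = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        x = x + self.attn_out(attn.transpose(1, 2).reshape(b, t, d))
        # SwiGLU feed-forward with a second residual connection.
        h = self.ffn_norm(x)
        return x + self.w_down(F.silu(self.w_gate(h)) * self.w_up(h))
```

Stacking several such blocks on top of a token embedding, followed by a final norm and an output projection to the vocabulary, yields the usual decoder-only language model.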
```bash
# Clone the repository
git clone https://github.com/dzungnguyen21/Mini_LLama.git
cd Mini_LLama

# Create a virtual environment (optional but recommended)
python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```

To train the model:

```bash
python train.py --config configs/train_config.json
```

To run inference with a trained checkpoint:

```bash
python infer.py --model checkpoint/model.pth --text "Your input text here"
```

Model and training parameters can be customized in the `configs/` directory. Example:
```json
{
  "model_size": "small",
  "learning_rate": 0.001,
  "batch_size": 32,
  "epochs": 10
}
```

The dataset should be in JSON or CSV format and placed in the `data/` directory. Modify `data_loader.py` to preprocess your specific dataset.
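As a starting point, here is a minimal sketch of what such a preprocessing step could look like for a JSON Lines file with one `"text"` field per record. This is a hypothetical example, not the actual contents of `data_loader.py`; the path `data/train.jsonl`, the `"text"` field, and the `tokenizer` callable are all assumptions.

```python
import json
from pathlib import Path

import torch
from torch.utils.data import Dataset

class TextDataset(Dataset):
    """Hypothetical dataset: one JSON object with a "text" field per line."""

    def __init__(self, path: str, tokenizer, max_length: int = 512):
        self.tokenizer = tokenizer
        self.max_length = max_length
        self.samples = [
            json.loads(line)["text"]
            for line in Path(path).read_text(encoding="utf-8").splitlines()
            if line.strip()
        ]

    def __len__(self) -> int:
        return len(self.samples)

    def __getitem__(self, idx: int) -> torch.Tensor:
        # `tokenizer` is assumed to map a string to a list of token ids.
        ids = self.tokenizer(self.samples[idx])[: self.max_length]
        return torch.tensor(ids, dtype=torch.long)

# Example usage (hypothetical tokenizer and path):
# dataset = TextDataset("data/train.jsonl", tokenizer=my_tokenizer)
# loader = torch.utils.data.DataLoader(dataset, batch_size=32, shuffle=True)
```

Whatever format you choose, the goal is the same: have `data_loader.py` yield token ID tensors that the training loop can batch.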
Contributions are welcome! Please follow these steps:
- Fork the repository.
- Create a new branch (e.g., `feature-branch`).
- Commit your changes.
- Push to your fork and create a pull request.
For questions and collaborations, feel free to reach out via GitHub issues or email.