To run and fine-tune the models you need an Nvidia GPU with more than 6 GB of VRAM and a working CUDA installation.
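If you are unsure what your GPU offers, a quick PyTorch check (assuming torch is already installed in the environment you run it from) reports the device and its VRAM:

```python
import torch

# Sanity check: is a CUDA GPU visible, and does it have enough memory?
if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    vram_gb = props.total_memory / 1024**3
    print(f"GPU: {props.name}, VRAM: {vram_gb:.1f} GB")
    if vram_gb < 6:
        print("Warning: less than 6 GB of VRAM, fine-tuning will likely fail.")
else:
    print("No CUDA GPU detected.")
```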
xTuring, vLLM, and QLoRA are used for fine-tuning and inference.
Install each one into its own virtual environment, as they require different library versions.
```bash
conda create -n xt python=3.10
conda activate xt
pip install xturing==0.1.4
```
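With the xt environment active, fine-tuning and generation follow xTuring's InstructionDataset / BaseModel pattern. A minimal sketch, assuming an Alpaca-style dataset at ./alpaca_data and the llama_lora model key (both are placeholders to adapt to your setup):

```python
from xturing.datasets.instruction_dataset import InstructionDataset
from xturing.models import BaseModel

# Load an Alpaca-style instruction dataset (the path is an example).
dataset = InstructionDataset("./alpaca_data")

# Create a LoRA-wrapped LLaMA model; other model keys also exist.
model = BaseModel.create("llama_lora")

# Fine-tune, then generate from the tuned weights.
model.finetune(dataset=dataset)
output = model.generate(texts=["What is quantization in deep learning?"])
print(output)
```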
```bash
conda create -n vl python=3.10
conda activate vl
pip install vllm==0.1.2
```
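With the vl environment active, inference goes through vLLM's LLM / SamplingParams API. A minimal sketch; the model name is only an example, any Hugging Face causal LM that fits in VRAM will do:

```python
from vllm import LLM, SamplingParams

# The model name is an example; substitute the model you want to serve.
llm = LLM(model="facebook/opt-125m")
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=128)

outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```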
```bash
conda create -n ql python=3.10
conda activate ql
git clone https://github.com/artidoro/qlora
cd qlora
pip install -U -r requirements.txt
```
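The QLoRA repo is driven by its qlora.py script and the shell scripts it ships with; under the hood it loads the base model in 4-bit NF4 and attaches LoRA adapters. A rough sketch of that setup using transformers, bitsandbytes, and peft (the model name and hyperparameters are illustrative, not the repo's exact code):

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "huggyllama/llama-7b"  # example base model

# 4-bit NF4 quantization with double quantization, the scheme QLoRA relies on.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)

# Attach LoRA adapters on top of the frozen 4-bit base model.
model = prepare_model_for_kbit_training(model)
lora_config = LoraConfig(r=64, lora_alpha=16, lora_dropout=0.05, task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```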