
dLLM-Factory

dLLM-Factory is a robust and comprehensive project centered on Diffusion Large Language Models (dLLMs). It offers a complete suite of implementation code for essential modules, including Pre-training, Supervised Fine-tuning (SFT), Reinforcement Learning (RL), and Inference.


📖 Project Introduction

This project, developed by SJTU and Shanghai AI Lab, aims to provide researchers and developers with an efficient, user-friendly platform for training and deploying dLLMs. It covers the full workflow, from data preprocessing and model training to inference and deployment, and its well-organized structure facilitates secondary development and customization. Dream and LLaDA are already supported.

✨ Key Features

  • 🧠 Pre-training: Train foundational models from scratch.
    • Supported datasets: SlimPajama

  • 🔧 Supervised Fine-tuning (SFT): Adapt pre-trained models to specific tasks.
    • Supported datasets: simplescaling-s1K

  • 🤖 Reinforcement Learning (RL): Optimize model performance using feedback.
    • Supported methods: diff-grpo

  • 🚀 Inference: Efficiently run trained models for real-world applications.
    • Supported accelerations: dLLM-cache (a conceptual sketch of the caching idea follows the evaluation table below)

  • 📈 Evaluation: Thorough assessment across diverse benchmarks.

    | Benchmark    | LLaDA Support | Dream Support |
    | ------------ | ------------- | ------------- |
    | BBH          | ✅            | ✅            |
    | GPQA         | ✅            | ✅            |
    | GSM8K        | ✅            | ✅            |
    | HumanEval    | ✅            | ✅            |
    | LongBench    | ✅            | -             |
    | MBPP         | ✅            | ✅            |
    | Minerva Math | ✅            | ✅            |
    | MMLU         | ✅            | ✅            |
    | MMLU Pro     | ✅            | ✅            |
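For context on the dLLM-cache acceleration listed above, here is a minimal conceptual sketch of adaptive feature caching across denoising steps: a transformer block's output from the previous step is reused for tokens whose hidden states have barely changed. The function name, similarity threshold, and cache layout are illustrative assumptions, not the repository's actual implementation.

```python
import torch
import torch.nn.functional as F

def cached_block_forward(block, h, cache, tau=0.9):
    """Adaptive feature caching for one transformer block (conceptual sketch).

    `h` is the block input at the current denoising step; `cache` persists
    across steps. A real implementation gains speed by skipping computation
    for reused tokens; this sketch recomputes densely and only illustrates
    the reuse decision.
    """
    out = block(h)
    if "h" in cache:
        # Reuse last step's output for tokens whose inputs barely changed.
        sim = F.cosine_similarity(h, cache["h"], dim=-1)  # [batch, seq]
        reuse = (sim >= tau).unsqueeze(-1)                # [batch, seq, 1]
        out = torch.where(reuse, cache["out"], out)
    cache["h"], cache["out"] = h.detach(), out.detach()
    return out
```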

📝 TODO

  • Broaden dataset support for pretraining and SFT
  • Incorporate additional RL algorithms and strategies
  • Introduce more dLLM acceleration techniques (e.g., quantization, pruning)
  • Expand evaluation benchmarks and metrics
  • Improve user experience for deployment and customization

🛠️ Usage

Pretraining

Initiate pretraining with the following command:

```bash
cd pretrain
bash run_pretrain.sh
```
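For orientation, dLLMs such as LLaDA are pretrained with a masked-diffusion objective: each sequence is noised by masking a random fraction t of its tokens, and the model is trained to recover the masked positions under a 1/t-weighted cross-entropy. Below is a minimal sketch of that objective; the mask-token id and normalization are illustrative simplifications, not the repository's exact code.

```python
import torch
import torch.nn.functional as F

MASK_ID = 0  # illustrative; the real mask-token id is tokenizer-specific

def masked_diffusion_loss(model, input_ids):
    """One step of a LLaDA-style masked-diffusion objective (sketch)."""
    b, n = input_ids.shape
    # Sample a masking ratio t ~ U(0, 1] per sequence, then mask tokens.
    t = torch.rand(b, 1, device=input_ids.device).clamp(min=1e-3)
    mask = torch.rand(b, n, device=input_ids.device) < t
    noisy = torch.where(mask, torch.full_like(input_ids, MASK_ID), input_ids)
    logits = model(noisy).logits  # [b, n, vocab]
    # Cross-entropy on masked positions only, weighted by 1/t.
    ce = F.cross_entropy(logits.transpose(1, 2), input_ids, reduction="none")
    return (ce * mask / t).sum() / (b * n)
```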

Supervised Fine-tuning (SFT)

Start supervised fine-tuning with this command:

```bash
cd sft
accelerate launch --config_file ./config/accelerate/lora_config.yaml ./sft.py
```
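The launch command above points at a LoRA configuration; as a rough illustration of what attaching LoRA adapters with peft typically looks like (the model id, rank, and target modules here are placeholder assumptions, and the real settings live in the repository's config files and sft.py):

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModel

# Placeholder values only, for illustration.
model = AutoModel.from_pretrained(
    "GSAI-ML/LLaDA-8B-Base", trust_remote_code=True, torch_dtype=torch.bfloat16
)
lora_cfg = LoraConfig(
    r=16,            # adapter rank
    lora_alpha=32,   # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # attention projections
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # only the adapter weights are trainable
```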

Reinforcement Learning (RL)

Launch reinforcement learning using the provided script:

```bash
cd rl
bash examples/script/train_diffu_grpo.sh
```
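diff-grpo builds on GRPO (Group Relative Policy Optimization), whose core idea is to standardize each completion's reward against a group of completions sampled for the same prompt, so no learned value function is needed. A minimal sketch of that advantage computation (illustrative, not this repository's code):

```python
import torch

def group_relative_advantages(rewards: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """GRPO-style advantages for one prompt.

    `rewards` holds scalar rewards for a group of completions sampled from
    the same prompt; each advantage is the reward standardized against the
    group mean and standard deviation.
    """
    return (rewards - rewards.mean()) / (rewards.std() + eps)

# Example: four completions for one prompt, scored by a verifier.
adv = group_relative_advantages(torch.tensor([1.0, 0.0, 1.0, 0.0]))
# The policy gradient then weights each completion's token log-likelihoods by adv.
```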

Evaluation

Obtain evaluation results with this command:

```bash
cd evaluation
bash scripts/Dream/run_Dream_bbh_base.sh
```

🙏 Acknowledgments

We express our heartfelt thanks to the following projects for their outstanding contributions:

  • d1: A project dedicated to enhancing dLLM reasoning capabilities through reinforcement learning; the Reinforcement Learning code in this repository is adapted from it.
  • dLLM-cache: An implementation of adaptive caching to accelerate dLLMs, now integrated into this repository.
  • TinyLlama and SMDM: The pretraining code in this project draws inspiration from these repositories, and we are deeply grateful for their contributions.

📖 Citation

@misc{yangyicun2025dLLMFactory,
  title={dLLM-Factory: A Comprehensive Platform for Diffusion Large Language Models},
  author={Yicun Yang and Shuang Cheng and Dawei Liu and Yihan Bian and Yaojie Zhang and Biqing Qi and Linfeng Zhang},
  year={2025},
  url={https://github.com/maomaocun/dllm-Factory}
}

📧 Contact

For any questions or collaboration inquiries, feel free to reach out at: [email protected]

🌟 Star History

[Star History Chart]
