Skip to content

This repo is a tiny language model mimicking LLMs. I would like to create LLM-like model from scratch. To some degree LLM could also be abbreviated to little language model. :D

License

Notifications You must be signed in to change notification settings

BreezeShane/LittleLanguageModel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LittleLanguageModel

This repo is a tiny language model mimicking LLMs. I would like to create LLM-like model from scratch. To some degree LLM could also be abbreviated to little language model. :D

Todolist

  • Collect Data
  • Clean & Augment Data
  • Make Tokenizer
  • Make Model Based on Transformer
  • Pretrain Model
  • LLM Alignment

Optional

  • Transfer to downstream tasks.
  • SFT Training
  • Distribute Data Parallel
  • Distill model
  • RAG
  • Agent

Recommend to read

  1. LLM数据工程3——数据收集魔法:获取顶级训练数据的方法
  2. 掌握 LLM 技术:数据预处理

About

This repo is a tiny language model mimicking LLMs. I would like to create LLM-like model from scratch. To some degree LLM could also be abbreviated to little language model. :D

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages