    Repositories list

    • minions

      Public
      Big & Small LLMs working together
      Python
      MIT License
      1131k012 Updated Jun 25, 2025
    • Python
      MIT License
      01000 Updated Jun 24, 2025
    • Tile primitives for speedy kernels
      Cuda
      MIT License
      1582.5k3814 Updated Jun 22, 2025
    • Storing long contexts in tiny caches with self-study
      Python
      Apache License 2.0
      57140 Updated Jun 19, 2025
    • zoology

      Public
      Understand and test language model architectures on synthetic tasks.
      Python
      Apache License 2.0
      3521821 Updated Jun 8, 2025
    • based

      Public
      Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
      Python
      Apache License 2.0
      1523530 Updated Jun 6, 2025
    • kernels, of the mega variety
      Python
      MIT License
      1940621 Updated Jun 2, 2025
    • hyena-dna

      Public
      Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
      Assembly
      Apache License 2.0
      99694327 Updated Apr 22, 2025
    • Python
      MIT License
      5700 Updated Mar 18, 2025
    • lolcats

      Public
      Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
      Python
      Apache License 2.0
      2523970 Updated Jan 31, 2025
    • aioli

      Public
      Aioli: A unified optimization framework for language model data mixing
      Jupyter Notebook
      Apache License 2.0
      42710 Updated Jan 17, 2025
    • FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
      C++
      Apache License 2.0
      28319183 Updated Dec 28, 2024
    • m2

      Public
      Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
      Assembly
      Apache License 2.0
      42554252 Updated Dec 28, 2024
    • meerkat

      Public
      Creative interactive views of any dataset.
      Python
      Apache License 2.0
      4384183 Updated Dec 24, 2024
    • smoothie

      Public
      Jupyter Notebook
      MIT License
      3700 Updated Dec 10, 2024
    • train-tk

      Public
      train with kittens!
      Python
      7k6000 Updated Oct 25, 2024
    • WONDERBREAD benchmark + dataset for BPM tasks
      Jupyter Notebook
      62410 Updated Oct 20, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      8.3k100 Updated Oct 14, 2024
    • Automating enterprise workflows with multimodal agents
      Jupyter Notebook
      Apache License 2.0
      1310700 Updated Oct 9, 2024
    • An open science effort to benchmark legal reasoning in foundation models
      Python
      6544497 Updated Aug 25, 2024
    • hgcn

      Public
      Hyperbolic Graph Convolutional Networks in PyTorch.
      Python
      114627203 Updated Jul 25, 2024
    • manifest

      Public
      Prompt programming with FMs.
      Python
      Apache License 2.0
      4544362 Updated Jul 22, 2024
    • Python
      15500 Updated Jul 9, 2024
    • safari

      Public
      Convolutions for Sequence Modeling
      Assembly
      Apache License 2.0
      71890251 Updated Jun 13, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.5k1000 Updated Jun 8, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.5k800 Updated Jun 3, 2024
    • axolive

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      1.1k100 Updated Jun 3, 2024
    • Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
      Jupyter Notebook
      2.5k100 Updated Jun 3, 2024
    • Python
      Apache License 2.0
      3217820 Updated May 27, 2024
    • evaporate

      Public
      This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"
      Python
      44488102 Updated Mar 26, 2024