Skip to content
Change the repository type filter

All

    Repositories list

    • A framework for few-shot evaluation of language models.
      Python
      2.6k000Updated Jul 30, 2025Jul 30, 2025
    • Secure open source cloud runtime for AI apps & AI agents
      MDX
      627000Updated Jun 25, 2025Jun 25, 2025
    • A PyTorch native platform for training generative AI models
      Python
      448000Updated Jun 11, 2025Jun 11, 2025
    • Python
      0001Updated Jun 10, 2025Jun 10, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      9k200Updated May 10, 2025May 10, 2025
    • Python
      11000Updated Apr 4, 2025Apr 4, 2025
    • Unified automatic quality assessment for speech, music, and sound.
      Python
      38100Updated Mar 22, 2025Mar 22, 2025
    • Zonos

      Public
      Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.
      Python
      7756.9k13425Updated Mar 5, 2025Mar 5, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k200Updated Feb 18, 2025Feb 18, 2025
    • Python
      14870Updated Feb 5, 2025Feb 5, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k300Updated Feb 4, 2025Feb 4, 2025
    • Zamba2

      Public
      PyTorch implementation of models from the Zamba2 series.
      Python
      1718431Updated Jan 23, 2025Jan 23, 2025
    • zcookbook

      Public
      Training hybrid models for dummies.
      Python
      32501Updated Jan 16, 2025Jan 16, 2025
    • Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
      Python
      512710Updated Dec 3, 2024Dec 3, 2024
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      4.7k000Updated Nov 6, 2024Nov 6, 2024
    • Ongoing research training transformer models at scale
      Python
      3k0104Updated Aug 20, 2024Aug 20, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      3k002Updated Aug 19, 2024Aug 19, 2024
    • Fast and memory-efficient exact attention
      Python
      1.9k000Updated Jul 8, 2024Jul 8, 2024
    • Python
      1700Updated Jul 1, 2024Jul 1, 2024
    • mamba

      Public
      Python
      1.4k500Updated Jun 27, 2024Jun 27, 2024
    • Python
      23710Updated Jun 19, 2024Jun 19, 2024
    • High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
      C++
      440100Updated Jun 11, 2024Jun 11, 2024
    • Dataset for the temporal memory tests
      0700Updated Jun 4, 2024Jun 4, 2024
    • Robust recipes to align language models with human and AI preferences
      Python
      453000Updated Jun 3, 2024Jun 3, 2024
    • Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
      Python
      208000Updated Mar 8, 2024Mar 8, 2024
    • Code repository for Black Mamba
      Python
      1725260Updated Feb 8, 2024Feb 8, 2024
    • 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
      Rust
      945101Updated Feb 3, 2024Feb 3, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      4.5k000Updated Nov 2, 2023Nov 2, 2023
    • apex

      Public
      A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
      Python
      1.5k000Updated Nov 1, 2023Nov 1, 2023