    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      8.7k 52k 1.8k 789 Updated Jul 11, 2025
    • Python
      1401 Updated Jul 11, 2025
    • Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
      Python
      Apache License 2.0
      1741.6k2831 Updated Jul 11, 2025
    • Community maintained hardware plugin for vLLM on Spyre
      Python
      Apache License 2.0
      18301117 Updated Jul 11, 2025
    • Community maintained hardware plugin for vLLM on Ascend
      Python
      Apache License 2.0
      249865231118 Updated Jul 11, 2025
    • ci-infra

      Public
      This repo hosts code for vLLM CI & Performance Benchmark infrastructure.
      HCL
      291408 Updated Jul 11, 2025
    • guidellm

      Public
      Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
      Python
      Apache License 2.0
      523944510 Updated Jul 10, 2025
    • vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
      Python
      Apache License 2.0
      2231.5k5441 Updated Jul 10, 2025
    • aibrix

      Public
      Cost-efficient and pluggable Infrastructure components for GenAI inference
      Go
      Apache License 2.0
      3923.9k19517 Updated Jul 10, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.8k80012 Updated Jul 9, 2025
    • HTML
      181100 Updated Jul 1, 2025
    • Python
      Apache License 2.0
      72020 Updated Jun 5, 2025
    • rfcs

      Public
      0100 Updated Jun 3, 2025
    • FlashMLA

      Public
      Cuda
      MIT License
      875500 Updated Apr 23, 2025
    • HTML
      MIT License
      7801 Updated Feb 7, 2025
    • media-kit

      Public
      vLLM Logo Assets
      1300 Updated Dec 12, 2024
    • vllm-nccl

      Public archive
      Manages vllm-nccl dependency
      Python
      Apache License 2.0
      31720 Updated Jun 3, 2024
    • dashboard

      Public
      vLLM performance dashboard
      Python
      Apache License 2.0
      73200 Updated Apr 26, 2024