Skip to content
Change the repository type filter

All

    Repositories list

    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      BSD 3-Clause "New" or "Revised" License
      1.6k9.3k70872Updated Jun 4, 2025Jun 4, 2025
    • The Triton TensorRT-LLM Backend
      Shell
      Apache License 2.0
      12284631022Updated Jun 4, 2025Jun 4, 2025
    • FIL backend for the Triton Inference Server
      Jupyter Notebook
      Apache License 2.0
      3779533Updated Jun 4, 2025Jun 4, 2025
    • common

      Public
      Common source, scripts and utilities shared across all Triton repositories.
      C++
      BSD 3-Clause "New" or "Revised" License
      747205Updated Jun 4, 2025Jun 4, 2025
    • Python
      BSD 3-Clause "New" or "Revised" License
      1874129Updated Jun 3, 2025Jun 3, 2025
    • core

      Public
      The core library and APIs implementing the Triton Inference Server.
      C++
      BSD 3-Clause "New" or "Revised" License
      109133015Updated Jun 3, 2025Jun 3, 2025
    • The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
      C++
      MIT License
      33134225Updated Jun 3, 2025Jun 3, 2025
    • The Triton backend for TensorFlow.
      C++
      BSD 3-Clause "New" or "Revised" License
      215102Updated Jun 2, 2025Jun 2, 2025
    • Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Server models.
      Python
      Apache License 2.0
      77478276Updated Jun 2, 2025Jun 2, 2025
    • tutorials

      Public
      This repository contains tutorials and examples for Triton Inference Server
      Python
      BSD 3-Clause "New" or "Revised" License
      118718815Updated May 27, 2025May 27, 2025
    • Python
      BSD 3-Clause "New" or "Revised" License
      2826109Updated May 14, 2025May 14, 2025
    • Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
      Python
      56223Updated May 14, 2025May 14, 2025
    • Third-party source packages that are modified for use in Triton.
      C
      BSD 3-Clause "New" or "Revised" License
      59704Updated May 14, 2025May 14, 2025
    • The Triton backend for TensorRT.
      C++
      BSD 3-Clause "New" or "Revised" License
      327601Updated May 14, 2025May 14, 2025
    • Simple Triton backend used for testing.
      C++
      BSD 3-Clause "New" or "Revised" License
      5300Updated May 14, 2025May 14, 2025
    • An example Triton backend that demonstrates sending zero, one, or multiple responses for each request.
      C++
      BSD 3-Clause "New" or "Revised" License
      7600Updated May 14, 2025May 14, 2025
    • TRITONCACHE implementation of a Redis cache
      C++
      BSD 3-Clause "New" or "Revised" License
      41320Updated May 14, 2025May 14, 2025
    • The Triton backend for the PyTorch TorchScript models.
      C++
      BSD 3-Clause "New" or "Revised" License
      5215005Updated May 14, 2025May 14, 2025
    • Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
      C++
      BSD 3-Clause "New" or "Revised" License
      170617011Updated May 14, 2025May 14, 2025
    • OpenVINO backend for Triton.
      C++
      BSD 3-Clause "New" or "Revised" License
      173163Updated May 14, 2025May 14, 2025
    • The Triton backend for the ONNX Runtime.
      C++
      BSD 3-Clause "New" or "Revised" License
      64148732Updated May 14, 2025May 14, 2025
    • Implementation of a local in-memory cache for Triton Inference Server's TRITONCACHE API
      C++
      BSD 3-Clause "New" or "Revised" License
      1510Updated May 14, 2025May 14, 2025
    • Example Triton backend that demonstrates most of the Triton Backend API.
      C++
      BSD 3-Clause "New" or "Revised" License
      13700Updated May 14, 2025May 14, 2025
    • C++
      91804Updated May 14, 2025May 14, 2025
    • client

      Public
      Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
      Python
      BSD 3-Clause "New" or "Revised" License
      2396244628Updated May 14, 2025May 14, 2025
    • The Triton repository agent that verifies model checksums.
      C++
      BSD 3-Clause "New" or "Revised" License
      71100Updated May 14, 2025May 14, 2025
    • backend

      Public
      Common source, scripts and utilities for creating Triton backends.
      C++
      BSD 3-Clause "New" or "Revised" License
      9632603Updated May 14, 2025May 14, 2025
    • Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
      Python
      Apache License 2.0
      2520230Updated Apr 22, 2025Apr 22, 2025
    • .github

      Public
      Community health files for NVIDIA Triton
      2200Updated Mar 27, 2025Mar 27, 2025
    • triton_distributed

      Public archive
      Rust
      Apache License 2.0
      15493637Updated Mar 7, 2025Mar 7, 2025