NVIDIA Corporation

All

612 repositories

NVFlare
Public
NVIDIA Federated Learning Application Runtime Environment
python decentralized pet privacy-protection federated-learning federated-analytics federated-computing
Python
•
Apache License 2.0
•214•809•11•21•Updated Oct 14, 2025Oct 14, 2025
cloud-native-docs
Public
Documentation repository for NVIDIA Cloud Native Technologies
kubernetes containers kubernetes-operator
PowerShell
•
Apache License 2.0
•29•29•5•12•Updated Oct 14, 2025Oct 14, 2025
recsys-examples
Public
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
pytorch recommender-system recommenders generative-recommenders
Python
•
Other
•33•153•38•7•Updated Oct 14, 2025Oct 14, 2025
Fuser
Public
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++
•
Other
•69•357•201•189•Updated Oct 14, 2025Oct 14, 2025
bionemo-framework
Public
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
machine-learning gpu pytorch drug-discovery
Jupyter Notebook
•87•534•55•84•Updated Oct 14, 2025Oct 14, 2025
dgx-spark-playbooks
Public
Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.
TypeScript
•
Apache License 2.0
•0•2•0•0•Updated Oct 14, 2025Oct 14, 2025
TensorRT-LLM
Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
cuda pytorch moe blackwell llm-serving
C++
•
Apache License 2.0
•1.8k•12k•749•403•Updated Oct 14, 2025Oct 14, 2025
cudaqx
Public
Accelerated libraries for quantum-classical computing built on CUDA-Q.
C++
•
Other
•33•60•23•14•Updated Oct 14, 2025Oct 14, 2025
VisRTX
Public
NVIDIA OptiX based implementation of ANARI
C++
•
Other
•35•264•9•0•Updated Oct 14, 2025Oct 14, 2025
holodeck
Public
Holodeck is a project to create test environments optimised for GPU projects.
Go
•
Apache License 2.0
•8•19•3•5•Updated Oct 14, 2025Oct 14, 2025
NeMo-Agent-Toolkit
Public
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
Python
•
Apache License 2.0
•384•1.4k•57•28•Updated Oct 14, 2025Oct 14, 2025
jax-tvm-ffi
Public
JAX support for tvm-ffi abi
C++
•
Apache License 2.0
•1•5•0•1•Updated Oct 14, 2025Oct 14, 2025
cccl
Public
CUDA Core Compute Libraries
cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing
C++
•
Other
•284•2k•1.1k•187•Updated Oct 14, 2025Oct 14, 2025
cuda-quantum
Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
python cpp quantum quantum-computing hacktoberfest quantum-programming-language quantum-algorithms quantum-machine-learning unitaryhack
C++
•
Other
•292•817•404•88•Updated Oct 13, 2025Oct 13, 2025
skyhook
Public
A Kubernetes Operator to manage Node OS customizations.
Go
•
Apache License 2.0
•3•27•0•2•Updated Oct 13, 2025Oct 13, 2025
TensorRT-Model-Optimizer
Public
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
Python
•
Apache License 2.0
•174•1.4k•120•34•Updated Oct 13, 2025Oct 13, 2025
nvidia-resiliency-ext
Public
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
Python
•
Other
•34•226•1•12•Updated Oct 13, 2025Oct 13, 2025
aistore
Public
AIStore: scalable storage for AI applications
kubernetes high-performance distributed-storage high-availability object-storage multi-cloud batch-jobs s3-compatible multipart-upload ml-training
Go
•
MIT License
•219•1.6k•0•0•Updated Oct 13, 2025Oct 13, 2025
topograph
Public
A toolkit for discovering cluster network topology.
Go
•
Apache License 2.0
•6•72•1•0•Updated Oct 13, 2025Oct 13, 2025
numba-cuda
Public
The CUDA target for Numba
Python
•
BSD 2-Clause "Simplified" License
•39•197•90•24•Updated Oct 13, 2025Oct 13, 2025
jaxpp
Public
JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training
Python
•
Apache License 2.0
•1•54•0•1•Updated Oct 13, 2025Oct 13, 2025
NeMo-Agent-Toolkit-UI
Public
The NVIDIA AIQToolkit UI streamlines interacting with AIQToolkit workflows in an easy-to-use web application.
TypeScript
•
Other
•36•50•4•11•Updated Oct 13, 2025Oct 13, 2025
gpu-operator
Public
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
kubernetes gpu cuda nvidia
Go
•
Apache License 2.0
•393•2.3k•392•69•Updated Oct 13, 2025Oct 13, 2025
gpu-driver-container
Public
The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
Shell
•
Apache License 2.0
•56•135•25•22•Updated Oct 13, 2025Oct 13, 2025
TransformerEngine
Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
python machine-learning deep-learning gpu cuda pytorch jax fp8
Python
•
Apache License 2.0
•517•2.8k•219•93•Updated Oct 13, 2025Oct 13, 2025
cuopt
Public
GPU accelerated decision optimization
gpu optimization cuda linear-programming
Cuda
•
Apache License 2.0
•82•464•85•16•Updated Oct 13, 2025Oct 13, 2025
audio-intelligence
Public
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with synthetic captions.
Python
•4•57•0•0•Updated Oct 13, 2025Oct 13, 2025
spark-rapids
Public
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
big-data spark gpu rapids
Scala
•
Apache License 2.0
•258•935•1.7k•23•Updated Oct 13, 2025Oct 13, 2025
spark-rapids-jni
Public
RAPIDS Accelerator JNI For Apache Spark
Cuda
•
Apache License 2.0
•74•51•79•7•Updated Oct 13, 2025Oct 13, 2025
QEMU
Public
NVIDIA fork of QEMU
C
•
Other
•4•6•0•0•Updated Oct 13, 2025Oct 13, 2025