VideoProc

VideoProc 提供了基于 CUDA 加速优化的视频编解码推理 Python SDK，支持多路视频流的同时解码、推理、编码，同时支持多 GPU 与多模型推理

⭐ 多进程单线程绕过 GIL 限制，提升 Python 并发性能

⭐ 减少 Host-Device 数据传输，降低 GPU 显存冗余拷贝，提升推理速度

⭐ 尽可能在 GPU 上计算，以降低 CPU 计算负担

⭐ 开箱即用，简单易懂，扩展性强，适合中小型项目快速部署

	Open Source	Learning Curve	Developer Friendliness	Performance	Architecture Design
DeepStream	❌	High	Low	High	Single-process, multi-threaded
VideoPipe	✅	medium（requires cpp knowledge）	Medium（requires cpp knowledge）	Medium	Single-process, multi-threaded
Our	✅	≈ 0	High +++++++++++	Medium	Multi-process, single-threaded

Quick Start

本项目推荐 Docker 容器运行，首先确保本地环境满足以下三个条件：

Docker >= 24.0.0
NVIDIA Driver >= 590
NVIDIA Container Toolkit >= 1.13.0

1. 生成镜像

clone 本项目，生成包含完整开发环境的镜像

git clone https://github.com/lmk123568/VideoProc.git
cd VideoProc/docker
docker build -t vidproc:cuda12.8 .

镜像生成后，进入容器，不报错即成功

docker run -it \
  --gpus all \
  -e NVIDIA_DRIVER_CAPABILITIES=all \
  -v {your_path}/VideoProc:/workspace \
  vidproc:cuda12.8 \
  bash

后续示例代码默认在容器内/workspace运行

⚠️ 不推荐自己本地装环境，如果一定要自己装，请参考 Dockerfile

2. 编译加速包

python scripts/setup.py install

这里面包含了硬件编解码、YOLO26 推理优化的 C++ 实现，并通过 Pybind11 给 Python 调用

3. 训练模型权重转换

将通过 ultralytics 训练的pt模型导入到当前目录（/workspace）下（示例模型为 yolo26n.pt）

python scripts/pt2trt.py  --w ./yolo26n.pt --fp16

转换过程中会与 ultralytics 官方结果进行推理对齐

💡 TensorRT 编译生成 .engine 过程中，推理尺寸默认设置为(576,1024)，可以跳过letterbox降低计算开销

遇到警告 requirements: Ultralytics requirement ['onnxruntime-gpu'] not found, attempting AutoUpdate... 可以 Ctrl + C 跳过

4. 运行

开启 MPS（Multi-Process Service）

nvidia-cuda-mps-control -d
# echo quit | nvidia-cuda-mps-control  关闭 MPS

阅读理解其代码并运行

python main.py

Benchmark

测试日期: 2026-01-25

测试硬件: AMD Ryzen 9 5950 X + NVIDIA GeForce RTX 3090

测试任务: 4 × RTSP Decoders → YOLO26 (TensorRT) → 4 × RTMP Encoders

	CPU	RAM	GPU VRAM	GPU-Util
VideoPipe（ffmpeg codec）	511.6 %	1.5 GiB	2677 MiB	16 %
Our	40 %	1.2GiB	3932 MiB	12 %

output_annotated.mp4

Pipeline

License

BSD-2-Clause

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
assets		assets
cpp		cpp
docker		docker
scripts		scripts
.clang-format		.clang-format
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VideoProc

Quick Start

1. 生成镜像

2. 编译加速包

3. 训练模型权重转换

4. 运行

Benchmark

Pipeline

License

About

Uh oh!

Languages

License

lmk123568/VideoProc

Folders and files

Latest commit

History

Repository files navigation

VideoProc

Quick Start

1. 生成镜像

2. 编译加速包

3. 训练模型权重转换

4. 运行

Benchmark

Pipeline

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages