Commit

Update README.md
cliffordkleinsr authored Feb 23, 2024
1 parent 76fac5c commit 8d0d300
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion README.md
@@ -85,8 +85,9 @@ Original | Processed
# Performance Optimizations
1. Moved the final scaling and uint8 quantization to GPU, reducing CPU and main memory bandwidth consumption. 2.5x speed-up.
2. Instruct FFMPEG to use RGB frames instead of BGR so no need to swap channels.
-3. Batched inference (controlled by --batch parameter, default is 4).
+3. Batched inference (controlled by the --batch and --batches parameters, default is 4).
4. Instruct torch to make contiguous tensors after the BCHW -> BHWC transform on GPU. So no need to copy the buffer before writing to FFMPEG. Reduced output IO time by 10x.
+5. Use the NVENC pipeline when available to decode and encode the images when piping inputs.
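
The FFMPEG-related items above (RGB frames, batching, NVENC when available) could be sketched roughly as follows. This is not the repository's code: the function names `decode_command`, `encode_command`, `frame_bytes`, and `batched` are hypothetical, the `libx264` fallback is an assumption, and only standard FFMPEG options (`-hwaccel cuda`, `-pix_fmt rgb24`, `-f rawvideo`, `-c:v h264_nvenc`) are used.

```python
from itertools import islice
from typing import Iterable, Iterator, List


def decode_command(src: str, use_hwaccel: bool = False) -> List[str]:
    """Build an ffmpeg command that writes raw RGB frames to stdout (hypothetical helper)."""
    cmd = ["ffmpeg"]
    if use_hwaccel:
        # Input option: hardware-accelerated decoding, only on CUDA-enabled builds.
        cmd += ["-hwaccel", "cuda"]
    cmd += [
        "-i", src,
        "-f", "rawvideo",     # headerless frames, written back to back
        "-pix_fmt", "rgb24",  # RGB byte order, so no BGR -> RGB swap is needed later
        "pipe:1",             # write to stdout
    ]
    return cmd


def encode_command(dst: str, width: int, height: int, fps: float,
                   use_nvenc: bool = False) -> List[str]:
    """Build an ffmpeg command that reads raw RGB frames from stdin (hypothetical helper)."""
    # Assumption: fall back to the software x264 encoder when NVENC is unavailable.
    codec = "h264_nvenc" if use_nvenc else "libx264"
    return [
        "ffmpeg", "-y",
        "-f", "rawvideo", "-pix_fmt", "rgb24",
        "-s", f"{width}x{height}", "-r", str(fps),
        "-i", "pipe:0",       # read frames from stdin
        "-c:v", codec,
        dst,
    ]


def frame_bytes(width: int, height: int) -> int:
    """Size of one rgb24 frame: 3 bytes per pixel."""
    return width * height * 3


def batched(frames: Iterable, batch_size: int = 4) -> Iterator[list]:
    """Group frames into lists of batch_size; the last batch may be shorter."""
    it = iter(frames)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch
```

In such a setup, the decoder would be started with `subprocess.Popen(decode_command(...), stdout=subprocess.PIPE)`, and each frame read as exactly `frame_bytes(width, height)` bytes before being grouped with `batched` for inference.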

# Open tasks
1. [X] Take a video frame and turn it into images