Feature/gpu async encoding speed enhancement #2169

nhnifong · 2025-10-10T23:48:04Z

What this does

This PR is a close relative of #1671
I rebased it to the lerobot main branch to make it work with dataset format 3.0
At the time of this writing, this PR is few changes ahead of that one. Please let me know if you would rather use the other one.

I have removed batched encoding completely. it is replaced with this feature.

With async video encoding enabled, the 30 to 40 second encoding wait cited here
#1434
becomes about 5s. If the imagewriter format is changed to JPEG from the default PNG, it becomes 0s.

Additionally If GPU encoding is enabled, or the codec is changed from av1 to h264, CPU usage is drastically reduced.

How it was tested

I created new LeRobotDataset instances with the following combinations of settings and confirmed that the saved datasets could be read by lerobot_train without errors being reported

create a new dataset with async video encoding enabled. save one episode and push to huggingface, resume the dataset (that is, create it with LeRobotDataset() not LeRobotDataset.create()) with async video encoding enabled and record one episode and push to huggingface.
the same as 1, but with the cache deleted and download_videos=False when resuming the dataset.
the same as 1, but with async encoding disabled.
record one episode with async enabled. delete the cache, resume the dataset with async enabled and download_videos=False, record one episode, delete the cache, resume the dataset with async enabled and download_videos=True, record one episode.

I did find that causing three episodes to be concatenated in one session fails, but this reported seperated in #2161 and I didn't fix it. I couldn't figure out how. I just avoided it by disabling the concatenation.

How to checkout & try? (for the reviewer)

Follow one of the basic dataset recording tutorials and watch for the reduction in the time taken by the save_episode() call. this PR should only be a performance improvement during recording and should not have any other effect.

…mparison

- Added FOURCC configuration option to OpenCVCamera and OpenCVCameraConfig for specifying video format. - Implemented _validate_fourcc method to validate and set the camera's FOURCC code. - Updated _configure_capture_settings to apply FOURCC settings before FPS and resolution. - Enhanced camera detection to include default FOURCC code in camera info. - Updated documentation to reflect new FOURCC parameter and its implications on performance.

for more information, see https://pre-commit.ci

- Implemented tests to validate FOURCC configuration and its application in OpenCVCamera. - Added checks for valid FOURCC codes and ensured that invalid codes raise appropriate errors. - Included a test for camera connection functionality using specified FOURCC settings.

for more information, see https://pre-commit.ci

…ithout double prefixing

…SV, and AMD VCE

…dependent features

…encoding, including async and GPU encoding guides, performance measurement tools, and test scripts. This cleanup streamlines the codebase and eliminates redundancy following recent enhancements and optimizations.

- GPU acceleration using NVIDIA NVENC (3-4x speedup) - Async background encoding with configurable workers - Automatic CPU fallback for reliability - Timeout protection to prevent stuck processes - Tested with real robot hardware (SO-101) - Includes test scripts and concise documentation Performance: Non-blocking recording with 3-4x encoding speedup

… to streamline the codebase following recent enhancements. This cleanup eliminates redundancy and focuses on the latest improvements in video encoding features.

- Remove unused variable 'result' in encode_video method - Use shutil.which() to get full path to ffmpeg for security - Fix bandit B607 warning about partial executable path

- Format Python files with ruff - Fix code style and formatting issues - Ensure consistent code formatting across the codebase

- Fix trailing whitespace in GPU_ENCODING_README.md and Python files - Apply ruff format to lerobot_dataset.py and record.py - Ensure all files meet pre-commit formatting standards

- Fix ruff linting issues (16 errors resolved) - Apply pyupgrade syntax updates to Python files - Apply prettier formatting to markdown files - Ensure all files meet pre-commit standards

- Update Python syntax to Python 3.9+ standards - Apply modern Python syntax improvements - Ensure all files meet latest pyupgrade requirements

- Fix remaining 10 ruff linting errors - Apply final pyupgrade syntax updates - Ensure all files meet pre-commit standards completely

- Use --py310-plus flag to match pre-commit configuration - Apply Python 3.10+ syntax updates - Ensure compatibility with pre-commit pyupgrade hook

- Apply remaining ruff fixes to all Python files - Ensure all linting issues are completely resolved - Final step to achieve full pre-commit compliance

Cause workers to wait for all the images in their working directory to be complete.

…omplete

… ints

forgetwhatuwant and others added 30 commits October 8, 2025 11:06

Add comprehensive benchmarking system for encoding speed analysis

a29dfc2

Fix division by zero issues in benchmarking system

2d3a7aa

Implement async video encoding for improved recording performance

2c4d001

Fix AsyncVideoEncoder shutdown and add comprehensive tests

1eb58d5

Add comprehensive async encoding implementation summary

20d3a0d

Add async encoding support to benchmarking script with performance co…

ce5bd45

…mparison

Add encoding timing test and monitoring tools for async encoding

1f421b7

[pre-commit.ci] auto fixes from pre-commit.com hooks

d293124

for more information, see https://pre-commit.ci

[pre-commit.ci] auto fixes from pre-commit.com hooks

d16b444

for more information, see https://pre-commit.ci

Fix circular import in __init__.py - change to relative import

1a2fd82

Fix async video encoder path construction for correct dataset structure

bca53b1

Fix async video encoder path construction - use video keys directly w…

57d73aa

…ithout double prefixing

Add GPU-accelerated video encoding support with NVIDIA NVENC, Intel Q…

c11060d

…SV, and AMD VCE

Add comprehensive GPU encoding guide and test scripts

d849538

Clarify distinction between async encoding and GPU acceleration as in…

f680608

…dependent features

Remove outdated GPU encoding issues documentation and related scripts…

0c68d08

… to streamline the codebase following recent enhancements. This cleanup eliminates redundancy and focuses on the latest improvements in video encoding features.

Fix linting and security issues in gpu_video_encoder.py

4a28a16

- Remove unused variable 'result' in encode_video method - Use shutil.which() to get full path to ffmpeg for security - Fix bandit B607 warning about partial executable path

Apply code formatting fixes from pre-commit

634a030

- Format Python files with ruff - Fix code style and formatting issues - Ensure consistent code formatting across the codebase

Apply remaining formatting fixes

d22fb30

- Fix trailing whitespace in GPU_ENCODING_README.md and Python files - Apply ruff format to lerobot_dataset.py and record.py - Ensure all files meet pre-commit formatting standards

Apply final formatting fixes from pre-commit

5cf8b68

- Fix ruff linting issues (16 errors resolved) - Apply pyupgrade syntax updates to Python files - Apply prettier formatting to markdown files - Ensure all files meet pre-commit standards

Apply final pyupgrade syntax updates

0e2bc39

- Update Python syntax to Python 3.9+ standards - Apply modern Python syntax improvements - Ensure all files meet latest pyupgrade requirements

Apply final pre-commit fixes

145e9b2

- Fix remaining 10 ruff linting errors - Apply final pyupgrade syntax updates - Ensure all files meet pre-commit standards completely

Apply pyupgrade with correct Python 3.10+ version

a1a73d1

- Use --py310-plus flag to match pre-commit configuration - Apply Python 3.10+ syntax updates - Ensure compatibility with pre-commit pyupgrade hook

Fix final 6 ruff linting errors

1f3c1a4

- Apply remaining ruff fixes to all Python files - Ensure all linting issues are completely resolved - Final step to achieve full pre-commit compliance

Make async encoding work with 3.0 dataset format

5f0388d

Test sync and async episode recording

c653781

nhnifong added 4 commits October 8, 2025 11:06

Re-enable GPU video encoding feature

6e1afbb

Cause workers to wait for all the images in their working directory to be complete.

Update metadata for episode differently when async vid recording is c…

075b004

…omplete

Handle case where episode is resumed with download_videos=False

e41a8a8

Pass along codec setting and always interpret chunk and file index as…

c79eaa8

… ints

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/gpu async encoding speed enhancement #2169

Feature/gpu async encoding speed enhancement #2169

nhnifong commented Oct 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feature/gpu async encoding speed enhancement #2169

Are you sure you want to change the base?

Feature/gpu async encoding speed enhancement #2169

Conversation

nhnifong commented Oct 10, 2025

What this does

How it was tested

How to checkout & try? (for the reviewer)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants