chenpeng-zhao
What this does

Explain what this PR does. Feel free to tag your PR with the appropriate label(s).

Examples:

Title | Label
Fixes #[issue] | (🐛 Bug)
Adds new dataset | (🗃️ Dataset)
Optimizes something | (⚡️ Performance)

How it was tested

Explain/show how you tested your changes.

Examples:

  • Added test_something in tests/test_stuff.py.
  • Added new_feature and checked that training converges with policy X on dataset/environment Y.
  • Optimized some_function, it now runs X times faster than previously.

How to checkout & try? (for the reviewer)

Provide a simple way for the reviewer to try out your changes.

Examples:

pytest -sx tests/test_stuff.py::test_something
lerobot-train --some.option=true

SECTION TO REMOVE BEFORE SUBMITTING YOUR PR

Note: Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR. Try to avoid tagging more than 3 people.

Note: Before submitting this PR, please read the contributor guideline.

chenpeng_zhao added 4 commits October 5, 2025 16:27
- Implemented comprehensive video clipping and merging functionality
- Added command-line argument parsing for configurability
- Features include:
  - Clip videos/parquet data by timestamp duration
  - Merge clipped episodes into continuous dataset
  - Upload to new HuggingFace repository with automatic cleanup
  - Flag-controlled video processing (parquet-only or full video processing)
  - Absolute path handling for video concatenation
  - Metadata preservation with clipping information
- Usage: python process_data.py --repo_id source --new_repo_id target [--process_videos]
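
The script itself is not shown on this page, so the following is only a minimal sketch of how the clip-and-merge steps listed above might fit together. The three command-line flags come from the usage line; everything else is an assumption made for illustration: the function names (`build_parser`, `clip_episode`, `merge_episodes`), the `max_duration_s` parameter, the frame representation as plain dicts, and the field names `timestamp`, `episode_index`, and `index`. Video concatenation and the upload to the new repository are left out.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a CLI with the three flags named in the usage line."""
    parser = argparse.ArgumentParser(description="Clip and merge dataset episodes.")
    parser.add_argument("--repo_id", required=True, help="Source dataset repository.")
    parser.add_argument("--new_repo_id", required=True, help="Target dataset repository.")
    # Assumption: omitting the flag means parquet-only processing.
    parser.add_argument(
        "--process_videos",
        action="store_true",
        help="Also process videos instead of parquet data only.",
    )
    return parser


def clip_episode(frames, max_duration_s):
    """Keep frames whose timestamp is within max_duration_s of the episode start.

    `frames` is a hypothetical list of dicts, each with a "timestamp" in seconds.
    """
    if not frames:
        return []
    start = frames[0]["timestamp"]
    return [f for f in frames if f["timestamp"] - start <= max_duration_s]


def merge_episodes(episodes):
    """Concatenate clipped episodes and renumber frames with one continuous index."""
    merged = []
    for episode_index, frames in enumerate(episodes):
        for frame in frames:
            # Copy the frame so the input episodes are left unmodified.
            merged.append({**frame, "episode_index": episode_index, "index": len(merged)})
    return merged
```

With this sketch, `build_parser().parse_args(["--repo_id", "source", "--new_repo_id", "target"])` leaves `process_videos` set to `False`, which corresponds to the parquet-only mode under the assumption stated above.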
@chenpeng-zhao chenpeng-zhao left a comment
