This project enables mouse control through hand gestures by running the Mediapipe and Hagrid models in parallel.
The system captures hand movements via video and translates them into corresponding mouse actions.
Key Components:
- Mediapipe: Detects finger joints and skeletal structure to track precise finger positions
- Hagrid (YOLOv10n): Recognizes various hand poses and gestures
- Integration: Visualizes hand movements using Mediapipe's coordinate data while determining mouse actions through YOLO-detected poses
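As a rough illustration of this integration, the combination can be reduced to one decision step: MediaPipe's landmark coordinates say where the cursor should go, while the YOLO-detected pose says what to do. The sketch below is illustrative only; the class names ("point_up", "fist", "palm") are placeholders, not the detector's actual labels.

```python
def decide_mouse_action(landmarks, gesture_label):
    """Combine MediaPipe landmarks with a YOLO-detected pose into one mouse action.
    `landmarks` is a list of 21 normalized (x, y) points from MediaPipe Hands;
    `gesture_label` is the class name emitted by the Hagrid-trained detector
    (the names below are placeholders)."""
    index_tip = landmarks[8]          # landmark 8 = index fingertip in MediaPipe Hands
    if gesture_label == "point_up":
        return ("move", index_tip)    # cursor follows the fingertip
    if gesture_label == "fist":
        return ("left_down", None)    # press the left button
    if gesture_label == "palm":
        return ("left_up", None)      # release the left button
    return ("none", None)
```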
Average Processing Time per Stage:
- Camera: 6.16 ms (range: 5-8 ms)
- Reasoning: 39.19 ms (range: 25-51 ms)
- Visualization: 9.35 ms (range: 0-29 ms)
Total Average Processing Time: 54.70 ms
Share of total time:
- Camera: 11.3%
- Reasoning: 71.6%
- Visualization: 17.1%
The reasoning component dominates the processing pipeline, accounting for over 70% of the total execution time. Camera capture is highly consistent and efficient, while visualization time varies significantly (0-29ms) depending on whether rendering is needed.
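One way to reproduce this breakdown is to wrap each stage in a high-resolution timer and average over frames. A minimal sketch follows; the stage functions named in the usage comment are placeholders, not the project's actual function names.

```python
import time
from collections import defaultdict

timings = defaultdict(list)

def timed(stage, fn, *args, **kwargs):
    """Run one pipeline stage and record its wall-clock duration in milliseconds."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    timings[stage].append((time.perf_counter() - start) * 1000.0)
    return result

def report():
    """Print average and min/max duration per stage, matching the breakdown above."""
    for stage, samples in timings.items():
        print(f"{stage}: avg {sum(samples) / len(samples):.2f} ms "
              f"(range {min(samples):.0f}-{max(samples):.0f} ms)")

# Usage inside the main loop (capture_frame, run_models, render are placeholders):
#   frame = timed("camera", capture_frame)
#   results = timed("reasoning", run_models, frame)
#   timed("visualization", render, frame, results)
```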
Processing Pipeline:
Input Stage
- The camera acquires video frames for processing
Primary Processing Stage
- Acquired frames branch into three directions (dispatched in parallel, as sketched below):
- MediaPipe processing (hand landmark detection)
- YOLO processing (object detection)
- Visualization model update
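One common way to realize this branching is to hand the same frame to MediaPipe and YOLO on separate worker threads and collect both results before moving on. The sketch below assumes `hands` and `yolo_model` are created at startup and that the YOLO model follows the ultralytics calling convention.

```python
import cv2
from concurrent.futures import ThreadPoolExecutor

executor = ThreadPoolExecutor(max_workers=2)

def process_frame(frame_bgr, hands, yolo_model):
    """Dispatch one captured frame to MediaPipe and YOLO on separate threads.
    `hands` is assumed to be a mediapipe.solutions.hands.Hands instance and
    `yolo_model` the Hagrid-trained YOLOv10n detector."""
    rgb = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB)      # MediaPipe expects RGB input
    mp_future = executor.submit(hands.process, rgb)       # hand landmark detection
    yolo_future = executor.submit(yolo_model, frame_bgr)  # gesture / bounding-box detection
    return mp_future.result(), yolo_future.result()
```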
Parallel Processing Stage
- MediaPipe results branch into two parallel tasks:
- Mouse control (see the sketch after this list)
- Bounding box visualization
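For the mouse-control task only one landmark is needed. A minimal sketch, assuming pyautogui and MediaPipe's normalized landmark coordinates:

```python
import pyautogui

SCREEN_W, SCREEN_H = pyautogui.size()

def move_cursor(hand_landmarks):
    """Map the normalized index-fingertip position (MediaPipe landmark 8)
    onto the full screen and move the cursor there."""
    tip = hand_landmarks.landmark[8]   # x and y are in [0, 1] relative to the frame
    pyautogui.moveTo(tip.x * SCREEN_W, tip.y * SCREEN_H)
```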
Integration and Output Stage
- All parallel processing results are integrated in the final "update image" step (sketched below)
- The processed frame is displayed on screen
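The "update image" step can be as simple as drawing both results onto the captured frame before showing it. The sketch below assumes MediaPipe's drawing helper and an ultralytics-style YOLO result exposing `.boxes.xyxy`; the project's actual rendering may differ.

```python
import cv2
import mediapipe as mp

mp_drawing = mp.solutions.drawing_utils
mp_hands = mp.solutions.hands

def update_image(frame, mp_result, yolo_result):
    """Overlay MediaPipe hand landmarks and YOLO bounding boxes on one frame."""
    if mp_result.multi_hand_landmarks:
        for hand in mp_result.multi_hand_landmarks:
            mp_drawing.draw_landmarks(frame, hand, mp_hands.HAND_CONNECTIONS)
    for box in yolo_result.boxes.xyxy:              # assumed ultralytics Results layout
        x1, y1, x2, y2 = map(int, box)
        cv2.rectangle(frame, (x1, y1), (x2, y2), (0, 255, 0), 2)
    cv2.imshow("gesture mouse", frame)
    return frame
```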
Key Features:
- Parallel Processing: MediaPipe and YOLO execute simultaneously for performance optimization
- Multi-tasking: Single MediaPipe result enables simultaneous mouse control and visualization
- Real-time Processing: All results are integrated into one image providing real-time feedback
- Point up: Move the mouse cursor
- Fist: Click the left mouse button (dispatch sketched below)
- Captures hand landmarks and gestures, and visualizes the bounding box and finger joint points
- Tracks fingertip positions
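In code these two mappings reduce to a pair of pyautogui calls. A minimal sketch of the dispatcher; the gesture names are placeholders for the labels the Hagrid-trained detector actually emits:

```python
import pyautogui

def apply_gesture(gesture, fingertip_xy=None):
    """Translate a detected hand pose into a mouse event (placeholder class names)."""
    if gesture == "point_up" and fingertip_xy is not None:
        pyautogui.moveTo(*fingertip_xy)   # cursor follows the index fingertip
    elif gesture == "fist":
        pyautogui.click(button="left")    # left click
```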
Patch Notes
Version 1.0:
- Initial release of gesture-based mouse control system
- Implemented parallel processing of Mediapipe and Hagrid (YOLOv10n) models
- Basic hand gesture recognition for mouse control:
- Point up: Move mouse cursor
- Fist: Left mouse button down
- Open palm: Left mouse button up
- Real-time hand landmark detection and visualization
- Integrated bounding box and finger joint visualization
- Average processing time: approximately 95 ms per frame
Version 1.1:
- Enhanced Visualization System: Updated visualization pipeline for improved rendering efficiency
- Smart Object Detection Skipping: Added intelligent frame skipping for YOLO processing to reduce computational overhead (see the sketch after these notes)
- YOLO Computation Optimization: Implemented additional optimization techniques for YOLOv10n inference
- Frame Rate Improvements: Overall system optimization resulting in better frame rate performance
- Processing Pipeline Refinements: Optimized parallel processing workflow for reduced latency
Performance Impact: These optimizations significantly improve the real-time responsiveness of the gesture control system while maintaining accuracy in hand pose detection and mouse control functionality.
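The frame-skipping optimization mentioned above can be sketched as a thin wrapper that only runs YOLO inference every Nth frame and reuses the most recent result in between. The interval value and call style below are assumptions, not the project's tuned settings.

```python
class SkippingDetector:
    """Wraps a YOLO model so inference only runs every `interval` frames;
    intermediate frames reuse the most recent detection result."""
    def __init__(self, model, interval=3):   # interval of 3 is an assumed value
        self.model = model
        self.interval = interval
        self.frame_idx = 0
        self.last_result = None

    def detect(self, frame):
        if self.frame_idx % self.interval == 0 or self.last_result is None:
            self.last_result = self.model(frame)   # full YOLO inference
        self.frame_idx += 1
        return self.last_result
```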
Models Used:
- Mediapipe - Hand landmark detection
- Hagrid (YOLOv10n) - Object detection backbone for hand poses
