Gesture Once

Overview

Gesture Once (p.k.a. Learn-ASL) is a machine learning project designed to recognize American Sign Language (ASL) gestures and translate them into text using the YOLOv8 object detection model and MediaPipe for hand landmark detection. This project aims to bridge communication gaps for ASL users by providing an educational and efficient sign-to-text conversion service for learning basic ASL, including the alphabet and many common phrases in ASL.

Features

Object Detection with YOLOv8: Recognizes ASL letters and gestures from a live video feed. Hand Landmarks with MediaPipe: Enhances gesture recognition by aligning bounding boxes to hand landmarks. Gesture Logging: Logs the highest predicted gesture with confidence scores to a text file for debugging and potential user interfaces.

Demo

IMPORTANT NOTES

Although it'd be awesome to have this deployed so others can freely test the model, deploying it will be computationally expensive, and users may run into network issues regardless. However, setting this up locally is extremely easy! Instructions are available below.

Tools and Libraries

Use Cases:

Real-Time Conversion

The system can be engineered to detect ASL gestures using a camera which converts sign language into text in real-time. This would enable deaf individuals to communicate more easily with those who do not understand ASL. Additionally, the system can be extended to translate ASL videos into text for users who do not know ASL.

Educational and Training Purposes:

The system can serve as a learning platform for users who are practicing sign langauge. It can be developed so that it can evaluate a user’s sign language accuracy and provide instant feedback.

Setting Up

NOTE

Make sure Python is installed.

Installation

Install the necessary Python packages

pip install -r requirements.txt

Change directory to the frontend and install necessary dependencies

cd frontend/
npm install

Run the client

npm run dev

Change directory to the backend and run the server that serves the YOLOv8 model

cd ..
cd backend/
python model_api.py

Start signing!

Future Work

Interface Development: Build a GUI for real-time gesture-to-text translation. Dataset Expansion: Incorporate more ASL gestures for robust recognition. Performance Optimization: Optimize logging and frame processing speed.

Dataset

Dataset Link: ASL Letters Dataset

Contributors

Jay Noppone Pornpitaksuk, Claudio Perinuzzi, Loyd Flores, Kenneth Guillont

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
backend		backend
frontend		frontend
runs/detect		runs/detect
.gitignore		.gitignore
README.md		README.md
gesture_once.gif		gesture_once.gif
requirements.txt		requirements.txt
train.py		train.py
yolov8n.pt		yolov8n.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Gesture Once

Overview

Features

Demo

IMPORTANT NOTES

Tools and Libraries

Use Cases:

Real-Time Conversion

Educational and Training Purposes:

Setting Up

NOTE

Installation

Future Work

Dataset

Contributors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

jaynopponep/Gesture-Once

Folders and files

Latest commit

History

Repository files navigation

Gesture Once

Overview

Features

Demo

IMPORTANT NOTES

Tools and Libraries

Use Cases:

Real-Time Conversion

Educational and Training Purposes:

Setting Up

NOTE

Installation

Future Work

Dataset

Contributors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages