YOLO-Object-Detection-Voice

📸 Voice-Enabled Smart Vision Backend

This backend is designed to enhance accessibility for visually impaired users. Built with FastAPI and integrated with a modular camera pipeline, it captures live frames, detects objects with a deep learning model, and narrates the results aloud using pyttsx3.

🔧 Features

  • Real-Time Object Detection: seamless camera-to-backend pipeline optimized for mobile and desktop environments.
  • Voice Narration Engine: converts object labels into spoken feedback using a threaded TTS system (a sketch follows this list).
  • Modular Architecture: clean separation of the capture, analysis, and narration layers for rapid iteration and scalability.
  • Accessibility-First Design: prioritizes low-latency feedback and compatibility with assistive technologies.
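
As a minimal sketch, here is one way the threaded TTS layer could be built around pyttsx3, using a queue-backed daemon worker; the `VoiceNarrator` class and its method names are illustrative assumptions, not code taken from this repository:

```python
# Hypothetical sketch of a threaded narration layer: a daemon worker drains a
# queue of detected labels and speaks them with pyttsx3, so the detection loop
# is never blocked while audio is playing.
import queue
import threading

import pyttsx3


class VoiceNarrator:
    def __init__(self):
        self._labels = queue.Queue()
        worker = threading.Thread(target=self._run, daemon=True)
        worker.start()

    def narrate(self, label: str) -> None:
        """Queue a label for narration without blocking the caller."""
        self._labels.put(label)

    def _run(self) -> None:
        engine = pyttsx3.init()  # keep the TTS engine on the worker thread
        while True:
            label = self._labels.get()
            engine.say(f"I see a {label}")
            engine.runAndWait()  # blocks only this worker thread
```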

🧠 Tech Stack

| Layer            | Tools & Frameworks                        |
|------------------|-------------------------------------------|
| Backend          | FastAPI, Python                           |
| TTS Engine       | pyttsx3                                   |
| Object Detection | OpenCV, TensorFlow/PyTorch (customizable) |
| Deployment       | Docker, Uvicorn, Gunicorn                 |
| Monitoring       | PostHog / Mixpanel (optional)             |
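
For illustration, the detection layer could pair OpenCV frame capture with a YOLO model. The sketch below assumes the Ultralytics PyTorch implementation and uses placeholder values for the weights file, camera index, and confidence threshold; none of these are taken from this repository:

```python
# Illustrative capture-and-detect step using OpenCV plus an Ultralytics YOLO
# model (one possible PyTorch backend; the stack above is customizable).
import cv2
from ultralytics import YOLO

model = YOLO("yolov8n.pt")  # placeholder weights; any YOLO checkpoint works


def detect_labels(camera_index: int = 0, conf: float = 0.5) -> list:
    """Grab a single frame from the camera and return detected class names."""
    cap = cv2.VideoCapture(camera_index)
    ok, frame = cap.read()
    cap.release()
    if not ok:
        return []  # camera unavailable or frame grab failed
    results = model(frame, conf=conf, verbose=False)
    return [model.names[int(box.cls)] for box in results[0].boxes]
```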

🚀 Getting Started

    git clone https://github.com/your-username/vss-backend.git
    cd vss-backend
    pip install -r requirements.txt
    uvicorn main:app --reload
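
The `uvicorn main:app --reload` command implies a `main.py` exposing a FastAPI `app`. Below is a minimal sketch of how the pieces above could be wired together; the `/detect` route and the hypothetical `detection` and `narration` modules are assumptions for illustration, not the repository's actual layout:

```python
# main.py: minimal FastAPI wiring of capture -> detection -> narration,
# matching the `uvicorn main:app --reload` command above.
from fastapi import FastAPI

from detection import detect_labels   # hypothetical module from the sketch above
from narration import VoiceNarrator   # hypothetical module from the sketch above

app = FastAPI(title="Voice-Enabled Smart Vision Backend")
narrator = VoiceNarrator()  # background TTS worker, started once at startup


@app.get("/detect")
def detect() -> dict:
    """Capture one frame, detect objects, narrate and return the labels."""
    labels = detect_labels()      # OpenCV capture + YOLO inference
    for label in labels:
        narrator.narrate(label)   # non-blocking: queued for the TTS thread
    return {"labels": labels}
```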

📌 Roadmap

  • Thread-safe voice narration
  • Mobile-compatible camera capture
  • Multi-language narration support
  • Integration with Bharat Explorer frontend
  • Offline mode for low-connectivity regions
