American Sign Language to Text Converter

Computer-vision prototype that recognizes static ASL alphabet gestures from a webcam and assembles them into text.

Problem Statement

Static hand-sign recognition requires more than image classification: the system must isolate the hand, distinguish visually similar letters, stabilize predictions over time, and convert a stream of letters into readable words.

Architecture

flowchart LR
    CAM[Webcam Frame] --> ROI[Hand Region of Interest]
    ROI --> PRE[Grayscale, Gaussian Blur, Adaptive Threshold]
    PRE --> BASE[27-class CNN]
    BASE --> DISAMBIG[Specialist CNNs for Similar Letters]
    DISAMBIG --> STABLE[Temporal Vote Stabilization]
    STABLE --> TEXT[Character, Word, Sentence]
    TEXT --> SPELL[Dictionary Suggestions]

Model Design

Base convolutional network: 842,107 trainable parameters across 27 classes, including blank.
Specialist classifiers resolve visually similar groups: D/R/U, D/I/K/T, and M/N/S.
A temporal counter accepts a character only after repeated stable predictions.
pyenchant provides word-completion suggestions in the desktop interface.

Tech Stack

Python, TensorFlow/Keras
OpenCV image preprocessing
Tkinter desktop interface
Pillow and PyEnchant
Jupyter notebooks for data preparation and training

Repository Structure

Main.ipynb          Training and model analysis
create_data.ipynb   Dataset preparation
app2.py             Real-time recognition interface
camera.py           Webcam capture utility
model/              Serialized model architectures and weights
tests/              Import and repository smoke checks

Setup

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
python app2.py

The application requires a webcam and an English dictionary available to PyEnchant.

Evaluation

The repository includes the trained models and training notebook, but it does not contain a final reproducible test-set accuracy report. The honest next step is to add a fixed test split, per-class precision/recall, confusion matrix, and latency benchmark before comparing this system with modern approaches.

Privacy and Responsible Use

Webcam frames are processed locally by the desktop application.
No biometric identity inference is performed.
This prototype recognizes a constrained static alphabet; it is not a complete sign-language translation system.
Real accessibility use requires signer-led evaluation across lighting, skin tones, hand shapes, motion, and regional language variation.

Future Improvements

Replace hand-crafted thresholding with robust hand landmark detection
Add dynamic gestures and sequence modeling
Quantize the model for lower-latency edge inference
Publish a reproducible evaluation dataset and model card
Test with ASL users and accessibility specialists

Attribution

Dataset provenance is recorded in Dataset link. Review the source license before redistributing training data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

American Sign Language to Text Converter

Problem Statement

Architecture

Model Design

Tech Stack

Repository Structure

Setup

Evaluation

Privacy and Responsible Use

Future Improvements

Attribution

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/workflows		.github/workflows
model		model
tests		tests
.gitignore		.gitignore
Dataset link		Dataset link
Main.ipynb		Main.ipynb
README.md		README.md
app2.py		app2.py
camera.py		camera.py
create_data.ipynb		create_data.ipynb
pyproject.toml		pyproject.toml
requirements-ci.txt		requirements-ci.txt
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

American Sign Language to Text Converter

Problem Statement

Architecture

Model Design

Tech Stack

Repository Structure

Setup

Evaluation

Privacy and Responsible Use

Future Improvements

Attribution

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages