Captcha Recognizer

A powerful and efficient CAPTCHA recognition system built with TensorFlow that can recognize text-based CAPTCHAs without requiring image segmentation. This project provides a complete pipeline from CAPTCHA generation to neural network training and recognition.

🚀 Features

No Image Segmentation Required: Direct end-to-end recognition using deep learning
High Accuracy: Recognition rate scales with the size and quality of the training data (see Performance)
Multi-GPU Support: Train on multiple GPUs for faster training
Flexible Input: Supports both JPG and PNG image formats
Easy Training: Simple pipeline from data preparation to model deployment
Production Ready: Includes evaluation and recognition scripts for real-world use

📋 Requirements

System Requirements

OS: Ubuntu 18.04+ (tested on Ubuntu 18.04)
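Python: 3.10 (TensorFlow 2.10.0 supports Python 3.7 through 3.10, so newer interpreters will not work with the pinned version)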
Memory: Minimum 8GB RAM (16GB+ recommended for large datasets)

Dependencies

Python 3.10
TensorFlow 2.10.0
NumPy 1.23.4
captcha 0.1.1

🛠️ Installation

Option 1: Using pip

pip install -r requirements.txt

Option 2: Manual Installation

# Install TensorFlow
pip install tensorflow==2.10.0

# Install NumPy
pip install numpy==1.23.4

# Install captcha package
pip install captcha==0.1.1

📁 Project Structure

captcha-recognizer/
├── data/
│   ├── train_data/     # Training images
│   ├── valid_data/     # Validation images
│   ├── test_data/      # Test images
│   ├── train.tfrecord  # Training dataset (generated)
│   └── valid.tfrecord  # Validation dataset (generated)
├── captcha_gen_default.py      # CAPTCHA generation
├── captcha_records.py          # Dataset conversion
├── captcha_train.py            # Single GPU training
├── captcha_multi_gpu_train.py  # Multi-GPU training
├── captcha_eval.py             # Model evaluation
├── captcha_recognize.py        # CAPTCHA recognition
├── config.py                   # Model and training configuration
├── model.py                    # Neural network architecture
├── trainer.py                  # Training logic
├── predictor.py                # Prediction interface
├── test_basic.py               # Basic tests
└── requirements.txt            # Python dependencies

🚀 Quick Start

1. Prepare Training Data

Place your CAPTCHA images in the appropriate directories:

Training: data/train_data/ - for model training
Validation: data/valid_data/ - for model evaluation
Testing: data/test_data/ - for recognition testing

Image Requirements:

Format: JPG or PNG
Naming: <label>_*.jpg or <label>_*.png, where the text before the first underscore is the CAPTCHA's text (e.g., WFPMX_num552.png has the label WFPMX)
Size: Recommended 128x48 pixels
Content: Text-based CAPTCHAs
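
The naming rule above can be checked before conversion. The following is a minimal sketch, not code from this repository: the helper name label_from_filename is hypothetical, and it assumes the label is everything before the first underscore.

```python
from pathlib import Path

# Formats listed in the image requirements above.
ALLOWED_SUFFIXES = {".jpg", ".png"}


def label_from_filename(filename):
    """Return the text before the first underscore, or None if the name does not fit.

    Hypothetical helper: it assumes names of the form <label>_<anything>.jpg/.png.
    """
    path = Path(filename)
    if path.suffix.lower() not in ALLOWED_SUFFIXES:
        return None
    label, separator, rest = path.stem.partition("_")
    # Require a non-empty label, an underscore, and something after it.
    if not label or not separator or not rest:
        return None
    return label


if __name__ == "__main__":
    for name in ["WFPMX_num552.png", "QUDKM_num468.png", "notes.txt"]:
        print(name, "->", label_from_filename(name))
```

Running a check like this over data/train_data/ before conversion makes misnamed files visible early instead of producing wrong labels silently.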

Or use the built-in generator:

python captcha_gen_default.py
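
For readers who want to see roughly what generation involves, here is a hedged sketch using the captcha package's ImageCaptcha class. It is not the contents of captcha_gen_default.py. The character set, the label length of 5, and the <label>_num<index>.png file pattern are assumptions inferred from the sample output in this README.

```python
import random
import string
from pathlib import Path

CHARSET = string.ascii_uppercase  # assumption: uppercase letters, as in 'WFPMX'
LABEL_LENGTH = 5                  # assumption: matches the sample labels


def random_label(rng, length=LABEL_LENGTH, charset=CHARSET):
    """Draw a random label of the given length from the character set."""
    return "".join(rng.choice(charset) for _ in range(length))


def output_name(label, index):
    """Build a file name whose prefix before the underscore is the label."""
    return f"{label}_num{index}.png"


def generate(out_dir, count, seed=0, width=128, height=48):
    """Write `count` CAPTCHA images into `out_dir` and return the count."""
    # Imported here so the helpers above can be used without the package.
    from captcha.image import ImageCaptcha

    rng = random.Random(seed)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    generator = ImageCaptcha(width=width, height=height)
    for index in range(count):
        label = random_label(rng)
        generator.write(label, str(out_dir / output_name(label, index)))
    return count
```

Calling generate("data/train_data", 1000) would fill the training directory with 128x48 images. Using a different seed for the validation and test sets avoids repeating the same label sequence across them.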

2. Convert Dataset to TFRecords

Convert your images to TensorFlow's efficient TFRecord format:

python captcha_records.py

This creates:

data/train.tfrecord - Training dataset
data/valid.tfrecord - Validation dataset
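
The conversion step can be pictured with the sketch below. It is an illustration under stated assumptions, not the logic of captcha_records.py: the feature keys "image" and "label", the character set, and the helper names are all hypothetical.

```python
import string
from pathlib import Path

# Assumption: the model recognizes uppercase letters and digits.
CHARSET = string.ascii_uppercase + string.digits


def encode_label(label, charset=CHARSET):
    """Map each character to its index in the charset (ValueError if unknown)."""
    return [charset.index(ch) for ch in label]


def write_tfrecord(image_dir, out_path, charset=CHARSET):
    """Serialize every JPG/PNG in image_dir into one TFRecord file."""
    # Imported here so encode_label can be used without TensorFlow installed.
    import tensorflow as tf

    paths = sorted(
        p for p in Path(image_dir).iterdir() if p.suffix.lower() in {".jpg", ".png"}
    )
    with tf.io.TFRecordWriter(str(out_path)) as writer:
        for path in paths:
            label = path.stem.split("_", 1)[0]  # text before the first underscore
            features = tf.train.Features(feature={
                # Raw encoded image bytes; decoding happens at read time.
                "image": tf.train.Feature(
                    bytes_list=tf.train.BytesList(value=[path.read_bytes()])),
                # One integer class index per character.
                "label": tf.train.Feature(
                    int64_list=tf.train.Int64List(value=encode_label(label, charset))),
            })
            writer.write(tf.train.Example(features=features).SerializeToString())
    return len(paths)
```

Storing the encoded bytes rather than decoded pixels keeps the record files small; the trade-off is that each image is decoded again on every pass through the data.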

3. Train the Model

Single GPU Training:

python captcha_train.py

Multi-GPU Training (faster):

python captcha_multi_gpu_train.py

Training Tips:

Accuracy improves with larger training datasets
More training steps generally yield better results
Monitor validation accuracy to prevent overfitting

4. Evaluate Model Performance

Test your model's accuracy on the validation set:

python captcha_eval.py
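
Which accuracy captcha_eval.py reports is not stated here. Two common measures are sketched below with hypothetical function names: whole-CAPTCHA accuracy counts a prediction only if every character matches, while per-character accuracy gives partial credit.

```python
def captcha_accuracy(predictions, labels):
    """Fraction of CAPTCHAs whose predicted string matches the label exactly."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        return 0.0
    exact = sum(pred == true for pred, true in zip(predictions, labels))
    return exact / len(labels)


def character_accuracy(predictions, labels):
    """Fraction of label characters predicted correctly, position by position."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    total = sum(len(true) for true in labels)
    if total == 0:
        return 0.0
    correct = sum(
        pred_ch == true_ch
        for pred, true in zip(predictions, labels)
        for pred_ch, true_ch in zip(pred, true)
    )
    return correct / total
```

Whole-CAPTCHA accuracy is the stricter number and the one that matters in practice, since a single wrong character makes the answer unusable.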

5. Recognize CAPTCHAs

Use your trained model to recognize new CAPTCHAs:

python captcha_recognize.py

Example Output:

image WFPMX_num552.png recognize ----> 'WFPMX'
image QUDKM_num468.png recognize ----> 'QUDKM'
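
How the model turns its output into text is not documented here. A common approach for segmentation-free recognition, assumed in this sketch, is to emit one score vector per character position and pick the best-scoring character at each position. The function names and the score layout are hypothetical.

```python
def decode(scores, charset):
    """Pick the highest-scoring character at each position.

    scores: a list with one entry per character position, each entry being a
    list of len(charset) scores (assumed layout, not confirmed by the project).
    """
    chars = []
    for row in scores:
        best_index = max(range(len(row)), key=row.__getitem__)
        chars.append(charset[best_index])
    return "".join(chars)


def format_result(filename, text):
    """Format one result line in the style of the example output above."""
    return f"image {filename} recognize ----> '{text}'"


if __name__ == "__main__":
    # Illustrative scores for a two-character CAPTCHA over the charset "ABC".
    demo_scores = [[0.1, 0.7, 0.2], [0.9, 0.05, 0.05]]
    print(format_result("demo.png", decode(demo_scores, "ABC")))
```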

🔧 Configuration

Model Parameters

The neural network architecture and training parameters can be customized in config.py:

Input dimensions: Image width and height
Character set: Supported characters for recognition
Network architecture: Layer sizes and activation functions
Training parameters: Learning rate, batch size, epochs
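
The real contents of config.py are not reproduced in this README. As an illustration of how the four parameter groups above could be collected in one place, here is a hypothetical sketch. Every field name and default is an assumption, except the 128x48 image size, which is the recommended size given earlier.

```python
import string
from dataclasses import dataclass


@dataclass(frozen=True)
class CaptchaConfig:
    """Hypothetical settings container; names and defaults are illustrative."""

    # Input dimensions (128x48 is the size recommended in this README).
    image_width: int = 128
    image_height: int = 48
    # Character set and label length (assumed values).
    charset: str = string.ascii_uppercase + string.digits
    label_length: int = 5
    # Training parameters (assumed values).
    learning_rate: float = 1e-3
    batch_size: int = 64
    epochs: int = 20

    @property
    def num_classes(self):
        """Number of distinct characters the model can output per position."""
        return len(self.charset)

    @property
    def output_units(self):
        """Size of a flat output layer with one class block per position."""
        return self.label_length * self.num_classes
```

Deriving num_classes from the character set means that changing the supported characters only needs one edit, which reduces the chance of a mismatch between labels and the output layer.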

Training Configuration

Adjust training parameters in the training scripts:

Batch size: Adjust based on available GPU memory
Learning rate: Start with default and tune as needed
Epochs: More epochs can improve accuracy; stop once validation accuracy stops improving

📊 Performance

Accuracy Factors

Training Data Size: Larger datasets improve accuracy
Data Quality: Clean, diverse CAPTCHAs work better
Training Steps: More iterations generally help
Model Architecture: Optimized for CAPTCHA recognition

Optimization Tips

Use data augmentation for better generalization
Implement early stopping to prevent overfitting
Experiment with different learning rates
Consider ensemble methods for production use
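
Early stopping, mentioned in the tips above, can be sketched in a few lines. This is a generic patience-based version, not code from trainer.py; the class name and its parameters are hypothetical.

```python
class EarlyStopper:
    """Signal a stop when validation accuracy has not improved for `patience` checks."""

    def __init__(self, patience=3, min_delta=0.0):
        self.patience = patience      # checks to tolerate without improvement
        self.min_delta = min_delta    # smallest gain that counts as improvement
        self.best = float("-inf")     # best validation accuracy seen so far
        self.bad_rounds = 0           # consecutive checks without improvement

    def should_stop(self, val_accuracy):
        """Record one validation result and report whether training should stop."""
        if val_accuracy > self.best + self.min_delta:
            self.best = val_accuracy
            self.bad_rounds = 0
        else:
            self.bad_rounds += 1
        return self.bad_rounds >= self.patience


if __name__ == "__main__":
    # Made-up validation accuracies, for illustration only.
    stopper = EarlyStopper(patience=2)
    for epoch, accuracy in enumerate([0.50, 0.60, 0.60, 0.59], start=1):
        if stopper.should_stop(accuracy):
            print(f"stopping after epoch {epoch}, best accuracy {stopper.best}")
            break
```

In a training loop, should_stop would be called once per validation pass, and the model weights from the best check would normally be saved alongside it.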

🤝 Contributing

We welcome contributions! Here's how you can help:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Development Setup

# Clone the repository
git clone https://github.com/yourusername/captcha-recognizer.git
cd captcha-recognizer

# Install development dependencies
pip install -r requirements.txt

# Run tests
python test_basic.py

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

CAPTCHA Generator: Gregwar/CaptchaBundle for sample CAPTCHA generation
TensorFlow: For the deep learning framework
Community: All contributors and users of this project

📞 Support

Issues: GitHub Issues
Discussions: GitHub Discussions
Wiki: Project Wiki

Made with ❤️ for the open-source community

If you find this project useful, please consider giving it a ⭐ star!

Repository: SecurityEnthusiast/captcha-recognizer
