
EdgeNet uses Convolutional Neural Networks (CNNs) to learn and apply classical image filters like Sobel, Laplacian, and Prewitt. Built with a U-Net architecture, this project demonstrates how CNNs can replicate traditional edge-detection techniques for image processing.


EdgeNet: Image Filters with CNNs

This repository contains my submission for the "What The Hack" ML hackathon organized by the Technical Council at IIT Gandhinagar on 1 September 2024.

  • Name: Shardul Junagade
  • Batch: BTech CSE '23

Link to Kaggle Notebooks:

Problem Statement:

Develop a neural network that learns to apply the Sobel filter to images.

Tasks:

  1. Dataset Preparation:
    • Use the provided dataset of images.
    • Apply the Sobel filter to the dataset using a standard image processing library.
    • Save the original and Sobel-filtered image pairs.
  2. Model Development: Design a neural network that takes an original image as input and produces a Sobel-filtered image as output.
  3. Training: Train your model using the prepared dataset.
  4. Evaluation: Evaluate your model's performance on a provided test set.

Bonus Tasks: Extend your project by applying the same approach to other classical image filters such as the Laplacian Filter and Prewitt Filter. Additionally, visualize and compare representations of different CNN layers as images. Explore various CNN architectures to gain deeper insights into their effects and performance. This will demonstrate your ability to generalize and analyze the model's behavior across different scenarios.

Installation and Usage

  1. Clone: Clone the repository to your local machine.
    git clone "https://github.com/ShardulJunagade/WhatTheHack-ML-Hackathon.git"
    cd "WhatTheHack-ML-Hackathon"
  2. Install Dependencies: Install the required libraries specified in the requirements.txt file.
    pip install -r requirements.txt
  3. Download the natural images dataset from Kaggle.
  4. Run preprocessing.ipynb to create the train and test datasets.
  5. Open the Notebook: Start Jupyter Notebook (or another compatible environment) and open the Sobel Filter (LR).ipynb file to run the code for the Sobel filter.
  6. The code for the Laplacian and Prewitt filters is in the Sobel, Laplacian, Prewitt Filter NN.ipynb file.

Project Overview

This project involves developing a Convolutional Neural Network (CNN) that learns to apply the Sobel filter to images. The Sobel filter is a classical image processing filter used for edge detection. The core aim of this project is to demonstrate how CNNs can learn to approximate such traditional filters. The network is based on the U-Net architecture and is trained to predict the output of a Sobel filter applied to images. The model is implemented in TensorFlow and Keras, and this project includes options for using a learning rate scheduler, model evaluation metrics, and sample visualizations.

The project also explores other classical filters like Laplacian and Prewitt filters in the bonus section.

The complete code can be found in the Sobel Filter (LR).ipynb file. All the preprocessing and the splitting of data into train and test sets is done in the preprocessing.ipynb file.

The project is divided into several key tasks:

  1. Dataset Preparation
  2. Model Development
  3. Training with a learning rate scheduler
  4. Training without a learning rate scheduler
  5. Evaluating the model

The graphical comparisons between the two cases can be found in the lr_scheduler_comparison.ipynb file.

Dataset Preparation

For this task, the dataset consists of grayscale images on which the Sobel filter is applied to create a ground truth for edge detection. The goal is for the model to learn how to apply the Sobel filter purely from training data.

  • The raw images are loaded and converted to grayscale (if necessary).
  • The Sobel filter is applied using a standard image processing library (OpenCV) to create the target outputs.
    import cv2
    import numpy as np

    def apply_sobel_filter(image):
        sobel_x = cv2.Sobel(image, cv2.CV_64F, 1, 0, ksize=3)
        sobel_y = cv2.Sobel(image, cv2.CV_64F, 0, 1, ksize=3)
        # Gradient magnitude, clipped to [0, 255] to avoid uint8 overflow
        sobel = np.sqrt(sobel_x**2 + sobel_y**2)
        return np.uint8(np.clip(sobel, 0, 255))
  • The dataset is split into training, validation, and test sets for evaluation purposes.
  • These datasets are saved in the Saved_Datasets folder in .npy format.
    • X_train and X_test: Original images
    • y_train and y_test: Ground truth Sobel-filtered images
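
The snippet below is a minimal sketch of this split-and-save step (the 80/20 split ratio and file paths are illustrative assumptions; the actual preprocessing lives in preprocessing.ipynb):

import numpy as np
from sklearn.model_selection import train_test_split

# `images` holds the original grayscale images and `targets` the Sobel-filtered
# ground truth, both assumed to be float arrays of shape (N, 256, 256, 1) in [0, 1].
X_train, X_test, y_train, y_test = train_test_split(
    images, targets, test_size=0.2, random_state=42)

np.save('Saved_Datasets/X_train.npy', X_train)
np.save('Saved_Datasets/y_train.npy', y_train)
np.save('Saved_Datasets/X_test.npy', X_test)
np.save('Saved_Datasets/y_test.npy', y_test)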

Model Architecture

The model is built using the U-Net architecture, which is commonly used for image segmentation tasks. It includes:

  • Encoder: Consists of convolutional layers with an increasing number of filters, followed by max-pooling layers.
  • Bottleneck: A set of convolutional layers that forms the latent representation.
  • Decoder: Consists of transpose convolutional layers that upsample the feature maps, followed by concatenation with skip connections from the encoder.

The final output is a single-channel image representing the predicted Sobel-filtered image.

inputs = layers.Input(shape=(256, 256, 1))
c1 = layers.Conv2D(64, (3, 3), activation='relu', padding='same')(inputs)
...
outputs = layers.Conv2D(1, (1, 1), activation='sigmoid', padding='same')(c5)
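
A compact U-Net-style sketch consistent with the description above (the number of encoder/decoder blocks and the filter counts here are assumptions; the full architecture is in the notebook):

from tensorflow.keras import layers, models

inputs = layers.Input(shape=(256, 256, 1))

# Encoder: convolution blocks followed by max-pooling
c1 = layers.Conv2D(64, (3, 3), activation='relu', padding='same')(inputs)
p1 = layers.MaxPooling2D((2, 2))(c1)
c2 = layers.Conv2D(128, (3, 3), activation='relu', padding='same')(p1)
p2 = layers.MaxPooling2D((2, 2))(c2)

# Bottleneck: the latent representation
c3 = layers.Conv2D(256, (3, 3), activation='relu', padding='same')(p2)

# Decoder: transpose convolutions upsample, skip connections are concatenated
u4 = layers.Conv2DTranspose(128, (2, 2), strides=(2, 2), padding='same')(c3)
u4 = layers.concatenate([u4, c2])
c4 = layers.Conv2D(128, (3, 3), activation='relu', padding='same')(u4)
u5 = layers.Conv2DTranspose(64, (2, 2), strides=(2, 2), padding='same')(c4)
u5 = layers.concatenate([u5, c1])
c5 = layers.Conv2D(64, (3, 3), activation='relu', padding='same')(u5)

# Single-channel output, matching targets scaled to [0, 1]
outputs = layers.Conv2D(1, (1, 1), activation='sigmoid', padding='same')(c5)
model = models.Model(inputs, outputs)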

Training the Model

The model is compiled using the Adam optimizer and trained using Mean Squared Error (MSE) loss. The optional learning rate scheduler reduces the learning rate after 5 epochs to improve convergence.
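
A minimal sketch of the compile-and-fit step (the batch size and validation split are assumptions; the exact hyperparameters are in the notebook):

model.compile(optimizer='adam', loss='mse', metrics=['mae'])

history = model.fit(
    X_train, y_train,
    validation_split=0.1,  # assumed validation fraction
    epochs=10,             # 10 epochs, as reported in the Results section
    batch_size=16)         # assumed batch size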

Learning Rate Scheduler

An optional learning rate scheduler is implemented that keeps the learning rate constant for the first 5 epochs and then decays it exponentially.

import tensorflow as tf

def lr_scheduler(epoch, lr):
    # Keep the initial learning rate for the first 5 epochs,
    # then decay it by a factor of exp(-0.1) per epoch.
    if epoch < 5:
        return lr
    else:
        return lr * tf.math.exp(-0.1)
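
The function is passed to training through a Keras callback (a sketch, reusing the fit call shown above):

lr_callback = tf.keras.callbacks.LearningRateScheduler(lr_scheduler)
history = model.fit(X_train, y_train, validation_split=0.1,
                    epochs=10, batch_size=16, callbacks=[lr_callback])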

Evaluation

The model is evaluated using several metrics:

  • Mean Squared Error (MSE): Measures the average squared difference between predicted and true values.
  • Mean Absolute Error (MAE): Measures the average absolute difference between predicted and true values.
  • Structural Similarity Index (SSIM): Measures the structural similarity between the predicted and ground-truth images.
  • Custom Accuracy: A pixel-wise accuracy metric that counts how many pixels in the predicted image are within a specified threshold from the true values.
def custom_accuracy(y_true, y_pred, threshold=0.1):
    # Fraction of pixels whose absolute error is within `threshold`
    # (images are assumed to be scaled to [0, 1]).
    y_true = tf.cast(y_true, tf.float32)
    y_pred = tf.cast(y_pred, tf.float32)
    diff = tf.abs(y_true - y_pred)
    correct_pixels = tf.less_equal(diff, threshold)
    accuracy = tf.reduce_mean(tf.cast(correct_pixels, tf.float32))
    return accuracy
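
One way to compute these metrics on the test set (a sketch, assuming the trained model and the test arrays from Saved_Datasets, with pixel values in [0, 1]):

y_pred = model.predict(X_test)
mse = tf.reduce_mean(tf.square(y_test - y_pred))
mae = tf.reduce_mean(tf.abs(y_test - y_pred))
ssim = tf.reduce_mean(tf.image.ssim(tf.cast(y_test, tf.float32),
                                    tf.cast(y_pred, tf.float32), max_val=1.0))
acc = custom_accuracy(y_test, y_pred)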

Results

After training for 10 epochs, the model achieved the following performance:

| Model                           | MSE (Test) | MAE (Test) | Custom Accuracy (%) | SSIM (Test) |
|---------------------------------|------------|------------|---------------------|-------------|
| With Learning Rate Scheduler    | 0.0065     | 0.0364     | 92.9                | 0.91        |
| Without Learning Rate Scheduler | 0.0064     | 0.0338     | 93.47               | 0.916       |

Conclusion

This project demonstrates how a CNN can learn to approximate the Sobel filter, a classical edge detection algorithm. The results show that the network can effectively mimic the Sobel filter's functionality, providing insight into how CNNs can generalize traditional image processing techniques.

Bonus Tasks

The bonus tasks have been implemented in the Sobel, Laplacian, Prewitt Filter NN.ipynb file. Refer to Bonus.md for more details regarding the bonus tasks.
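
For reference, one way the Laplacian- and Prewitt-filtered targets can be generated with OpenCV (a sketch; the kernel size and the hand-built Prewitt kernels are assumptions, since OpenCV has no built-in Prewitt operator):

import cv2
import numpy as np

def apply_laplacian_filter(image):
    # Second-derivative edge map; ksize=3 is an assumed default
    lap = cv2.Laplacian(image, cv2.CV_64F, ksize=3)
    return np.uint8(np.clip(np.abs(lap), 0, 255))

def apply_prewitt_filter(image):
    # 3x3 Prewitt kernels for horizontal and vertical gradients
    kx = np.array([[1, 0, -1], [1, 0, -1], [1, 0, -1]], dtype=np.float64)
    ky = kx.T
    gx = cv2.filter2D(image, cv2.CV_64F, kx)
    gy = cv2.filter2D(image, cv2.CV_64F, ky)
    return np.uint8(np.clip(np.sqrt(gx**2 + gy**2), 0, 255))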

Acknowledgments

This project was created as part of a hackathon submission. Thanks to the Technical Council of IITGN (hackathon organizers) for providing me with the opportunity to work on this project.

License

This project is licensed under the MIT License. See the LICENSE file for details.
