DeepLabV3 Image Segmentation with Foreground-Background Separation and Text Addition

This project demonstrates how to use a pre-trained DeepLabV3 model from the torchvision library for semantic segmentation, separating the foreground and background of an image. Additionally, it allows adding custom text to the background, making it useful for various applications such as image editing or creative content generation.

Features:

Semantic Segmentation: Using a pre-trained DeepLabV3 model with ResNet-101 backbone to segment the image into various categories (e.g., person, animal, background, etc.).
Foreground-Background Separation: The foreground and background of the image are separated based on the segmentation map.
Text Overlay: Customizable text can be added to the background of the image, allowing for further creative manipulation.
Visualization: The original image, segmentation map, foreground, and background with text are displayed using matplotlib for easy visualization.

Requirements:

Python 3.x
PyTorch and torchvision
PIL (Pillow)
matplotlib
NumPy

Installation:

Clone this repository and install the required dependencies:

git clone https://github.com/yourusername/deeplabv3-image-segmentation.git
cd deeplabv3-image-segmentation
pip install -r requirements.txt

Usage:

Make sure to replace the image_path variable with the path to your image.
Run the script to perform segmentation, separate the foreground and background, and add text to the background.
The results (original image, segmentation map, foreground, and background with text) will be displayed using matplotlib.

python segmentation_script.py

Usage for Web Application:

Run the Flask Application:
- Ensure you have all the dependencies installed by running the following command:
```
pip install -r requirements.txt
```
Start the Flask App:
- Replace the image_path and any necessary configurations in the app.py file if required.
- Start the Flask server by running:
```
python app.py
```
- This will start a local web server, and you can access the app in your browser by navigating to http://127.0.0.1:5000/.
Uploading an Image:
- On the web app interface, use the file upload option to select and upload an image.
- The app will process the image, perform segmentation using the pre-trained DeepLabV3 model, separate the foreground and background, and add text to the background.
View Results:
- After uploading, the app will display:
  - Original Image
  - Segmentation Map (color-coded)
  - Foreground Image
  - Background with Text Overlay (in a separate preview section)
Customization:
- You can modify the text that is added to the background through the web interface.
- The output will be displayed dynamically in a preview area.

File Structure:

The project follows a simple structure to keep the code organized:

deeplabv3-image-segmentation/
│
├── app.py                  # Flask application for the web interface
├── segmentation_script.py  # Standalone script for running segmentation
├── static/                 # Static folder for serving uploaded images
│   ├── uploads/            # Folder for storing uploaded images temporarily
│   └── results/            # Folder for saving processed results
├── templates/              # HTML templates for rendering the web interface
│   └── index.html          # Main HTML page for the web app
├── requirements.txt        # List of Python dependencies
├── README.md               # Project documentation
└── LICENSE                 # License file

Detailed File Breakdown:

app.py: This is the main entry point for the web application. It handles routing, processing of images, and rendering the results on the frontend.
segmentation_script.py: A standalone script that can be used to run the segmentation process outside of the web app context. Useful for testing or batch processing.
static/: A folder for serving files such as uploaded images and result images. The uploads/ folder stores the raw images that are uploaded by the user, and results/ stores the output images after processing.
templates/: Contains the HTML files used to render the web interface. The index.html file is the primary interface where users can upload images and view the results.
requirements.txt: Contains the Python dependencies required to run the project, including Flask, PyTorch, torchvision, and others.
README.md: This file, which contains the documentation for the project.

Example Output:

Original Image
Segmentation Map (using a jet color map)
Foreground Image (only the segmented foreground)
Background with Text (background with customizable text added)

License:

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments:

DeepLabV3 model from torchvision.
The idea for separating foreground and background is inspired by image segmentation applications in computer vision.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Figure_1.png		Figure_1.png
README.md		README.md
app.py		app.py
index.html		index.html
part1.py		part1.py
requirements.txt		requirements.txt
segementation_deeplabs.py		segementation_deeplabs.py
test.py		test.py
test_img.png		test_img.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DeepLabV3 Image Segmentation with Foreground-Background Separation and Text Addition

Features:

Requirements:

Installation:

Usage:

Usage for Web Application:

File Structure:

Detailed File Breakdown:

Example Output:

License:

Acknowledgments:

About

Uh oh!

Releases

Packages

Languages

GauravPandit27/DeepLabV3-Image-Segmentation-with-Foreground-Background-Separation-and-Text-Addition

Folders and files

Latest commit

History

Repository files navigation

DeepLabV3 Image Segmentation with Foreground-Background Separation and Text Addition

Features:

Requirements:

Installation:

Usage:

Usage for Web Application:

File Structure:

Detailed File Breakdown:

Example Output:

License:

Acknowledgments:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages