Aryan Anand Email: [email protected]
Yatin Raju Email: [email protected]
Sri Satishkumar Email: [email protected]
Natural disasters cause widespread damage, making rapid assessment essential for response and recovery. Traditional manual inspection of satellite imagery is slow and resource‑intensive.
This project introduces RECAP, a two‑stage deep‑learning pipeline for Rapid Event-level Classification of Affected Properties using pre‑ and post‑disaster satellite images from the xBD dataset.
Our pipeline consists of:
- **Building Segmentation Model**: an FCN‑ResNet50 network extracts building footprints from disaster imagery.
- **Polygon‑aware Siamese Damage Classifier**: a ResNet‑34 Siamese encoder learns changes between pre/post images and assigns each building one of four xBD damage categories:
  - No Damage
  - Minor Damage
  - Major Damage
  - Destroyed
- We benchmark model performance and provide research‑ready evaluation plots (confusion matrix, ROC/PR curves, calibration, etc.).

This study contributes a reproducible, end‑to‑end framework capable of scaling to real‑world disaster‑response workflows.
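To make stage two concrete, here is a minimal sketch of a shared‑encoder Siamese classifier, assuming ImageNet‑pretrained ResNet‑34 features fused by simple differencing (the fusion strategy and layer choices are illustrative, not the exact design in model_polygon_siamese.py):

```python
import torch
import torch.nn as nn
from torchvision.models import resnet34

class SiameseDamageClassifier(nn.Module):
    """Shared-weight ResNet-34 encoder over pre/post crops, 4 damage classes."""

    def __init__(self, num_classes: int = 4):
        super().__init__()
        backbone = resnet34(weights="IMAGENET1K_V1")
        # Drop the ImageNet classification head; keep conv stages + avgpool.
        self.encoder = nn.Sequential(*list(backbone.children())[:-1])
        self.head = nn.Linear(512, num_classes)

    def forward(self, pre: torch.Tensor, post: torch.Tensor) -> torch.Tensor:
        f_pre = self.encoder(pre).flatten(1)    # (B, 512)
        f_post = self.encoder(post).flatten(1)  # (B, 512)
        # Because both epochs pass through the same weights, the feature
        # difference isolates what the disaster changed.
        return self.head(f_post - f_pre)

model = SiameseDamageClassifier()
logits = model(torch.randn(2, 3, 224, 224), torch.randn(2, 3, 224, 224))
print(logits.shape)  # torch.Size([2, 4])
```

Sharing weights halves the parameter count relative to two independent encoders and forces pre‑ and post‑disaster features into the same embedding space.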
Disaster‑response teams need fast, reliable methods to analyze satellite images and estimate building damage. The challenge lies in:
- Variability across disaster types (floods, hurricanes, wildfires)
- Inconsistent lighting, shadows, and viewing angles
- Dense urban environments requiring precise building extraction
- Highly imbalanced damage categories
Our goals are to:

- Reduce human workload in large‑scale disaster sites
- Provide consistent, automated, and explainable predictions
- Explore deep‑learning strategies for polygon‑level classification
- Deliver tools that can visualize results on an interactive map
Key technical challenges include:

- Noisy or incomplete building outlines
- Domain shift across geographic regions
- Large dataset size (~40k high‑res tiles)
- Aggregating pixel‑level features into building polygons
- Balancing accuracy across rare classes (e.g., major damage)
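For the last challenge, one standard remedy is to weight the loss inversely to class frequency; a minimal sketch with invented counts (not necessarily the scheme used by the training scripts):

```python
import torch
import torch.nn as nn
from collections import Counter

# Hypothetical per-building labels: 0=no, 1=minor, 2=major, 3=destroyed.
labels = [0] * 900 + [1] * 60 + [2] * 25 + [3] * 15

counts = Counter(labels)
total, num_classes = len(labels), 4
# Inverse-frequency weights, normalized so the average weight is 1.
weights = torch.tensor(
    [total / (num_classes * counts[c]) for c in range(num_classes)],
    dtype=torch.float32,
)
criterion = nn.CrossEntropyLoss(weight=weights)
print(weights)  # rare classes (major/destroyed) receive the largest weights
```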
This project builds on concepts from prior segmentation and change‑detection research but introduces a unique, fully reproducible workflow:
- ✔ A two‑stage vision system combining semantic segmentation + polygon‑level damage classification
- ✔ A ResNet‑34 Siamese change‑encoder that learns building‑level differences
- ✔ Feature pooling inside polygons, not bounding boxes, increasing accuracy (see the sketch after this list)
- ✔ A full evaluation suite generating:
  - Confusion matrix
  - ROC curves
  - PR curves
  - Calibration curves
  - Per‑class F1
  - Class distribution plots
- ✔ A partial HuggingFace dataset downloader (public xBD mirror) for easy setup
- ✔ A Streamlit map interface for visualizing building damage on an interactive world map
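Expanding the polygon‑pooling bullet above: a minimal sketch, assuming footprints arrive as shapely polygons in tile pixel coordinates and are rasterized to the feature‑map grid with Pillow (the real pipeline's pooling may differ in details):

```python
import numpy as np
import torch
from PIL import Image, ImageDraw
from shapely.geometry import Polygon

def polygon_pool(feature_map: torch.Tensor, poly: Polygon, tile_size: int) -> torch.Tensor:
    """Average (C, H, W) CNN features over pixels inside one building footprint."""
    c, h, w = feature_map.shape
    sx, sy = w / tile_size, h / tile_size  # tile pixels -> feature-map cells
    pts = [(x * sx, y * sy) for x, y in poly.exterior.coords]
    mask_img = Image.new("L", (w, h), 0)
    ImageDraw.Draw(mask_img).polygon(pts, fill=1)
    mask = torch.from_numpy(np.array(mask_img, dtype=np.float32))
    if mask.sum() == 0:  # footprint smaller than one cell: keep its centroid
        mask[int(poly.centroid.y * sy) % h, int(poly.centroid.x * sx) % w] = 1.0
    # Masked mean over spatial dims -> one feature vector per building.
    return (feature_map * mask).sum(dim=(1, 2)) / mask.sum()

feat = torch.randn(512, 64, 64)  # e.g. features of a 1024-px tile
footprint = Polygon([(100, 100), (300, 100), (300, 250), (100, 250)])
print(polygon_pool(feat, footprint, tile_size=1024).shape)  # torch.Size([512])
```

Pooling over the rasterized footprint rather than its bounding box keeps features from neighboring buildings and background out of the vector, which matters in dense urban tiles.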
- Gupta et al., "xBD: A Dataset for Assessing Building Damage from Satellite Imagery," 2019.
- He et al., "Deep Residual Learning for Image Recognition," CVPR 2016.
- Long et al., "Fully Convolutional Networks for Semantic Segmentation," CVPR 2015.
- Spall, J. C., "Simultaneous Perturbation Stochastic Approximation," 1998.
- Python 3.11+
- PyTorch
- Torchvision
- NumPy
- Matplotlib
- Pandas
- Scikit‑learn
- Pillow
- tqdm
- shapely
- huggingface_hub
- hf_transfer (optional, for faster downloads)
- Streamlit (for frontend)
- pydeck (for map visualization)
This project uses uv as the package manager (recommended by course requirements). All dependencies — including PyTorch, torchvision, numpy, matplotlib, sklearn, and Streamlit — are fully specified in:
- pyproject.toml
- uv.lock
Install uv on macOS/Linux:

```bash
curl -LsSf https://astral.sh/uv/install.sh | sh
```

Or on Windows (PowerShell):

```powershell
irm https://astral.sh/uv/install.ps1 | iex
```

From the root of the repo, run:

```bash
uv sync
```

This will:

- Create a virtual environment automatically
- Install all required Python dependencies exactly as pinned in uv.lock
- Ensure reproducible builds across machines

Activate the environment. On macOS/Linux:

```bash
source .venv/bin/activate
```

On Windows (PowerShell):

```powershell
.venv\Scripts\Activate.ps1
```

Verify the setup:

```bash
uv run python -c "import torch; import sklearn; print('OK')"
```

You should see:

```
OK
```

We use a partial downloader for the public HuggingFace dataset:
- Repo: aryananand/xBD
- Downloader script: src/download_xbd_partial.py
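The script itself is not reproduced here, but a partial pull from a HuggingFace dataset repo can be expressed with snapshot_download and allow_patterns; a sketch assuming the repo follows the layout shown below (the glob patterns are illustrative):

```python
from huggingface_hub import snapshot_download

# Fetch only a slice of the mirror instead of all ~40k tiles.
# Adjust the allow_patterns globs to match the repo's actual layout.
snapshot_download(
    repo_id="aryananand/xBD",
    repo_type="dataset",
    local_dir="data/xBD",
    allow_patterns=["train/images/*", "train/labels/*", "train/target/*"],
)
```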
Run:

```bash
python3 src/download_xbd_partial.py
```

This populates:

```
data/xBD/
├── train/
│   ├── images/
│   ├── labels/
│   └── target/
└── test/
    ├── images/
    ├── labels/
    └── target/
```
Each image tile includes pre‑ and post‑disaster PNGs with matching label JSONs.
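For orientation, a sketch of reading one post‑disaster label with shapely (the file path is illustrative; the field names follow the published xBD schema, so verify them against your local copy):

```python
import json
from shapely import wkt

# Illustrative path; substitute any *_post_disaster.json from data/xBD.
with open("data/xBD/train/labels/hurricane-harvey_00000000_post_disaster.json") as f:
    label = json.load(f)

# Each "xy" feature carries a WKT footprint in pixel coordinates plus a
# damage subtype (pre-disaster labels omit the subtype).
for feature in label["features"]["xy"]:
    poly = wkt.loads(feature["wkt"])
    damage = feature["properties"].get("subtype", "un-classified")
    print(f"{damage:>14}  area={poly.area:.0f} px^2")
```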
Project structure:

```
├── data/ # Dataset downloaded from HuggingFace
│ └── xBD/
│ ├── train/
│ │ ├── images/
│ │ ├── labels/
│ │ └── target/
│ └── test/
│ ├── images/
│ ├── labels/
│ └── target/
│
├── models/ # Saved models
│ └── polygon_siamese_best.pt
│
├── artifacts/
│ ├── plots/ # Evaluation figures
│ └── demo_overlays/ # Sample predicted overlay images
│
├── src/
│ ├── train_building_seg.py
│ ├── train_polygon_siamese.py
│ ├── eval_full_model.py
│ ├── model_polygon_siamese.py
│ ├── building_seg_model.py
│ ├── dataset_tiles.py
│ ├── app_streamlit_map.py
│ └── common.py
│
├── HuggingFace/
│ └── get_xbd.py # Partial dataset downloader
│
├── README.md
└── requirements.txt
```
Some essential model weight files are not stored in this repository because of GitHub's file-size limits. Before running the project, please download the three required files from Google Drive:
👉 **Google Drive link:** https://drive.google.com/drive/folders/1Q8N0FYI1Gcs70K2qR7lmEiIpMUw2Uscc?usp=sharing
| File | Description | Destination |
| --- | --- | --- |
| polygon_siamese_best.pt | Final damage‑classifier model weights | models/ |
| building_seg_best.pt | Final building‑segmentation model weights | models/ |
| fcn_resnet50_coco-1167a1af.pth | Pretrained FCN‑ResNet50 weights for segmentation | weights/ |
Place the files in the following paths inside your repo:

```
project-root/
│
├── models/
│   ├── polygon_siamese_best.pt
│   └── building_seg_best.pt
│
└── weights/
    └── fcn_resnet50_coco-1167a1af.pth
```
Make sure the filenames match exactly; otherwise, the training/evaluation scripts will not find them.
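For context on the third file: a sketch of loading that checkpoint and adapting the head for single‑class building segmentation (the head swap is illustrative; building_seg_model.py holds the actual implementation):

```python
import torch
from torchvision.models.segmentation import fcn_resnet50

# Build the architecture without downloading weights, then load the local
# COCO checkpoint (21 classes, including the auxiliary head).
model = fcn_resnet50(weights=None, aux_loss=True)
state = torch.load("weights/fcn_resnet50_coco-1167a1af.pth", map_location="cpu")
model.load_state_dict(state)

# Swap the final 1x1 conv for a single building/background logit channel.
model.classifier[4] = torch.nn.Conv2d(512, 1, kernel_size=1)
model.eval()

out = model(torch.randn(1, 3, 512, 512))["out"]
print(out.shape)  # torch.Size([1, 1, 512, 512])
```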
Download the dataset:

```bash
python3 src/download_xbd_partial.py
```

Train the building‑segmentation model:

```bash
python3 src/train_building_seg.py
```

Train the polygon‑level Siamese classifier:

```bash
python3 src/train_polygon_siamese.py
```

Evaluate the full model (generates all research‑quality plots):

```bash
python3 src/eval_full_model.py
```

Generate random‑tile demo overlays:

```bash
python3 -m src.demo_random_tiles
```

Launch the interactive map:

```bash
streamlit run src/app_streamlit_map.py
```
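The map app renders per‑building polygons with pydeck; a minimal sketch of that pattern, with an invented record schema and coordinates that do not mirror app_streamlit_map.py exactly:

```python
import pydeck as pdk
import streamlit as st

# Hypothetical per-building records: polygon rings as [lng, lat] pairs plus
# a damage class mapped to an RGB fill color.
buildings = [
    {"polygon": [[-95.36, 29.76], [-95.35, 29.76], [-95.35, 29.77], [-95.36, 29.77]],
     "damage": "major-damage", "color": [230, 120, 40]},
]

layer = pdk.Layer(
    "PolygonLayer",
    data=buildings,
    get_polygon="polygon",
    get_fill_color="color",
    pickable=True,  # enables the hover tooltip below
)
view = pdk.ViewState(latitude=29.765, longitude=-95.355, zoom=13)
st.pydeck_chart(pdk.Deck(layers=[layer], initial_view_state=view,
                         tooltip={"text": "{damage}"}))
```

Run it with `streamlit run` to see the polygons drawn over the base map.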