🎨 Doodle Matcher

A mobile app that matches your animal sketches with real photos using AI-powered vector search. Draw an animal, get matched with actual animal photos ranked by similarity!

🎬 Demo

📱 Try It Live!

📦 Android Install: Download & Install

Open the link on your Android device to install the app directly

✨ How It Works

User Drawing → CLIP Embeddings → Vector Search → Top 3 Matches
     📱              🧠              🔍            📊

Draw: Sketch an animal on your phone
Process: CLIP model converts your doodle into a 512-dimensional vector
Search: Qdrant finds the most similar animal photos in vector space
Results: Get top 3 matches with confidence scores (0-100%)

🏗️ Architecture

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│                 │    │                 │    │                 │
│   Expo/RN App   │────│   FastAPI       │────│   Qdrant        │
│                 │    │                 │    │                 │
│ • Canvas Drawing│    │ • CLIP Model    │    │ • Vector Search │
│ • Results UI    │    │ • Embeddings    │    │ • Similarity    │
│ • History       │    │ • Image Proc    │    │ • Top-K Results │
│                 │    │                 │    │                 │
└─────────────────┘    └─────────────────┘    └─────────────────┘
      Mobile                Backend              Vector Database

🚀 Tech Stack

Frontend (Mobile)

Expo/React Native - Cross-platform mobile development
TypeScript - Type safety and better DX
React Native-Skia - High-performance 2D graphics and smooth vector drawing canvas
Expo Router - File-based navigation
NativeWind - Tailwind CSS for React Native

Backend (API)

FastAPI - High-performance async Python API
ONNX Runtime - Optimized CLIP model inference
Qdrant - Vector database for similarity search
Pillow - Image processing and optimization

Infrastructure

Railway - Deployment platform
Unsplash API - Animal photo dataset source

📱 App Flow

The app has three main states on a single screen:

1. Drawing State

Vector-based canvas for smooth drawing
Brush controls and canvas management
"Search" button to trigger matching

2. Loading State

Embedding generation in progress
Cancel option available

3. Results State

Top 3 matches with confidence scores
Visual confidence indicators (color-coded)
Hero layout - best match prominent, others below
"Draw Again" to restart

🔍 The Magic: Vector Similarity

Why CLIP? CLIP (Contrastive Language-Image Pre-training) creates embeddings that work well for both sketches and real photos in the same vector space.

Why Cosine Similarity? Measures the angle between vectors rather than distance, perfect for high-dimensional embeddings where magnitude matters less than direction.

Why Top-K Search? Qdrant's vector search returns ranked results, showcasing the power of similarity scoring in real-time.

⚠️ Technical Challenges & Solutions

Continuous Drawing Performance
Implementing smooth finger drawing with Skia required careful PanResponder handling and optimized stroke rendering. Managing multiple stroke states while maintaining stable performance and correct render view is a challenge.
Model Size Optimization
Initial PyTorch CLIP implementation resulted in ~27GB model weights, making local development impractical. Switched to ONNX Runtime for 10x size reduction and faster inference.
Hugging Face Transformers Limitations
Not all CLIP models are exposed through the Hugging Face Inference API. Some models required direct download and local execution to enable reliable embedding generation.
Unsplash Integration Complexity
API rate limits, inconsistent payload structures, and URL format variations required multiple iterations to achieve reliable dataset population and development workflow.
Decision to Run Local CLIP Model
To maintain full control over embeddings, improve latency, and avoid cloud inference limits, a local CLIP model download was required. This added extra setup complexity but solved reliability and cost issues.

Railway + Qdrant Connection Issues
The qdrant-client library has compatibility issues with Railway's URL format, causing 6+ second connection delays and timeouts. Solution: Use explicit host/port configuration instead of URL format:

# ❌ This causes timeouts on Railway
client = QdrantClient(url="https://your-qdrant.railway.app")

# ✅ This works properly
client = QdrantClient(
    host="your-qdrant.railway.app",
    port=443,
    https=True,
    timeout=60
)

⚙️ Environment Variables

Create a .env file in the backend directory with the following variables:

# Required for populating database with animal photos
UNSPLASH_API_KEY=your_unsplash_access_key
# Qdrant database URL (production)
QDRANT_URL=https://your-qdrant-instance.up.railway.app

Note: Free-tier Unsplash has API rate limits—avoid making bulk requests too quickly when populating the database.

🛠️ Setup & Development

Prerequisites

Node.js 18+ and npm/yarn
Python 3.10+ with Poetry
Expo CLI (npm install -g @expo/cli)
Docker and Docker Compose

Local Development

Clone and install dependencies

git clone https://github.com/yourusername/DoodleMatcher.git
cd DoodleMatcher

# Backend
cd backend
poetry install

# Frontend
cd ../mobile
npm install

Download CLIP ONNX Model

cd backend/models/clip
wget https://huggingface.co/Qdrant/clip-ViT-B-32-vision/resolve/main/model.onnx -O clip-ViT-B-32-vision.onnx

Set up environment variables

cd backend
cp .env.example .env
# Edit .env with your Unsplash API key

Start local services

# Start all services (Qdrant + FastAPI backend) with Docker Compose
docker-compose up

# Start Expo app (in new terminal)
cd mobile
npx expo start

Populate database (one-time setup)

cd backend
poetry run python -m scripts.populate_qdrant

Key Dependencies

Backend Python packages (installed via Poetry):

onnxruntime - ONNX model inference
numpy - Numerical computations
Pillow - Image processing
fastapi - Web API framework
pydantic - Data validation
qdrant-client - Vector database client

GPU Acceleration (optional): For faster inference, install ONNX Runtime with CUDA support separately:

pip install onnxruntime-gpu
# Then modify providers=["CUDAExecutionProvider"] in embeddings.py

🚀 Deployment

Railway Deployment

Deploy Qdrant service
- Create new Railway project
- Deploy Qdrant from template
- Note the Railway domain URL
Deploy FastAPI backend
- Connect GitHub repository
- Set environment variables:
```
UNSPLASH_API_KEY=your-unsplash-key
```
- Deploy from /backend directory

⚠️ Configure Qdrant client for Railway

In services/qdrant_service.py, use the host/port format for Railway compatibility:

# For Railway deployment
client = QdrantClient(
    host="your-qdrant-railway-domain.up.railway.app",
    port=443,
    https=True,
    timeout=60
)

Upload CLIP model to production

Ensure the ONNX model is available in your deployment environment.
Populate production database

To populate Railway Qdrant from your local machine, temporarily update the client configuration in qdrant_service.py, then:
```
cd backend
poetry run python -m scripts.populate_qdrant
```
Build mobile app
```
cd mobile
eas build --platform all
```

🔧 API Endpoints

`POST /search-doodle`

Matches a drawn sketch with animal photos.

Request:

{
  "image_data": "base64_png_string"
}

Response:

{
  "matches": [
    {
      "photo_url": "https://images.unsplash.com/...",
      "confidence": 94.2,
      "animal_type": "cat",
      "photographer": "John Doe"
    }
  ],
  "search_time_ms": 156
}

`GET /health`

System health and model status. Use this endpoint to verify backend and CLIP model readiness.

🎯 Key Features

Real-time vector search with sub-200ms response times
Confidence scoring converted to user-friendly percentages
Drawing history with local storage
Responsive canvas optimized for finger drawing
Graceful error handling with retry mechanisms

📈 Performance

Embedding generation: ~100-200ms per sketch
Vector search: ~50-100ms for top-3 results
End-to-end latency: <500ms typical
Database size: 500+ animal photos, ~256MB embeddings

🛠️ Tech Highlights

React Native + Expo for cross-platform mobile development
ONNX Runtime integration for optimized AI inference
Vector database operations with real-time similarity search
Cross-modal AI bridging sketches and photographs

Built with 🐾 because every doodle deserves its perfect match

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
backend		backend
mobile		mobile
.gitignore		.gitignore
README.md		README.md
doodle-cat.gif		doodle-cat.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎨 Doodle Matcher

🎬 Demo

📱 Try It Live!

✨ How It Works

🏗️ Architecture

🚀 Tech Stack

Frontend (Mobile)

Backend (API)

Infrastructure

📱 App Flow

1. Drawing State

2. Loading State

3. Results State

🔍 The Magic: Vector Similarity

⚠️ Technical Challenges & Solutions

⚙️ Environment Variables

🛠️ Setup & Development

Prerequisites

Local Development

Key Dependencies

🚀 Deployment

Railway Deployment

🔧 API Endpoints

`POST /search-doodle`

`GET /health`

🎯 Key Features

📈 Performance

🛠️ Tech Highlights

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎨 Doodle Matcher

🎬 Demo

📱 Try It Live!

✨ How It Works

🏗️ Architecture

🚀 Tech Stack

Frontend (Mobile)

Backend (API)

Infrastructure

📱 App Flow

1. Drawing State

2. Loading State

3. Results State

🔍 The Magic: Vector Similarity

⚠️ Technical Challenges & Solutions

⚙️ Environment Variables

🛠️ Setup & Development

Prerequisites

Local Development

Key Dependencies

🚀 Deployment

Railway Deployment

🔧 API Endpoints

POST /search-doodle

GET /health

🎯 Key Features

📈 Performance

🛠️ Tech Highlights

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`POST /search-doodle`

`GET /health`

Packages