Scratch is a Swiss Army knife for big data.
Updated Jul 19, 2024 · Go
Edit database tables like Vim buffers. Staged mutations + live SQL preview, transaction undo, schema browser + ER diagrams + DDL, FK navigation, cross-database federation, data profiling, SQL notebooks, AI SQL via Anthropic/OpenAI/Gemini/Ollama, Parquet/CSV/remote files. PostgreSQL · SQLite · MySQL · DuckDB · MotherDuck
Examples for the MotherDuck WASM Client library, enabling MotherDuck integration for WebAssembly-powered DuckDB
A Beginner's Guide to DuckDB's Python Client
Personal project for setting up an open source data warehouse.
A `dbt` project to analyze data snapshots from OpenAlex
AI-powered geopolitical news intelligence platform. Ingests 100K+ daily events from GDELT, stores in MotherDuck (DuckDB), orchestrates with Dagster, and features an AI chat interface with Text-to-SQL. Full data engineering stack at $0/month.
Production-ready SQLAlchemy dialect for DuckDB and MotherDuck with operational defaults and migration support from duckdb_engine.
Digimon TCG tournament tracker - player standings, deck meta, store directory, and tournament results
Production-grade ELT pipeline classifying VIX volatility regimes. Features incremental models, SCD-2 snapshots, recursive CTEs, and Slim CI/CD. Stack: dbt, DuckDB/MotherDuck, Python.
End-to-end GA4 e-commerce analytics pipeline using PySpark, dbt, and MotherDuck. Includes automated CI/CD via GitHub Actions and Power BI validation of gold marts.
Showcasing different ways of syncing local DuckDB data to MotherDuck (serverless DuckDB)
dbt data pipeline and Evidence BI for analyzing trash wheel collection data