supertrader

Personal quantitative research platform. First strategy: Reddit-sentiment mean-reversion on US equities. The framework is strategy-agnostic — new strategies are config files plus a small Strategy subclass.

This is exploratory research code. DO NOT use it to trade real capital. See LICENSE and docs/known-limitations.md for the full caveats.

Status

Two research cycles complete + a platform-honesty pass. Framework is the deliverable; strategies tested so far are four documented null results. For a single-document read of the project, see docs/retrospective.md.

Phase	Status
Data layer (Polars + Parquet store, point-in-time view)	done
vectorbt engine + costs + metrics + cross-sectional ranking strategy	done
HoldoutGuard + one-shot holdout discipline	done
HTML tear sheet + `RunManifest` reproducibility ledger + SPY benchmark	done
Cycle 1: rsm_v1 Reddit-sentiment mean-reversion	done — negative verdict
Cycle 2: v2 tech momentum / reversal / volume surge on SP500	done — three negative verdicts
Platform-honesty pass turnover cap + smoothing + cost-model v2 + universe-guard	done
Paid data (Polygon / EODHD)	deferred per ADR 0008
Paper trading via Alpaca, Form 4 integration	future, pending a working strategy

Results to date. Four documented null results across two cycles. The platform's discipline machinery correctly identified each as not-tradeable and left all holdouts untouched:

RSM v1 — test Sharpe +0.94 but train Sharpe -0.47 (anti- generalization), 2× cost Sharpe 0.68 (below 0.8 threshold), IR vs SPY only +0.34. Most of the test-window gain was concentrated in a single quarter.
v2 cross-sectional momentum — train Sharpe -0.06, test Sharpe -0.89. Window includes too many momentum-crash regimes (Mar 2020 / Jan 2021 / late 2022).
v2 z-score reversal — train Sharpe -2.05, test Sharpe -3.01. Short-term reversal is arbitraged out on SP500 large caps.
v2 volume surge — train Sharpe -0.67, test Sharpe +0.89, but IR vs SPY -0.38. Same anti-generalization shape as rsm_v1.

Per ADR 0005's bonferroni accounting, the running test-peek count (N=7) now implies a per-strategy Sharpe threshold of ~1.6 to clear multi-comparison noise. No strategy has cleared it.

Full reasoning:

Retrospective — single-document read spanning both cycles + the platform-honesty pass.
v1 verdict — initial rsm_v1 read.
v1 postmortem — considered analysis after diagnostics.
v2 comparative verdict — three technical signals, side-by-side.
v2 postmortem — what the v2 results imply for the next cycle.
Known limitations — eight ranked caveats that bound any result here.

400+ tests, ~90% line coverage, mypy --strict clean, import-linter enforced layer boundaries.

Architecture

Four strict layers, one-way imports:

data ──▶ signals ──▶ strategies ──▶ execution
                       (pipelines compose them)

data — DataSources ingest external data into a canonical Parquet store (ParquetStore + sqlite metadata). Never read by strategies directly. Sources are write-only.
signals — Pure functions of stored data. Produce (date × ticker) panels.
strategies — Consume named signals, emit target weights.
execution / backtest — Translate target weights to orders (backtest via vectorbt; paper trading via Alpaca planned).

import-linter enforces the layering. ADRs in docs/adr/ record the non-obvious decisions: vectorbt as the engine (0001), Arctic Shift for Reddit history (0002), redline as a clean Parquet-export boundary (0003), static universe and survivorship caveat (0004), pluggable sentiment scorer (0006).

Install

uv sync
uv run pre-commit install

Quickstart

# Q1 2024 framework-validation smoke (3-month window — NOT a strategy verdict).
# Requires a clean git tree; pass --allow-dirty to override.
uv run supertrader backtest --config configs/runs/rsm_v1_q1_2024.yaml

# Output lands in data/runs/rsm-v1-q1-2024-smoke/:
#   metrics.json     — sharpe / sortino / drawdown per train/test/holdout
#   manifest.json    — git SHA, config hash, data hashes (reproducibility)
#   tear_sheet.html  — open in a browser

The canonical 18mo train / 6mo test / 3mo holdout config is configs/runs/rsm_v1_backtest.yaml. It needs the full 2022-2024 WSB backfill to be on disk; that backfill is running at the time of this README.

Development

uv run ruff check . && uv run ruff format --check .
uv run mypy --strict src/
uv run lint-imports
uv run pytest                # ~1.5 min

CLAUDE.md is loaded automatically by Claude Code sessions in this repo — it codifies the conventions (layered architecture, strict typing, holdout discipline, no DB mocking, etc.).

docs/dev-notes.md is the running engineering journal.

Reproducibility

Every backtest run writes two sources of truth:

data/meta.sqlite — run_manifests table with run id, config hash, git SHA, dirty flag, python + supertrader versions, started/ended timestamps, status, and a content-hash dict of every input parquet partition.
data/runs/<run_id>/manifest.json — same data, mirrored as JSON for git-able provenance next to the tear sheet.

run_backtest() refuses to start on a dirty git tree unless --allow-dirty is passed (the dirty flag is recorded on the manifest either way).

The holdout window is one-shot per config_hash. A second touch raises HoldoutTouchedError. The only way to re-evaluate is scripts/reset_holdout_lock.py, which appends an audit record to data/runs/holdout_overrides.log (append-only, pre-commit-protected).

Honesty about results

Treat any reported Sharpe ratio in this repository as exploratory until the canonical 18mo/6mo/3mo run is complete and reviewed in light of docs/known-limitations.md. The shortlist of caveats:

The universe is selection-biased toward stocks WSB talks about.
Survivorship bias on this 34-ticker subset is plausibly 3-8%/yr, not 1-3%.
The cost model likely understates round-trip costs by 50-200%.
The 3-month holdout cannot validate a moderate signal — only reject extreme overfitting.

A positive backtest is the start of a research question, not its end.

Project layout

src/supertrader/      # importable package (src layout)
  config/             # pydantic config models + YAML loader
  data/               # ParquetStore, sources, universe, point-in-time view
  signals/            # signal base + reddit_sentiment + technical
  strategies/         # strategy base + mean_reversion + risk
  backtest/           # vectorbt engine, costs, metrics, splits, report
  execution/          # backtest adapter (paper/live planned)
  pipelines/          # run_backtest, the only module that crosses layers
  observability/      # RunManifest + structured logging
  cli.py              # typer entry point
configs/              # version-controlled YAML configs
  runs/               # top-level run configs (compose data + signal + strategy)
  universe/           # ticker snapshots
data/                 # gitignored — on-disk parquet store, sqlite metadata, runs
docs/
  adr/                # architecture decision records
  known-limitations.md
  dev-notes.md
scripts/              # standalone ops scripts (backfill, validation, reset)
tests/
  unit/               # fast, isolated
  integration/        # touches disk + small fixtures
  e2e/                # full pipeline on tiny synthetic data
  golden/             # snapshots + reference data
research/             # throwaway notebooks (not importable)

Canonical 8-week build plan: ~/.claude/plans/i-m-building-a-personal-sorted-kettle.md.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
configs		configs
docs		docs
research		research
scripts		scripts
src/supertrader		src/supertrader
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

supertrader

Status

Architecture

Install

Quickstart

Development

Reproducibility

Honesty about results

Project layout

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

supertrader

Status

Architecture

Install

Quickstart

Development

Reproducibility

Honesty about results

Project layout

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages