Skip to content

Latest commit

 

History

History
95 lines (82 loc) · 2.13 KB

File metadata and controls

95 lines (82 loc) · 2.13 KB

HaystackBench Development Progress

Phase 0: Planning & Documentation

  • PRD.md complete
  • ARCHITECTURE.md complete
  • ROADMAP.md complete
  • METHODOLOGY.md complete
  • Initial git commit

Phase 1: Project Scaffold

  • Folder structure created
  • pyproject.toml configured
  • .gitignore configured
  • Basic package imports working

Phase 2: Core Engine

  • Needle generation (synthetic)
  • Needle generation (template)
  • Needle generation (relational/ATC)
  • Haystack builder (Paul Graham source)
  • Haystack builder (Wikipedia source)
  • Depth injector
  • Evaluator (exact match)
  • Evaluator (semantic)
  • Evaluator (LLM judge)
  • Runner (async)
  • Runner (checkpoint/resume)
  • Runner (caching)

Phase 3: Providers

  • Anthropic provider
  • OpenAI provider
  • Google provider
  • Mistral provider
  • Cohere provider
  • Groq provider
  • Ollama provider (local)

Phase 4: Tasks

  • S-RT task
  • M-RT task
  • M-RS task
  • ATC task
  • Counting task
  • Key-Value task
  • Conflicting needles task

Phase 5: Storage

  • SQLite schema and migrations
  • Credential storage (keyring)
  • Response caching

Phase 6: Visualization

  • Heatmap generator
  • Line chart generator
  • Radar chart generator
  • HTML report generator

Phase 7: CLI

  • Setup commands
  • Run commands
  • Results commands
  • Serve command

Phase 8: Web UI Backend

  • FastAPI app skeleton
  • Provider/credential routes
  • Experiment routes
  • Results routes
  • WebSocket progress streaming

Phase 9: Web UI Frontend

  • Project scaffold (SvelteKit)
  • Dashboard page
  • Model setup page
  • New experiment wizard
  • Progress tracking page
  • Results dashboard with heatmap
  • Results library

Phase 10: Tests

  • Unit tests (80%+ coverage)
  • Integration tests

Phase 11: Documentation

  • README.md
  • QUICKSTART.md
  • CONTRIBUTING.md
  • API docs (auto-generated)

Phase 12: Release Prep

  • CI/CD pipeline
  • Package builds successfully
  • All docs reviewed
  • Git tags and CHANGELOG