CLAUDE.md -- saar

914 functions, 153 classes. Async adoption: 10%. Type hint coverage: 84%.

Frontend

Stack: React + TypeScript + Vite

Package manager: bun -- always use bun install, never npm/yarn
Styling: Tailwind CSS -- no raw CSS files

Coding Conventions

Use snake_case for function names
Use PascalCase for class names
Use UPPER_SNAKE_CASE for constants
Use snake_case for file names
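
A minimal sketch of the naming rules above applied together. The names FileSummary, MAX_SUMMARY_LINES, and is_over_budget are hypothetical and do not come from the saar codebase.

```python
from __future__ import annotations

from dataclasses import dataclass

# UPPER_SNAKE_CASE for constants (hypothetical value)
MAX_SUMMARY_LINES = 100


# PascalCase for class names (hypothetical class)
@dataclass
class FileSummary:
    path: str
    line_count: int


# snake_case for function names (hypothetical function)
def is_over_budget(summary: FileSummary) -> bool:
    """Return True when the file has more lines than the cap."""
    return summary.line_count > MAX_SUMMARY_LINES
```

By the file-name rule, a module holding this code would be saved under a snake_case name such as file_summary.py.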

Preferred imports:

from __future__ import annotations
from pathlib import Path
import logging
from typing import Optional
import json
import re
from saar.models import CodebaseDNA
import numpy as np
from dataclasses import dataclass
import os

Logging

Use logging.getLogger(__name__) for all logging, never print()
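
A short sketch of the logging rule, assuming a hypothetical helper named count_python_files that is not part of saar. Only the logging.getLogger(__name__) call is taken from the rule above.

```python
from __future__ import annotations

import logging

# Module-level logger named after the importing module.
logger = logging.getLogger(__name__)


def count_python_files(paths: list[str]) -> int:
    """Hypothetical helper: count .py paths and log the result instead of printing it."""
    count = sum(1 for p in paths if p.endswith(".py"))
    # Lazy %-style formatting: the message is only built if the record is emitted.
    logger.info("found %d Python files", count)
    return count
```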

Critical Files

These files have the most dependents -- understand them before editing:

saar/models.py (33 dependents)
saar/cli.py (10 dependents)
saar/extractor.py (9 dependents)
saar/rl/action_space.py (8 dependents)
saar/rl/agents/reinforce.py (7 dependents)
saar/rl/agents/ucb_bandit.py (7 dependents)
saar/formatters/agents_md.py (7 dependents)
saar/rl/policy_store.py (5 dependents)

[42 lines omitted -- run saar extract --verbose for full output]
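
The omission notice above is what a line budget (see Domain Vocabulary) might produce. The following is a sketch only: apply_budget is a hypothetical function, and whether saar counts the notice itself toward the cap is not stated in this file.

```python
from __future__ import annotations


def apply_budget(lines: list[str], budget: int = 100) -> list[str]:
    """Keep at most `budget` lines and append a notice describing what was cut.

    Assumption: the notice line is not counted toward the budget.
    """
    if len(lines) <= budget:
        return lines
    omitted = len(lines) - budget
    notice = f"[{omitted} lines omitted -- run saar extract --verbose for full output]"
    return lines[:budget] + [notice]
```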

Tribal Knowledge

Captured via saar interview -- human knowledge static analysis cannot detect.

This project: CLI tool that extracts codebase DNA and generates AI context files (AGENTS.md, CLAUDE.md, .cursorrules) -- no server, no account, no API key required

Never Do

Always run tests inside the venv: source venv/bin/activate && pytest tests/ -q -- the system Python lacks typer, which causes collection errors
All 548 tests must pass before any commit
Never push directly to OpenCodeIntel/saar main. Never commit venv/, dist/, or __pycache__/
Never add external infrastructure dependencies (no Supabase, Redis, network calls in core path)
Never add command logic to cli.py -- it only registers app.command() calls. Logic goes in saar/commands/
Never add extraction logic to extractor.py -- DNAExtractor delegates to saar/extractors/ modules
benchmark/ contains OPE-99 results -- never delete benchmark_results.json or benchmark_report.md
saar has NO web auth -- any detected Depends(reusable_oauth2) is a false positive from test fixtures
Always run ruff check saar/ tests/ && pytest tests/ -q before committing
test rule for demo
test mistake
test rule audit
test capture audit
Never import from saar.extractor directly
Never use npm instead of bun (recorded mistake)
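
The rule that extractor.py only delegates can be pictured with the sketch below. It is an illustration under assumptions: DNAExtractorSketch, extract_naming, and extract_logging are hypothetical names, and the real DNAExtractor interface is not shown in this file.

```python
from __future__ import annotations

from typing import Callable


# Hypothetical stand-ins for functions that would live in saar/extractors/ modules.
def extract_naming(source: str) -> dict[str, int]:
    return {"functions": source.count("def ")}


def extract_logging(source: str) -> dict[str, int]:
    return {"get_logger_calls": source.count("logging.getLogger(")}


class DNAExtractorSketch:
    """Thin coordinator: holds no extraction logic, only calls registered extractors."""

    def __init__(self, extractors: dict[str, Callable[[str], dict[str, int]]]) -> None:
        self._extractors = extractors

    def extract(self, source: str) -> dict[str, dict[str, int]]:
        # Each result is keyed by the extractor's name; new logic means a new module,
        # not a new branch in this class.
        return {name: fn(source) for name, fn in self._extractors.items()}
```

The same shape applies to cli.py: it registers commands, and the command bodies live elsewhere (saar/commands/).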

Domain Vocabulary

DNA = extracted architectural patterns of a codebase (not genetic material)
Tribal knowledge = context only humans can provide: gotchas, domain terms, verification workflows -- static analysis cannot detect this
SAAR:AUTO-START/END markers = preservation markers separating auto-generated from human-written sections in generated files
budget = line cap on generated output (default 100 lines, --verbose for full)
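
A sketch of how preservation markers could protect human-written text during regeneration. The exact marker spelling is an assumption (HTML-comment form), and replace_auto_section is a hypothetical function, not saar's implementation.

```python
AUTO_START = "<!-- SAAR:AUTO-START -->"  # assumed marker spelling
AUTO_END = "<!-- SAAR:AUTO-END -->"  # assumed marker spelling


def replace_auto_section(document: str, new_auto: str) -> str:
    """Swap the text between the markers, keeping everything outside them."""
    start = document.find(AUTO_START)
    end = document.find(AUTO_END)
    if start == -1 or end == -1 or end < start:
        # No well-formed marker pair: leave the file untouched.
        return document
    head = document[: start + len(AUTO_START)]
    tail = document[end:]
    return f"{head}\n{new_auto}\n{tail}"
```

Returning the document unchanged when the markers are missing is a conservative choice: it avoids overwriting a file that may be entirely human-written.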

Verification Workflow

Run source venv/bin/activate && pytest tests/ -q; all 548 tests must pass. Then run saar extract . --no-interview to verify that the CLI output is clean.

Off-Limits Files

AI must never modify these:

saar/models.py -- core data contract, CodebaseDNA and all dataclasses, never modify without discussion

Additional Context

This repo dogfoods itself -- after any formatter change, regenerate all context files: saar extract . --force --no-interview
