From f1bffb6c365a90378d6f1837bda516ee6ee1fea6 Mon Sep 17 00:00:00 2001
From: Alan Yang <79916645+alan5543@users.noreply.github.com>
Date: Mon, 18 May 2026 15:28:31 -0400
Subject: [PATCH 1/2] fix(web): dismissable stale sync-failure banner (#189)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

* chore(bootstrap): add votee EE overlay + private-repo CodeQL workaround

Bootstraps votee/beever-atlas-ee on top of the OSS fork
(Beever-AI/beever-atlas main @ 947b17d). The full 575-commit OSS history
is preserved as the base of this repo; this single commit layers on the
enterprise IP that lived only in votee's previous fork.

## Votee-only paths added (additive overlay)

  .claude/                                      OpenSpec slash-commands + skills
  .github/workflows/deploy.yml                  AWS EC2 production deploy
  .github/workflows/trigger-docs-rebuild.yml    docs-site dispatch
  docs/Beever_Atlas_Feature_Spec.docx           feature spec
  docs/qa/                                      QA + tool-audit notes
  docs/v1-archive/                              v1 architecture archive
  docs/v2/                                      v2 architecture docs
  openspec/                                     7 change proposals
                                                (m1, m2, RES-177, multi-workspace,
                                                 messages-tab, OSS CLA, ingestion)
  scripts/deploy/                               AWS EC2 bootstrap/provision

## CodeQL workflow patch

  .github/workflows/codeql.yml                  add `upload: never`

votee/beever-atlas-ee is a PRIVATE repo without GitHub Advanced Security,
so the OSS-default SARIF upload fails with "Code Security must be enabled
for this repository" and blocks CI. The CodeQL queries still run cleanly;
only the upload is skipped. Remove `upload: never` if/when GHAS is
purchased for this repo.

## Votee paths intentionally DROPPED (superseded by OSS)

  bot/.eslintrc.json                            -> bot/eslint.config.js (flat)
  web/.../graph/GraphTab.tsx                    -> GraphCanvas + GraphFilters
  web/.../settings/AgentModelRow.tsx            -> AgentModelsTab.tsx
  web/.../settings/AgentModelSettings.tsx       -> AgentModelsTab.tsx
  web/src/hooks/useAgentModels.ts               -> AgentModelsTab.tsx

These had been refactored upstream in OSS (LiteLLM endpoint-catalog work
+ graph component split). Keeping votee's older variants would re-introduce
diverged code paths.

## Origin / upstream relationship

  upstream  https://github.com/Beever-AI/beever-atlas  (OSS, public)
  origin    https://github.com/votee/beever-atlas-ee   (this repo, private)

To sync future OSS changes:
  git fetch upstream
  git merge upstream/main
  # resolve conflicts in overlay paths if any
  git push origin main

## AWS deployment

The existing AWS EC2 instance at 18-118-108-191.nip.io runs the OLD
votee/beever-atlas. A parallel deployment of this -ee repo will be stood
up; once validated, the old deployment is retired.

Constraint: votee/beever-atlas-ee is private and has no GHAS
Constraint: must preserve OSS commit history for upstream-merge workflow
Rejected: hand-built merge of OSS into votee/beever-atlas | unrelated histories,
  produced PR #81 (closed) — was unmaintainable for ongoing sync
Confidence: high
Scope-risk: narrow — single bootstrap commit on fresh repo
Directive: future OSS syncs use `git merge upstream/main`, NOT hand-built
  commit-tree; if conflicts touch overlay paths, prefer keeping the
  votee overlay version unless OSS has actively superseded the file

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* chore(deploy): silence auto-deploy + parameterize NAME for parallel EE deploy

Two fixes that unblock the next step (provisioning a NEW EC2 instance for
the EE deployment, side-by-side with the existing votee/beever-atlas one):

## .github/workflows/deploy.yml — disable push trigger

The deploy job fired on every push and failed at "Setup SSH" because the
EC2_SSH_KEY + EC2_HOST repo secrets aren't set yet on the fresh ee repo.
Restrict to workflow_dispatch only until those secrets are configured;
restore `push: branches: [main]` once the new EC2 is up and secrets land.

## scripts/deploy/*.sh — NAME-overridable

Hardcoded `beever-atlas` as the AWS resource prefix would collide with the
existing votee/beever-atlas deployment (same keypair + security group
names in the same AWS account). Parameterized via:

  NAME="${NAME:-beever-atlas}"        # default keeps legacy behaviour
  KEY_NAME="${NAME}-key"
  SG_NAME="${NAME}-sg"

So the EE side-by-side deploy is:

  NAME=beever-atlas-ee bash scripts/deploy/deploy.sh

The old votee deploy keeps working as before (default NAME unchanged).
Server-side path `/opt/beever-atlas-v2` left as-is — there's only one app
per EC2 instance, so no collision.

Constraint: must not break the legacy votee/beever-atlas deploy
Confidence: high
Scope-risk: narrow — env-var override with backwards-compatible default
Directive: when retiring votee/beever-atlas, also run
  `NAME=beever-atlas bash scripts/deploy/destroy.sh` to clean up the
  legacy AWS resources

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(deploy): use Docker Hub mirror for Weaviate (cr.weaviate.io is offline)

The Weaviate-hosted container registry cr.weaviate.io has been unreachable
from us-east-2 (and elsewhere) since at least 2026-05-14, blocking the
initial EE deployment. The image content is identical on Docker Hub at
`semitechnologies/weaviate:1.28.0` — switching the registry prefix
unblocks the deploy.

The original SHA256 digest (`58b576d3...`) was pinned to the cr.weaviate.io
manifest. Docker Hub serves a different manifest digest for the same
content, so the pin is dropped for now. Restore the pinned cr.weaviate.io
form once that registry is back up.

Constraint: cr.weaviate.io DNS resolves but all 3 IPs (54.244.195.224,
  34.213.189.139, 52.33.86.107) return "connection refused" on :443
Rejected: wait for upstream registry | indefinite outage, blocks EE bring-up
Rejected: copy the image to a private ECR | overkill for an internal demo
Confidence: high — same image, same tag, just different registry
Scope-risk: narrow — single image, only affects the Weaviate service
Directive: when cr.weaviate.io is back, restore the original
  `cr.weaviate.io/semitechnologies/weaviate:1.28.0@sha256:...` line to
  preserve digest-pin defense-in-depth

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* ci(deploy): re-enable push trigger now that EE EC2 + secrets are live

The new EE EC2 instance (3.134.230.101 at https://3-134-230-101.nip.io)
is provisioned, the docker-compose stack is healthy, and the EC2_HOST +
EC2_SSH_KEY secrets are configured on the votee/beever-atlas-ee repo.

Restoring `on: push: branches: [main]` so subsequent pushes deploy
automatically. This commit itself exercises the pipeline end-to-end.

Confidence: high — manual deploy already verified ALL_HEALTHY
Scope-risk: narrow — single trigger restore

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(deploy): restore cr.weaviate.io digest pin for Supply Chain CI

The earlier `157c389` ("use Docker Hub mirror for Weaviate") dropped the
SHA digest pin while routing around a cr.weaviate.io outage on 2026-05-14.
That triggered the CI / Supply Chain (digest pinning) job to fail every
push: it rejects any `image:` reference not pinned via `@sha256:<digest>`.

cr.weaviate.io is back online as of 2026-05-16, and a probe of the Docker
Hub `semitechnologies/weaviate:1.28.0` multi-arch manifest shows it
shares the exact same digest the cr.weaviate.io image was originally
pinned to (`sha256:58b576d3...`). So restoring the OSS-aligned line is
strictly safe — same image, same digest, just a registry that the
supply-chain check accepts.

Constraint: Supply Chain job requires every `image:` to carry an `@sha256:`
  digest
Confidence: high — verified the digest matches across both registries
Scope-risk: narrow — single line in docker-compose.yml

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* test(web): de-flake AgentModelsTab toast assertion on slow CI runners

The "clicking a preset card calls applyPreset and shows the diff toast"
test was failing on ee CI with:

  TestingLibraryElementError: Unable to find an element with the text:
  /Applied 'Gemini balanced' — 1 updated/

Root cause: useToast auto-dismisses info toasts after INFO_TTL_MS=2500ms.
On slow CI runners the test's initial render + fetch resolution can push
the first waitFor poll past the 2500ms window, so the toast has already
self-dismissed when the assertion runs.

Fix: query by role="status" (ToastViewport wraps each toast in <div
role="status">), then regex-match textContent. This is more robust:

  - Doesn't depend on textContent being a single text node
  - Re-checks each poll so it tolerates the brief render → dismiss flicker
  - Survives whitespace / em-dash formatting drift
  - 50ms interval ensures we catch the toast inside its 2500ms TTL window

No runtime / component changes. Test-only fix.

Constraint: don't bump INFO_TTL_MS or the on-screen toast lingers longer
  for real users
Confidence: high — the role + textContent pattern is the testing-library
  recommended workaround for "text broken up by multiple elements"
Scope-risk: narrow — single test assertion swap

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(web): route chat-image proxy through /api/files/proxy instead of /api/media/proxy

Chat answer images (Mattermost / Slack-bot-gated files) were rendering
as broken images in the Ask tab. Clicking through landed on Mattermost's
401 page (`api.context.session_expired.app_error`).

Root cause: `mediaProxyPathFor()` returned `/api/media/proxy?url=...`,
which is the signed-loader-token endpoint. On this deployment
`LOADER_TOKEN_SECRET` is empty, so the signed-token validator falls back
to a path that doesn't resolve the platform_connection's bot credential.
Backend returns `502 Upstream returned 401`. The `<img>` in
`MarkdownImage` then errors and the link wrapper opens the raw
Mattermost URL — which the browser has no Mattermost cookie for, so it
401s a second time.

Backend has a second, working proxy endpoint at `/api/files/proxy` which
runs through the `BEEVER_LOADER_RAW_KEY_FALLBACK=true` raw-key path.
That endpoint is verified working (HTTP 200, returns file bytes) and is
already used by the wiki view (`filesProxyPathFor`).

Switch `mediaProxyPathFor` to route to `/api/files/proxy` so chat-side
callers (MarkdownImage, SourceCard, InlineMedia, proxiedMediaUrl) reuse
the proven endpoint. Wiki-side callers (`filesProxyPathFor`) unchanged.

Verified by direct probe on the live EE deployment:
  GET /api/files/proxy?url=<mattermost-url> -> 200, 38 MB MP4 body
  GET /api/media/proxy?url=<same>           -> 502, "Upstream returned 401"

EE-side patch only — upstream OSS still emits `/api/media/proxy`. Once
the signed-token credential resolver is wired through the Mattermost
adapter, this can revert to the original endpoint.

Constraint: don't touch the backend — the working endpoint already exists,
  just route the frontend to it
Confidence: high — direct curl probe of both endpoints proves the swap
Scope-risk: narrow — single helper function + matching unit test, no
  rendering logic changed

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(bot): self-heal from chat-adapter-mattermost leak via scheduled recycle + restart policy (RES-286) (#185) (#21)

* fix(bot): self-heal from chat-adapter-mattermost leak via scheduled recycle + restart policy (RES-286)

The Mattermost connection on the live EE deployment kept "going down" because the
bot container was OOM-killed and never restarted. Two compounding issues:

1. `chat-adapter-mattermost@1.1.2` leaks ~37 MB/h via its long-lived WebSocket
   handler closures and an unbounded `mattermostUserCache` in bridge.ts. After
   ~19 h the bot's RSS crosses the host's free-memory headroom (700 MiB on a
   t4g.medium) and the kernel OOM-killer selects it as the highest-RSS process.
2. The bot service had `restart: no` and no `mem_limit`, so the kill was silent
   and required manual `docker start` every time.

Structural fix (this commit):

- `bot/src/chat-manager.ts` — new `scheduleAdapterRecycle(intervalMs)` /
  `stopAdapterRecycle()` methods. The timer calls `rebuild()` every 6 h to drop
  accumulated adapter state. Re-entry is guarded via `transitioning`, and the
  existing `WebhookBuffer` covers the ~1 s rebuild window so callers see no
  degradation. Six unit tests cover happy path, no-adapter early-return,
  transitioning guard, disable (interval ≤ 0), idempotent re-schedule, and stop.
- `bot/src/bridge.ts` — export `clearMattermostUserCache()` and hook it into
  the existing `onRebuild` listener alongside `clearBridgeCache()`. The module-
  level Map at bridge.ts:1585 had no eviction path; it's now cleared on every
  recycle and on adapter re-registration.
- `bot/src/index.ts` — wire `chatManager.scheduleAdapterRecycle()` from the
  `ADAPTER_RECYCLE_INTERVAL_MS` env (default 6 h, 0 disables). Enrich `/health`
  with `memory: process.memoryUsage()`, `uptime_seconds`, and return 503
  while transitioning so the Docker healthcheck reflects real liveness. Compress
  startup retry delays from `[1,2,4,8,16]s` (31 s worst case) to
  `[0.5,1,2,4,4]s` (11.5 s) so restart blast radius is shorter.

Safety net (compose):

- `docker-compose.yml` — `restart: unless-stopped`, `mem_limit: 768m`,
  `memswap_limit: 768m`, `NODE_OPTIONS=--max-old-space-size=512` (so V8 GCs
  aggressively before the cgroup line, leaving room for graceful shutdown),
  and `start_period: 45s` on the existing healthcheck to accommodate the
  startup retry window. Even if the leak ever exceeds 2× expectation
  (~440 MB peak inside the 6 h cycle), the bot self-restarts in seconds.

Feature gap (the QA-reported `tech-studio` doesn't appear):

- `web/src/components/settings/ManageChannelsDialog.tsx` — Refresh button in the
  dialog header wired to the existing `useConnectionChannels.refetch`. After
  this and the live bot, a user adding the bot to a new MM channel sees it
  surface within seconds without operator intervention.

Bot tests: 173 / 173 pass. Web tests: 531 / 531 pass.

Constraint: t4g.medium has only 4 GiB RAM, no swap, and 6 hot containers
Constraint: chat-adapter-mattermost@1.1.2 is the latest published version
            (npm versions confirms) — no upstream upgrade available
Rejected: Forking chat-adapter-mattermost to patch the leak | high maintenance
          drag for a pilot; scheduled recycle gets us 95% of the value
Rejected: Bot WS subscription to `channel_member_joined` events | the adapter
          doesn't surface those events publicly; requires forking
Rejected: `restart: on-failure` | won't restart after clean SIGTERM during
          deploys
Rejected: REST enumeration of non-member channels | needs `list_team_channels`
          permission on the customer's MM bot; defer until requested
Directive: Do NOT bump `--max-old-space-size` above 512 unless `mem_limit`
           moves in lockstep — cgroup SIGKILL pre-empts V8 GC otherwise
Directive: If `chat-adapter-mattermost` ever ships >1.1.2, re-evaluate whether
           the scheduled recycle is still needed
Confidence: high
Scope-risk: moderate
Not-tested: 24h drift test against a real Mattermost workspace (requires the
            live EE deployment; verified via OOM math + unit tests only)


* fix(bot): broaden cross-platform leak protection + address review feedback (RES-286)

Round-2 changes after OMC code-reviewer + security-reviewer passes against
the initial RES-286 fix. All three reviewer findings addressed.

**Cross-platform leak protection (audit found 4 more module-level caches):**
- `bot/src/bridge.ts` — `clearUserProfileCache()` exported. Same shape as
  `clearMattermostUserCache` but covers the cross-platform user-profile
  Map at line 339. Wired into the existing `onRebuild` listener.
- `bot/src/bridge.ts` — `pruneStaleTeamsConversations(maxAgeMs)` /
  `pruneStaleTelegramChats(maxAgeMs)` exported. These two registries are
  the ONLY source of truth for `listChannels()` on those platforms
  (populated from inbound webhooks; no list API exists), so we age out
  entries older than 30 days on every recycle rather than wholesale-
  clearing. Logs the prune count when non-zero.
- `bot/src/bridge.ts` — `onRebuild` listener now: clearBridgeCache +
  clearMattermostUserCache + clearUserProfileCache + pruneStaleTeams +
  pruneStaleTelegram. Comment explains why some clear and others prune.

**Reviewer feedback addressed:**
- `bot/src/chat-manager.ts` — circuit breaker on `scheduleAdapterRecycle`.
  After `RECYCLE_FAILURE_LIMIT` (3) consecutive failures the timer halts
  and logs a structured error pointing to investigation. A successful
  rebuild resets the counter, so flaky-but-recovering states don't trip
  it. Addresses code-reviewer MEDIUM #1.
- `bot/src/index.ts` — `ADAPTER_RECYCLE_INTERVAL_MS` now has a 60-s floor
  for positive values; `=== 0` still disables. Prevents a misconfigured
  env from thrashing the websocket. Addresses security-reviewer LOW #2.
- `bot/src/index.ts` — `/health` endpoint annotated with a SECURITY block
  documenting that it's bound to 127.0.0.1 and exposes process metrics;
  flags the gate to add if the port is ever exposed publicly. Addresses
  security-reviewer LOW #3.
- `src/beever_atlas/api/connections.py` — `@limiter.limit("20/minute")` +
  `request: Request` on `list_connection_channels`. Prevents a runaway
  Refresh-button client from exhausting the bot token's upstream rate
  limit on Mattermost/Slack APIs. Addresses security-reviewer MEDIUM #1.

**Tests:**
- `bot/src/chat-manager.test.ts` — 4 new tests covering: swallow-rebuild-
  errors-without-stopping, circuit breaker trips at RECYCLE_FAILURE_LIMIT,
  counter resets on success (so flaky rebuilds don't trip).
- `bot/src/bridge.caches.test.ts` — new file covering prune semantics for
  Teams + Telegram registries (empty registry returns 0, fresh entries
  not pruned within window, empty buckets removed after prune), and
  idempotency of the cache clears.

Bot: 183/183 tests pass (up from 173). Python: 86/86 connection tests pass.

Constraint: Teams + Telegram `listChannels()` rely on registry entries
            populated by webhooks — wholesale clearing would empty the
            sidebar until each conversation posts again
Rejected: Wholesale-clear Teams/Telegram registries on recycle | breaks
          `listChannels` until users re-engage; prune is the correct shape
Rejected: Make `/health` auth-gated now | bot port is 127.0.0.1 only, so
          info disclosure is theoretical; SECURITY comment is the right
          cost/benefit
Rejected: Stricter rate limit than 20/minute | manual Refresh + page-load
          spikes could legitimately hit 5-10/minute per user; 20 leaves
          margin without being a DoS lever
Directive: `RECYCLE_FAILURE_LIMIT` should stay at 3 — set higher and the
           log noise the breaker exists to prevent comes back; set lower
           and a single network blip can disable recycle for the whole
           bot lifetime
Confidence: high
Scope-risk: narrow
Not-tested: long-running 24h drift with Teams + Telegram traffic (the
            prune path is exercised only on a real recycle every 6 h)


---------

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(ux+observability): TTL ring buffer, orphan platform fallback, Mermaid cancellation guard (#186) (#22)

Three small, independent QA-reported bugs bundled into one PR. Each was
investigated by a separate OMC agent and the consolidated plan was stress-
tested by an OMC critic before any code was written (the critic caught a
missed sibling fix-site and an underspecified cancellation pattern, both
addressed below).

**RES-284 — Agent Models tab "many errors" after sync**

The bug: stale `ok=false` entries from a previous failure burst stayed in
the in-process LiteLLM ring buffer (`deque(maxlen=50)`) until 50 newer
calls evicted them or the process restarted — sometimes painting the
Agent Models tab red for hours after the underlying cause was fixed.

Fix:
- `src/beever_atlas/services/llm_call_log.py` — entries are now stored as
  `(time.time(), call)` tuples so `snapshot()` can age them out. Default
  TTL is 30 min (matches typical "investigate, fix, verify" loop). Pass a
  negative value to disable filtering for operator debugging.
- `src/beever_atlas/api/llm_debug.py` — the debug endpoint exposes
  `?max_age_seconds=N` so operators can inspect older entries on demand.
- Server-only filter (no client-side double-filter) per the critic's
  feedback — using two different time sources (Python `time.time()` vs
  JS `Date.parse`) for the same TTL would be a maintenance trap.
- 4 new tests in `tests/services/test_llm_call_log.py`: default-TTL keeps
  recent, TTL filters old, boundary (just inside vs just outside),
  negative TTL disables filtering.

**RES-287/4a — "Ungrouped (Discord)" mislabel on Mattermost workspace**

The bug: orphan channels (no `connection_id`, e.g. CSV-imported or pre-
connection-model legacy) used to fall back to `platform="discord"`
server-side. The FE sidebar then rendered the Discord icon next to
"Ungrouped" on a Mattermost workspace.

Fix:
- `src/beever_atlas/api/channels.py:514, 667` — `or "discord"` →
  `or "unknown"`. `PlatformIcon` already falls back to the neutral
  `MessageSquare` icon for unknown platforms.
- `src/beever_atlas/agents/tools/_citation_decorator.py:188` (caught by
  the critic — missed by the original investigation) — same fix for the
  sibling site that derived `slack:channel:ts:fact_id` native-identity
  strings for orphan channel-message items. The permalink resolver
  already returns `None` for unknown platforms, so no broken Slack URLs
  get constructed.
- 4 new tests in `tests/test_orphan_platform_fallback.py` covering the
  detector contract (returns None on arbitrary strings, still works for
  legit Slack/Discord shapes) and both fallback sites.

**RES-287/4b — Stacked "Syntax error in text" tiles on wiki page**

The bug: when LLM-generated wiki content contained a malformed mermaid
block, the page rendered multiple identical "Diagram could not be
rendered" fallback tiles. Root cause: React StrictMode double-invokes
`useEffect` in dev. The wiki `MermaidBlock` had NO cleanup function, so
both mount cycles raced two concurrent `mermaid.render()` coroutines
against the singleton mermaid instance, the second of which produced an
error SVG → `setError(...)` → fallback tile per block.

Fix:
- `web/src/components/wiki/MermaidBlock.tsx` — adopt the canonical
  cancellation pattern from the sibling `channel/MermaidBlock.tsx`
  (lines 129-184): `let cancelled` flag, `setTimeout` debounce,
  `clearTimeout` cleanup, `mermaid.parse()` validation before
  `mermaid.render()`, and every `setSvg`/`setError` call guarded by
  `if (!cancelled)`. The critic specifically called out matching this
  pattern verbatim instead of an incomplete snippet.
- 5 new vitest tests in `web/src/components/wiki/__tests__/MermaidBlock.test.tsx`:
  happy path, single-fallback-only on invalid chart, two blocks produce
  two fallbacks (not four — regression test for the stacking symptom),
  StrictMode double-mount produces single fallback, mermaid v11 error-SVG
  fallback.

**LLM prompt changes:** none needed. The wiki prompts in
`src/beever_atlas/wiki/prompts.py` already restrict mermaid to
`graph TD`/`flowchart`; the architect agent verified no
`sequenceDiagram`/`erDiagram`/`gantt`/`pie` is ever requested.

**Tests:** Python 58 / 58 in affected areas. Web 536 / 536 across all
60 vitest files. TypeScript clean. Lint clean (warnings pre-existing).

Constraint: Teams and Telegram listChannels() rely on registry entries
            populated by webhooks; the ring-buffer TTL pattern doesn't
            apply there (those are not failure logs)
Rejected: Client-side TTL filter in useRecentLLMCalls.ts | server filter
          is sufficient; two time sources would diverge under clock skew
Rejected: Wholesale clearing the user-profile cache on every recycle |
          already wired in via RES-286 — no new code needed
Rejected: LLM prompt constraints on diagram types | prompts already
          restrict to graph TD/flowchart per audit
Directive: When extending /api/settings/debug/recent-llm-calls, keep the
           default TTL at 30 min — operator-debugging needs are served
           by ?max_age_seconds, not by changing the default
Directive: If a future mermaid version exposes a true async-cancellation
           API, refactor MermaidBlock to use it instead of the
           cancelled-flag pattern
Confidence: high
Scope-risk: narrow

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(web): channel-sync state isolation + top-nav gate during sync (RES-285) (#187) (#23)

Two bundled UX bugs the QA hit while syncing Mattermost channels.
Designed via a full ralplan consensus loop (Planner → Architect → Critic);
2 iterations to APPROVE — the substantive shift in iter 2 was flipping
Bug A from a local useEffect reset to a route-level remount key, which
fixes FIVE state-leak vectors instead of one.

## Bug A — Cross-channel state leak
Starting a sync on #marketing and switching to #tech-beever-atlas left
the previous channel's progress bar visible (and worse, could permanently
freeze on the new channel via the lastFingerprintRef dedup guard). Root
cause: the route had no `key` prop, so React Router reused the same
ChannelWorkspace instance across :id changes. ALL useState cells in it
— syncState, channel, cooldownRemaining, refreshing, loadingChannel —
persisted across channel nav.

Fix: new `web/src/routes/ChannelWorkspaceRoute.tsx` — thin wrapper that
reads :id via useParams and mounts <ChannelWorkspace key={id} />. The
React key change forces an atomic unmount + remount of the entire
subtree, so every state cell resets. `App.tsx:88` now routes to the
wrapper.

This is structurally correct: channel-scoped state IS keyed to the
channel, not the component instance. Fixes the visible progress-bar
leak AND four sibling leaks for free (cooldown countdown carry-over,
stale channel.name flash on switch, etc.).

## Bug B — Top-nav not gated during sync
Top-nav tabs (Home, Channels, Ask, Activity, Settings) remained clickable
while a sync was running, even though leaving mid-sync drops MM ws
events. Channel-list switching in the sidebar should stay enabled
(channels are isolated now per Bug A); only the top-nav needs the gate.

Fix: new `web/src/contexts/SyncStatusContext.tsx` mirroring the existing
AskSessionsContext.tsx pattern. Splits state into TWO useState cells —
`isSyncRunning: boolean` and `channelId: string|null` — so React's
Object.is bail-out keeps subscribers from re-rendering when publish
values are equal (a single-object setState would defeat this; the
architect+critic specifically demanded the split).

Publisher in ChannelWorkspace.tsx: a useEffect keyed on
`syncState.state` (string, NOT the whole syncState object — prevents
per-poll thrash) publishes the narrowed boolean, plus a cleanup that
resets the gate to false on unmount.

Subscriber in Sidebar.tsx gates the 4 top-nav NavLinks (Home EXCLUDED
— universal escape hatch, intentional invariant from the Home-trap
design decision). Triple-defense gate: aria-disabled (a11y),
tabIndex={-1} (keyboard tab skip), onClick preventDefault (click +
Enter no-op). Tooltip names the syncing channel. The mobile-sheet
onClose handler at Sidebar.tsx:149 is preserved via merged onClick.

Gate fires ONLY on `state === "syncing"`. NOT on error (terminal —
user needs Settings to recover; gating Settings would trap them) or
idle/completed.

## Tests
- 5 SyncStatusContext tests (default value, throws-outside-provider,
  AC6 render-count discipline, setter stability, channelId carries)
- 3 ChannelWorkspaceRoute tests (renders for current :id, unmount
  + remount on :id change via useNavigate, returns null without :id)
- 544 / 544 web tests pass (was 536). TypeScript clean. 0 lint errors.

The single most important regression test is the AC6 guard:
publishing an already-equal boolean to the context does NOT re-render
subscribers. If a future refactor wraps isSyncRunning+channelId back
into a single object setState, that test fails noisily — preventing
accidentally re-introducing the publisher thrash this design avoids.

Constraint: ChannelWorkspace state is keyed to component instance,
            not channel — wrong without the route key
Constraint: Sidebar is a sibling subtree from ChannelWorkspace —
            useSync hook cannot be subscribed twice
Constraint: Zustand and TanStack Query are not in the web dep graph
Rejected: useEffect synchronous reset in useSync.ts | symptom fix;
          patches one of five leaking state cells, leaves four sibling
          leaks (cooldown, channel.name, etc.) in place
Rejected: Single-object useState({isSyncRunning, channelId}) | breaks
          AC6 — fresh object literal every publish defeats Object.is
          bail-out, consumers re-render on identical publishes
Rejected: Gate on pipelineActive (any non-idle state) | traps user
          away from Settings during error state with no recovery path
Rejected: Gate Home too | universal escape hatch principle; user must
          always be able to reach the dashboard
Rejected: Zustand store for sync state | not in dep graph; the React
          Context pattern is already used by AskSessionsContext
Directive: If a future state cell is added to ChannelWorkspace, it
           automatically resets on channel switch — no per-cell reset
           code needed (this is the "structural cause fix" property)
Directive: Publisher useEffect dep array MUST be primitives only.
           DO NOT add syncState (the whole object) — would thrash on
           every poll tick
Confidence: high
Scope-risk: narrow
Not-tested: modifier-click (Cmd/Ctrl+click) on a gated NavLink opens
            in a new tab and bypasses the gate visually — accepted as
            new-tab boots a fresh app context with no state leak

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(web): RES-285 follow-ups — sidebar indicator, collapsed monitor, wiki almost-ready, mermaid orphan reaper (#188)

* fix(web): RES-285 follow-ups — sidebar sync indicator, collapsed monitor default, wiki "almost ready", mermaid orphan reaper

Five small UX/correctness follow-ups discovered during QA of RES-285 (PR #187).
All five share the SyncStatusContext + ChannelWorkspaceRoute infrastructure
that PR landed, so they ship as a single bundle.

## 1. Sidebar sync indicator (WorkspaceGroup.tsx)
Subscribers to `useSyncStatus()` now paint a pulsing primary-color dot
(replacing the wiki-state icon) + bold channel name + "Syncing now…"
tooltip on whichever row matches `syncingChannelId`. Closes the
"top-nav greyed out but I don't know WHICH channel" gap.

## 2. Sync monitor collapsed by default (ChannelWorkspace.tsx)
`monitorCollapsed` initial state flipped from `false` to `true`. The
existing `localStorage` key now treats anything-other-than-explicit-
"false" as collapsed — new users get the compact view, anyone who
previously clicked Expand keeps that preference. SyncProgressV2's
existing Expand button is the affordance.

## 3. Wiki tab "Wiki will start shortly" state (WikiTab.tsx)
New empty-state branch when (a) sync + extraction are done
(hasMemories=true), (b) `overview_wiki.state === "pending"`,
(c) `wiki_maintenance.done === 0`. Previously the user saw "No Wiki
Yet" + Generate button even though the AutoOverviewSubscriber was
about to fire — misleading. Now: "Wiki will start shortly — auto-
overview is queued. You can click Generate to start it now."
Narrowed on `pending` specifically (NOT undefined) so legacy/feature-
flag-off backends correctly fall through to the original CTA.

## 4. Publisher widening (ChannelWorkspace.tsx)
RES-285's publisher only fired on `syncState.state === "syncing"`,
but `useSync.ts:300-304` shows the backend can return `state: "idle"`
with phases `in_flight` (the "warming up" window after dispatch). My
narrow check missed that window, so the top-nav gate AND the new
sidebar indicator never lit up. Widened to fire on `state === "syncing"
|| anyPhaseInFlight`. Still excludes `error` per the ralplan decision.

## 5. Mermaid orphan-DOM reaper (wiki/MermaidBlock.tsx + channel/MermaidBlock.tsx)
RES-287/4b's cancellation guard handled the React state side of the
StrictMode race, but missed that mermaid v11 leaves a temp `<div
id="d${id}">` in `document.body` after parse failures. Those orphan
divs render the bomb-emoji "Syntax error in text" SVG at the bottom
of the page — visible OUTSIDE any React boundary. Reaper tracks every
id we ask mermaid to render and removes matching elements (by id +
by a textual `Syntax error in text` sweep across direct body
children) after every render attempt + on unmount. Applied to both
MermaidBlock implementations for consistency.

## Why all in one PR
- All five share the SyncStatusContext or RES-287 surface PR #187/186 created.
- Total production diff: ~120 LOC; all additive.
- None of them are independent bug reports — they're refinements
  caught during QA hands-on of the already-merged fixes.

Tests: 11 / 11 MermaidBlock vitest tests still pass. TypeScript clean.
The existing SyncStatusContext + ChannelWorkspaceRoute tests already
guard the publisher/subscriber/key-remount paths; the widening at
#4 only relaxes the publisher's `true` condition (still narrows
strictly to "sync is actually running"), so the AC6 render-count
discipline is preserved.

Constraint: useSync may return `state: "idle"` with phases in_flight
            (verified at useSync.ts:300-304)
Constraint: Mermaid v11 does NOT reliably clean up temp body divs
            after parse failures; cancellation guard alone isn't
            enough — DOM-level cleanup is required
Rejected: Single-string `state === "syncing"` for publisher | misses
          the in_flight-only window where sync is actively running
Rejected: Auto-trigger wiki generation after extract | backend
          AutoOverviewSubscriber already does this when the feature
          flag is on; FE just needs to communicate "queued" state
Rejected: Remove the user's Expand preference on monitorCollapsed
          flip | breaks anyone who previously clicked Expand
Directive: If a future mermaid version exposes a `cleanup(id)` or
           accepts an explicit container, switch to that and remove
           the textual body sweep
Directive: When adding new state cells to ChannelWorkspace that
           should reset on channel switch, no code is needed — the
           route remount key handles it (RES-285 invariant)
Confidence: high
Scope-risk: narrow

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(web): support concurrent channel syncs + global poller for sidebar indicator

User-observed gap after the initial RES-285 follow-ups landed: the
sidebar indicator only lit up the syncing channel WHILE the user was
on its own page. Navigate away and the indicator vanished — defeating
the point of an at-a-glance "what's syncing?" signal.

The architectural fix is to move the sync-status state from per-channel
publisher to a global tracker that the Provider itself maintains, so
the signal survives the channel's own ChannelWorkspace unmounting.

Also forward-fixes a missed architectural requirement: the system must
support MULTIPLE concurrent syncs across channels. The previous
`channelId: string | null` design assumed at most one at a time;
swapped for `syncingChannels: Set<string>` so the FE no longer
constrains backend concurrency.

## Changes

`web/src/contexts/SyncStatusContext.tsx`:
- State shape: `syncingChannels: ReadonlySet<string>` (was a single id).
  Consumers derive `isSyncRunning = size > 0` and per-channel checks
  via `.has()`.
- Public API: `claim(channelId)` and `release(channelId)` — both
  idempotent, both referentially stable via `useCallback([])`. Set
  identity is preserved when claim/release is a no-op so consumers
  don't re-render unnecessarily (AC6 discipline preserved).
- New: background poller `useEffect` that polls
  `/api/channels/{id}/sync/status` for every tracked id every 5s.
  Releases ids when the backend reports no active sync. Stops
  entirely when the set is empty. Survives ChannelWorkspace mount/
  unmount across navigation.

`web/src/pages/ChannelWorkspace.tsx`:
- Publisher protocol: `claim(id)` when sync is running here,
  `release(id)` otherwise. No unmount cleanup — the Provider's
  poller is the authoritative release path. Channels' publishers
  never touch each other's slots, supporting concurrent syncs.

`web/src/components/layout/Sidebar.tsx`:
- Derives `isSyncRunning = syncingChannels.size > 0`.
- Tooltip now reflects count when multiple syncs are active.

`web/src/components/channel/WorkspaceGroup.tsx`:
- Row indicator: `syncingChannels.has(ch.channel_id)` instead of an
  equality check against a single id — so every concurrent-sync row
  lights up, not just one.

`web/src/contexts/__tests__/SyncStatusContext.test.tsx`:
- Full rewrite for the new API. 7 tests:
  default empty set; throws outside provider; claim/release lifecycle;
  multi-channel support (3 concurrent claims, partial release);
  idempotent claim (no re-render); idempotent release; setter
  referential stability across renders.

Tests: 546 / 546 web tests pass (was 544; 2 net new tests on the
SyncStatusContext file). TypeScript clean.

Constraint: ChannelWorkspace's publisher unmounts on channel navigation,
            so it cannot be the authoritative source of "sync ended"
Constraint: Backend may relax single-sync constraint in the future;
            FE state model must not assume one-at-a-time
Rejected: Keep `channelId: string | null` and add a "shadow" persisted
          field for nav survival | doubles state + creates a sync
          problem between the two cells; Set-based is cleaner
Rejected: Single-channel mode toggled by feature flag | speculative
          generality, adds branching with no current benefit
Directive: When adding new state cells to SyncStatusContext, derive
           from the Set rather than adding parallel state; the Set is
           the canonical source of truth
Confidence: high
Scope-risk: narrow
Not-tested: the poller's behaviour against a real 404 / channel-
            deleted response — accepted as a follow-up

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* revert(web): drop top-nav gate during sync — sidebar indicator is enough

Product decision: gating top-nav links on sync was paternalistic.
Locking the user out of Settings / Activity / Channels during a
background sync prevents legitimate parallel work and offers no real
safety benefit — the bot keeps syncing regardless of which page the
user is on.

The sidebar row indicator (pulsing dot + bold name on every syncing
channel) gives the user the awareness signal they need. They can
choose to navigate back to the syncing channel when they want
progress detail; the indicator points the way.

What stays:
- SyncStatusContext (still the source of truth for which channels are
  syncing; the indicator depends on it)
- ChannelWorkspace publisher (claim/release)
- Provider's background poller (releases stale ids)
- WorkspaceGroup pulsing-dot indicator

What's removed from Sidebar.tsx:
- `useSyncStatus()` destructuring + `isSyncRunning` derivation
- `gateTooltipText` computation
- The `gated` flag in the NavLink map
- `aria-disabled`, `tabIndex={-1}`, `onClick preventDefault`
- The "Sync in progress — wait for completion" tooltip
- The disabled visual styling
- The Tooltip wrapper for gated rows when sidebar is expanded

Tests: 546 / 546 web tests still pass. TypeScript clean.
No SyncStatusContext API change — only the Sidebar consumer goes
back to its pre-RES-285 simple form.

Constraint: User-reported requirement — "we don't need to lock other
            tabs"
Rejected: Soft-gate (visual warning without click prevention) | adds
          UX inconsistency for no clear benefit; the row indicator
          already conveys the same info more visibly
Directive: Don't reintroduce top-nav gating without explicit product
           buy-in; the row indicator is the canonical awareness UX
Confidence: high
Scope-risk: narrow

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* fix(web): dismissable stale sync-failure banner

Backend returns the last failure on /sync/status until a newer sync
succeeds. For channels where the user doesn't want to retry — e.g. the
failure is a stale artifact from the RES-286 bot-outage era — the red
banner sits visible forever. User-reported: "It's the old record, how
to dismiss it?"

Adds a per-channel dismiss UX:
- X button on the failure banner (NOT the cooldown banner — cooldown
  is time-bounded informational state)
- Dismissal stored in localStorage keyed by channel id:
    `beever.sync-failure-dismissed.{channel_id}` = "{job_id}|{message}"
- Signature uses job_id + first 200 chars of message so a NEW failure
  (different job_id or different copy) brings the banner back. Same
  failure (same job_id) stays hidden.
- Re-hydrates from storage on channel switch (state survives the
  ChannelWorkspace remount-key cycle from RES-285).

Tests: 546 / 546 web tests pass. TypeScript clean.

Constraint: Cooldown messages must remain visible (time-bounded info,
            not noise)
Rejected: Backend auto-clear of stale failure | broader change, also
          gates on successful sync only — doesn't help channels the
          user is intentionally NOT resyncing
Rejected: Global "dismiss all" toggle | leakier UX; per-channel +
          per-signature is what the user actually needs
Directive: When the failure copy changes (new job_id or new message),
           the dismiss does NOT carry — a fresh banner appears. This
           is intentional; don't broaden the signature to "any
           failure on this channel" or stale dismissals will hide
           real new problems
Confidence: high
Scope-risk: narrow
Not-tested: localStorage quota-exceeded (silently degrades to in-
            memory dismissal — acceptable)

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 .claude/commands/opsx/apply.md                |  152 ++
 .claude/commands/opsx/archive.md              |  157 ++
 .claude/commands/opsx/explore.md              |  173 ++
 .claude/commands/opsx/ff.md                   |   97 +
 .claude/commands/opsx/propose.md              |  106 +
 .claude/skills/openspec-apply-change/SKILL.md |  156 ++
 .../skills/openspec-archive-change/SKILL.md   |  114 +
 .claude/skills/openspec-explore/SKILL.md      |  288 +++
 .claude/skills/openspec-ff-change/SKILL.md    |  101 +
 .claude/skills/openspec-propose/SKILL.md      |  110 +
 .github/workflows/codeql.yml                  |   13 +-
 .github/workflows/deploy.yml                  |   38 +
 .github/workflows/trigger-docs-rebuild.yml    |   18 +
 docs/Beever_Atlas_Feature_Spec.docx           |  Bin 0 -> 35075 bytes
 docs/qa/tool-audit-2026-Q2.md                 |  407 ++++
 docs/v1-archive/ARCHITECTURE_OVERVIEW.md      |  788 ++++++
 .../ARCHITECTURE_OVERVIEW_V2_MONOLITH.md      | 1288 ++++++++++
 docs/v1-archive/PROJECT_ANALYSIS.md           |  977 ++++++++
 .../v1-archive/RETRIEVAL_IMPROVEMENT_IDEAS.md |  711 ++++++
 .../v1-archive/TECHNICAL_PROPOSAL_MONOLITH.md | 2105 +++++++++++++++++
 docs/v2/01-architecture-overview.md           |  229 ++
 docs/v2/02-semantic-memory.md                 |  273 +++
 docs/v2/03-graph-memory.md                    |  328 +++
 docs/v2/04-query-router.md                    |  270 +++
 docs/v2/05-ingestion-pipeline.md              |  436 ++++
 docs/v2/06-wiki-generation.md                 |  861 +++++++
 docs/v2/07-deployment.md                      |  247 ++
 docs/v2/08-resilience.md                      |  204 ++
 docs/v2/09-observability.md                   |  125 +
 docs/v2/10-access-control.md                  |   91 +
 docs/v2/11-frontend-design.md                 |  709 ++++++
 docs/v2/12-api-design.md                      |  492 ++++
 docs/v2/13-adk-integration.md                 |  274 +++
 docs/v2/README.md                             |   53 +
 docs/v2/current-architecture-diagram.md       |  432 ++++
 docs/v2/decisions.md                          |   64 +
 docs/v2/memory-architecture.md                |  273 +++
 docs/v2/reference-papers.md                   |  122 +
 docs/v2/weakness-resolution-map.md            |  586 +++++
 .../.openspec.yaml                            |    2 +
 .../ingestion-pipeline-hardening/design.md    |  123 +
 .../ingestion-pipeline-hardening/proposal.md  |   35 +
 .../specs/coreference-resolution/spec.md      |   34 +
 .../specs/cross-batch-thread-context/spec.md  |   30 +
 .../specs/multimodal-expansion/spec.md        |   60 +
 .../specs/semantic-entity-dedup/spec.md       |   41 +
 .../specs/semantic-search/spec.md             |   30 +
 .../specs/soft-orphan-handling/spec.md        |   41 +
 .../specs/temporal-fact-lifecycle/spec.md     |   34 +
 .../ingestion-pipeline-hardening/tasks.md     |   77 +
 .../m1-skeleton-health-pulse/.openspec.yaml   |    2 +
 .../m1-skeleton-health-pulse/design.md        |   62 +
 .../m1-skeleton-health-pulse/proposal.md      |   36 +
 .../specs/adk-foundation/spec.md              |   38 +
 .../specs/bot-placeholder/spec.md             |   30 +
 .../specs/frontend-shell/spec.md              |   77 +
 .../specs/health-endpoint/spec.md             |   45 +
 .../specs/memories-browser/spec.md            |   62 +
 .../specs/project-scaffold/spec.md            |   59 +
 .../changes/m1-skeleton-health-pulse/tasks.md |   61 +
 .../m2-chatbot-echo-query/.openspec.yaml      |    2 +
 .../changes/m2-chatbot-echo-query/design.md   |   80 +
 .../changes/m2-chatbot-echo-query/proposal.md |   32 +
 .../specs/adk-echo-agent/spec.md              |   33 +
 .../specs/ask-endpoint/spec.md                |   48 +
 .../specs/channel-workspace/spec.md           |   59 +
 .../specs/chat-bot/spec.md                    |   59 +
 .../specs/normalized-message/spec.md          |  106 +
 .../changes/m2-chatbot-echo-query/tasks.md    |   65 +
 .../messages-tab-enhancement/.openspec.yaml   |    2 +
 .../messages-tab-enhancement/design.md        |   55 +
 .../messages-tab-enhancement/proposal.md      |   31 +
 .../message-display-enhancements/spec.md      |   59 +
 .../specs/message-filtering/spec.md           |   44 +
 .../specs/message-pagination/spec.md          |   45 +
 .../changes/messages-tab-enhancement/tasks.md |   55 +
 .../.openspec.yaml                            |    2 +
 .../multi-workspace-connections/design.md     |   98 +
 .../multi-workspace-connections/proposal.md   |   31 +
 .../specs/multi-connection-backend/spec.md    |   99 +
 .../specs/multi-connection-bot/spec.md        |  124 +
 .../specs/multi-connection-frontend/spec.md   |   61 +
 .../multi-workspace-connections/tasks.md      |   86 +
 .../.openspec.yaml                            |    2 +
 .../oss-cla-copyright-assignment/design.md    |  117 +
 .../oss-cla-copyright-assignment/proposal.md  |   57 +
 .../specs/copyright-posture/spec.md           |   80 +
 .../oss-cla-copyright-assignment/tasks.md     |   18 +
 .../.openspec.yaml                            |    2 +
 .../res-177-p0-quality-hardening/design.md    |  203 ++
 .../res-177-p0-quality-hardening/proposal.md  |  115 +
 .../specs/backend-test-baseline/spec.md       |   66 +
 .../specs/bot-bridge-decomposition/spec.md    |   79 +
 .../specs/bot-dependency-pinning/spec.md      |   33 +
 .../specs/ci-quality-gates/spec.md            |   61 +
 .../specs/container-supply-chain/spec.md      |   41 +
 .../specs/docs-env-hygiene/spec.md            |   68 +
 .../specs/web-test-harness/spec.md            |   38 +
 .../res-177-p0-quality-hardening/tasks.md     |  106 +
 openspec/config.yaml                          |    1 +
 scripts/deploy/.gitignore                     |    1 +
 scripts/deploy/README.md                      |   53 +
 scripts/deploy/bootstrap.sh                   |   58 +
 scripts/deploy/deploy.sh                      |  112 +
 scripts/deploy/destroy.sh                     |   41 +
 scripts/deploy/provision.sh                   |  133 ++
 scripts/deploy/ssh.sh                         |    9 +
 scripts/deploy/start.sh                       |   10 +
 scripts/deploy/stop.sh                        |   10 +
 .../__tests__/AgentModelsTab.test.tsx         |   16 +-
 web/src/pages/ChannelWorkspace.tsx            |   73 +-
 111 files changed, 17713 insertions(+), 13 deletions(-)
 create mode 100644 .claude/commands/opsx/apply.md
 create mode 100644 .claude/commands/opsx/archive.md
 create mode 100644 .claude/commands/opsx/explore.md
 create mode 100644 .claude/commands/opsx/ff.md
 create mode 100644 .claude/commands/opsx/propose.md
 create mode 100644 .claude/skills/openspec-apply-change/SKILL.md
 create mode 100644 .claude/skills/openspec-archive-change/SKILL.md
 create mode 100644 .claude/skills/openspec-explore/SKILL.md
 create mode 100644 .claude/skills/openspec-ff-change/SKILL.md
 create mode 100644 .claude/skills/openspec-propose/SKILL.md
 create mode 100644 .github/workflows/deploy.yml
 create mode 100644 .github/workflows/trigger-docs-rebuild.yml
 create mode 100644 docs/Beever_Atlas_Feature_Spec.docx
 create mode 100644 docs/qa/tool-audit-2026-Q2.md
 create mode 100644 docs/v1-archive/ARCHITECTURE_OVERVIEW.md
 create mode 100644 docs/v1-archive/ARCHITECTURE_OVERVIEW_V2_MONOLITH.md
 create mode 100644 docs/v1-archive/PROJECT_ANALYSIS.md
 create mode 100644 docs/v1-archive/RETRIEVAL_IMPROVEMENT_IDEAS.md
 create mode 100644 docs/v1-archive/TECHNICAL_PROPOSAL_MONOLITH.md
 create mode 100644 docs/v2/01-architecture-overview.md
 create mode 100644 docs/v2/02-semantic-memory.md
 create mode 100644 docs/v2/03-graph-memory.md
 create mode 100644 docs/v2/04-query-router.md
 create mode 100644 docs/v2/05-ingestion-pipeline.md
 create mode 100644 docs/v2/06-wiki-generation.md
 create mode 100644 docs/v2/07-deployment.md
 create mode 100644 docs/v2/08-resilience.md
 create mode 100644 docs/v2/09-observability.md
 create mode 100644 docs/v2/10-access-control.md
 create mode 100644 docs/v2/11-frontend-design.md
 create mode 100644 docs/v2/12-api-design.md
 create mode 100644 docs/v2/13-adk-integration.md
 create mode 100644 docs/v2/README.md
 create mode 100644 docs/v2/current-architecture-diagram.md
 create mode 100644 docs/v2/decisions.md
 create mode 100644 docs/v2/memory-architecture.md
 create mode 100644 docs/v2/reference-papers.md
 create mode 100644 docs/v2/weakness-resolution-map.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/.openspec.yaml
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/design.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/proposal.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/specs/coreference-resolution/spec.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/specs/cross-batch-thread-context/spec.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/specs/multimodal-expansion/spec.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/specs/semantic-entity-dedup/spec.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/specs/semantic-search/spec.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/specs/soft-orphan-handling/spec.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/specs/temporal-fact-lifecycle/spec.md
 create mode 100644 openspec/changes/ingestion-pipeline-hardening/tasks.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/.openspec.yaml
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/design.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/proposal.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/specs/adk-foundation/spec.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/specs/bot-placeholder/spec.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/specs/frontend-shell/spec.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/specs/health-endpoint/spec.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/specs/memories-browser/spec.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/specs/project-scaffold/spec.md
 create mode 100644 openspec/changes/m1-skeleton-health-pulse/tasks.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/.openspec.yaml
 create mode 100644 openspec/changes/m2-chatbot-echo-query/design.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/proposal.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/specs/adk-echo-agent/spec.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/specs/ask-endpoint/spec.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/specs/channel-workspace/spec.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/specs/chat-bot/spec.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/specs/normalized-message/spec.md
 create mode 100644 openspec/changes/m2-chatbot-echo-query/tasks.md
 create mode 100644 openspec/changes/messages-tab-enhancement/.openspec.yaml
 create mode 100644 openspec/changes/messages-tab-enhancement/design.md
 create mode 100644 openspec/changes/messages-tab-enhancement/proposal.md
 create mode 100644 openspec/changes/messages-tab-enhancement/specs/message-display-enhancements/spec.md
 create mode 100644 openspec/changes/messages-tab-enhancement/specs/message-filtering/spec.md
 create mode 100644 openspec/changes/messages-tab-enhancement/specs/message-pagination/spec.md
 create mode 100644 openspec/changes/messages-tab-enhancement/tasks.md
 create mode 100644 openspec/changes/multi-workspace-connections/.openspec.yaml
 create mode 100644 openspec/changes/multi-workspace-connections/design.md
 create mode 100644 openspec/changes/multi-workspace-connections/proposal.md
 create mode 100644 openspec/changes/multi-workspace-connections/specs/multi-connection-backend/spec.md
 create mode 100644 openspec/changes/multi-workspace-connections/specs/multi-connection-bot/spec.md
 create mode 100644 openspec/changes/multi-workspace-connections/specs/multi-connection-frontend/spec.md
 create mode 100644 openspec/changes/multi-workspace-connections/tasks.md
 create mode 100644 openspec/changes/oss-cla-copyright-assignment/.openspec.yaml
 create mode 100644 openspec/changes/oss-cla-copyright-assignment/design.md
 create mode 100644 openspec/changes/oss-cla-copyright-assignment/proposal.md
 create mode 100644 openspec/changes/oss-cla-copyright-assignment/specs/copyright-posture/spec.md
 create mode 100644 openspec/changes/oss-cla-copyright-assignment/tasks.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/.openspec.yaml
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/design.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/proposal.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/specs/backend-test-baseline/spec.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/specs/bot-bridge-decomposition/spec.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/specs/bot-dependency-pinning/spec.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/specs/ci-quality-gates/spec.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/specs/container-supply-chain/spec.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/specs/docs-env-hygiene/spec.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/specs/web-test-harness/spec.md
 create mode 100644 openspec/changes/res-177-p0-quality-hardening/tasks.md
 create mode 100644 openspec/config.yaml
 create mode 100644 scripts/deploy/.gitignore
 create mode 100644 scripts/deploy/README.md
 create mode 100755 scripts/deploy/bootstrap.sh
 create mode 100755 scripts/deploy/deploy.sh
 create mode 100755 scripts/deploy/destroy.sh
 create mode 100755 scripts/deploy/provision.sh
 create mode 100755 scripts/deploy/ssh.sh
 create mode 100755 scripts/deploy/start.sh
 create mode 100755 scripts/deploy/stop.sh

diff --git a/.claude/commands/opsx/apply.md b/.claude/commands/opsx/apply.md
new file mode 100644
index 00000000..bf23721d
--- /dev/null
+++ b/.claude/commands/opsx/apply.md
@@ -0,0 +1,152 @@
+---
+name: "OPSX: Apply"
+description: Implement tasks from an OpenSpec change (Experimental)
+category: Workflow
+tags: [workflow, artifacts, experimental]
+---
+
+Implement tasks from an OpenSpec change.
+
+**Input**: Optionally specify a change name (e.g., `/opsx:apply add-auth`). If omitted, check if it can be inferred from conversation context. If vague or ambiguous you MUST prompt for available changes.
+
+**Steps**
+
+1. **Select the change**
+
+   If a name is provided, use it. Otherwise:
+   - Infer from conversation context if the user mentioned a change
+   - Auto-select if only one active change exists
+   - If ambiguous, run `openspec list --json` to get available changes and use the **AskUserQuestion tool** to let the user select
+
+   Always announce: "Using change: <name>" and how to override (e.g., `/opsx:apply <other>`).
+
+2. **Check status to understand the schema**
+   ```bash
+   openspec status --change "<name>" --json
+   ```
+   Parse the JSON to understand:
+   - `schemaName`: The workflow being used (e.g., "spec-driven")
+   - Which artifact contains the tasks (typically "tasks" for spec-driven, check status for others)
+
+3. **Get apply instructions**
+
+   ```bash
+   openspec instructions apply --change "<name>" --json
+   ```
+
+   This returns:
+   - Context file paths (varies by schema)
+   - Progress (total, complete, remaining)
+   - Task list with status
+   - Dynamic instruction based on current state
+
+   **Handle states:**
+   - If `state: "blocked"` (missing artifacts): show message, suggest using `/opsx:continue`
+   - If `state: "all_done"`: congratulate, suggest archive
+   - Otherwise: proceed to implementation
+
+4. **Read context files**
+
+   Read the files listed in `contextFiles` from the apply instructions output.
+   The files depend on the schema being used:
+   - **spec-driven**: proposal, specs, design, tasks
+   - Other schemas: follow the contextFiles from CLI output
+
+5. **Show current progress**
+
+   Display:
+   - Schema being used
+   - Progress: "N/M tasks complete"
+   - Remaining tasks overview
+   - Dynamic instruction from CLI
+
+6. **Implement tasks (loop until done or blocked)**
+
+   For each pending task:
+   - Show which task is being worked on
+   - Make the code changes required
+   - Keep changes minimal and focused
+   - Mark task complete in the tasks file: `- [ ]` → `- [x]`
+   - Continue to next task
+
+   **Pause if:**
+   - Task is unclear → ask for clarification
+   - Implementation reveals a design issue → suggest updating artifacts
+   - Error or blocker encountered → report and wait for guidance
+   - User interrupts
+
+7. **On completion or pause, show status**
+
+   Display:
+   - Tasks completed this session
+   - Overall progress: "N/M tasks complete"
+   - If all done: suggest archive
+   - If paused: explain why and wait for guidance
+
+**Output During Implementation**
+
+```
+## Implementing: <change-name> (schema: <schema-name>)
+
+Working on task 3/7: <task description>
+[...implementation happening...]
+✓ Task complete
+
+Working on task 4/7: <task description>
+[...implementation happening...]
+✓ Task complete
+```
+
+**Output On Completion**
+
+```
+## Implementation Complete
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Progress:** 7/7 tasks complete ✓
+
+### Completed This Session
+- [x] Task 1
+- [x] Task 2
+...
+
+All tasks complete! You can archive this change with `/opsx:archive`.
+```
+
+**Output On Pause (Issue Encountered)**
+
+```
+## Implementation Paused
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Progress:** 4/7 tasks complete
+
+### Issue Encountered
+<description of the issue>
+
+**Options:**
+1. <option 1>
+2. <option 2>
+3. Other approach
+
+What would you like to do?
+```
+
+**Guardrails**
+- Keep going through tasks until done or blocked
+- Always read context files before starting (from the apply instructions output)
+- If task is ambiguous, pause and ask before implementing
+- If implementation reveals issues, pause and suggest artifact updates
+- Keep code changes minimal and scoped to each task
+- Update task checkbox immediately after completing each task
+- Pause on errors, blockers, or unclear requirements - don't guess
+- Use contextFiles from CLI output, don't assume specific file names
+
+**Fluid Workflow Integration**
+
+This skill supports the "actions on a change" model:
+
+- **Can be invoked anytime**: Before all artifacts are done (if tasks exist), after partial implementation, interleaved with other actions
+- **Allows artifact updates**: If implementation reveals design issues, suggest updating artifacts - not phase-locked, work fluidly
diff --git a/.claude/commands/opsx/archive.md b/.claude/commands/opsx/archive.md
new file mode 100644
index 00000000..5e916083
--- /dev/null
+++ b/.claude/commands/opsx/archive.md
@@ -0,0 +1,157 @@
+---
+name: "OPSX: Archive"
+description: Archive a completed change in the experimental workflow
+category: Workflow
+tags: [workflow, archive, experimental]
+---
+
+Archive a completed change in the experimental workflow.
+
+**Input**: Optionally specify a change name after `/opsx:archive` (e.g., `/opsx:archive add-auth`). If omitted, check if it can be inferred from conversation context. If vague or ambiguous you MUST prompt for available changes.
+
+**Steps**
+
+1. **If no change name provided, prompt for selection**
+
+   Run `openspec list --json` to get available changes. Use the **AskUserQuestion tool** to let the user select.
+
+   Show only active changes (not already archived).
+   Include the schema used for each change if available.
+
+   **IMPORTANT**: Do NOT guess or auto-select a change. Always let the user choose.
+
+2. **Check artifact completion status**
+
+   Run `openspec status --change "<name>" --json` to check artifact completion.
+
+   Parse the JSON to understand:
+   - `schemaName`: The workflow being used
+   - `artifacts`: List of artifacts with their status (`done` or other)
+
+   **If any artifacts are not `done`:**
+   - Display warning listing incomplete artifacts
+   - Prompt user for confirmation to continue
+   - Proceed if user confirms
+
+3. **Check task completion status**
+
+   Read the tasks file (typically `tasks.md`) to check for incomplete tasks.
+
+   Count tasks marked with `- [ ]` (incomplete) vs `- [x]` (complete).
+
+   **If incomplete tasks found:**
+   - Display warning showing count of incomplete tasks
+   - Prompt user for confirmation to continue
+   - Proceed if user confirms
+
+   **If no tasks file exists:** Proceed without task-related warning.
+
+4. **Assess delta spec sync state**
+
+   Check for delta specs at `openspec/changes/<name>/specs/`. If none exist, proceed without sync prompt.
+
+   **If delta specs exist:**
+   - Compare each delta spec with its corresponding main spec at `openspec/specs/<capability>/spec.md`
+   - Determine what changes would be applied (adds, modifications, removals, renames)
+   - Show a combined summary before prompting
+
+   **Prompt options:**
+   - If changes needed: "Sync now (recommended)", "Archive without syncing"
+   - If already synced: "Archive now", "Sync anyway", "Cancel"
+
+   If user chooses sync, use Task tool (subagent_type: "general-purpose", prompt: "Use Skill tool to invoke openspec-sync-specs for change '<name>'. Delta spec analysis: <include the analyzed delta spec summary>"). Proceed to archive regardless of choice.
+
+5. **Perform the archive**
+
+   Create the archive directory if it doesn't exist:
+   ```bash
+   mkdir -p openspec/changes/archive
+   ```
+
+   Generate target name using current date: `YYYY-MM-DD-<change-name>`
+
+   **Check if target already exists:**
+   - If yes: Fail with error, suggest renaming existing archive or using different date
+   - If no: Move the change directory to archive
+
+   ```bash
+   mv openspec/changes/<name> openspec/changes/archive/YYYY-MM-DD-<name>
+   ```
+
+6. **Display summary**
+
+   Show archive completion summary including:
+   - Change name
+   - Schema that was used
+   - Archive location
+   - Spec sync status (synced / sync skipped / no delta specs)
+   - Note about any warnings (incomplete artifacts/tasks)
+
+**Output On Success**
+
+```
+## Archive Complete
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Archived to:** openspec/changes/archive/YYYY-MM-DD-<name>/
+**Specs:** ✓ Synced to main specs
+
+All artifacts complete. All tasks complete.
+```
+
+**Output On Success (No Delta Specs)**
+
+```
+## Archive Complete
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Archived to:** openspec/changes/archive/YYYY-MM-DD-<name>/
+**Specs:** No delta specs
+
+All artifacts complete. All tasks complete.
+```
+
+**Output On Success With Warnings**
+
+```
+## Archive Complete (with warnings)
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Archived to:** openspec/changes/archive/YYYY-MM-DD-<name>/
+**Specs:** Sync skipped (user chose to skip)
+
+**Warnings:**
+- Archived with 2 incomplete artifacts
+- Archived with 3 incomplete tasks
+- Delta spec sync was skipped (user chose to skip)
+
+Review the archive if this was not intentional.
+```
+
+**Output On Error (Archive Exists)**
+
+```
+## Archive Failed
+
+**Change:** <change-name>
+**Target:** openspec/changes/archive/YYYY-MM-DD-<name>/
+
+Target archive directory already exists.
+
+**Options:**
+1. Rename the existing archive
+2. Delete the existing archive if it's a duplicate
+3. Wait until a different date to archive
+```
+
+**Guardrails**
+- Always prompt for change selection if not provided
+- Use artifact graph (openspec status --json) for completion checking
+- Don't block archive on warnings - just inform and confirm
+- Preserve .openspec.yaml when moving to archive (it moves with the directory)
+- Show clear summary of what happened
+- If sync is requested, use the Skill tool to invoke `openspec-sync-specs` (agent-driven)
+- If delta specs exist, always run the sync assessment and show the combined summary before prompting
diff --git a/.claude/commands/opsx/explore.md b/.claude/commands/opsx/explore.md
new file mode 100644
index 00000000..30d9c57a
--- /dev/null
+++ b/.claude/commands/opsx/explore.md
@@ -0,0 +1,173 @@
+---
+name: "OPSX: Explore"
+description: "Enter explore mode - think through ideas, investigate problems, clarify requirements"
+category: Workflow
+tags: [workflow, explore, experimental, thinking]
+---
+
+Enter explore mode. Think deeply. Visualize freely. Follow the conversation wherever it goes.
+
+**IMPORTANT: Explore mode is for thinking, not implementing.** You may read files, search code, and investigate the codebase, but you must NEVER write code or implement features. If the user asks you to implement something, remind them to exit explore mode first and create a change proposal. You MAY create OpenSpec artifacts (proposals, designs, specs) if the user asks—that's capturing thinking, not implementing.
+
+**This is a stance, not a workflow.** There are no fixed steps, no required sequence, no mandatory outputs. You're a thinking partner helping the user explore.
+
+**Input**: The argument after `/opsx:explore` is whatever the user wants to think about. Could be:
+- A vague idea: "real-time collaboration"
+- A specific problem: "the auth system is getting unwieldy"
+- A change name: "add-dark-mode" (to explore in context of that change)
+- A comparison: "postgres vs sqlite for this"
+- Nothing (just enter explore mode)
+
+---
+
+## The Stance
+
+- **Curious, not prescriptive** - Ask questions that emerge naturally, don't follow a script
+- **Open threads, not interrogations** - Surface multiple interesting directions and let the user follow what resonates. Don't funnel them through a single path of questions.
+- **Visual** - Use ASCII diagrams liberally when they'd help clarify thinking
+- **Adaptive** - Follow interesting threads, pivot when new information emerges
+- **Patient** - Don't rush to conclusions, let the shape of the problem emerge
+- **Grounded** - Explore the actual codebase when relevant, don't just theorize
+
+---
+
+## What You Might Do
+
+Depending on what the user brings, you might:
+
+**Explore the problem space**
+- Ask clarifying questions that emerge from what they said
+- Challenge assumptions
+- Reframe the problem
+- Find analogies
+
+**Investigate the codebase**
+- Map existing architecture relevant to the discussion
+- Find integration points
+- Identify patterns already in use
+- Surface hidden complexity
+
+**Compare options**
+- Brainstorm multiple approaches
+- Build comparison tables
+- Sketch tradeoffs
+- Recommend a path (if asked)
+
+**Visualize**
+```
+┌─────────────────────────────────────────┐
+│     Use ASCII diagrams liberally        │
+├─────────────────────────────────────────┤
+│                                         │
+│   ┌────────┐         ┌────────┐        │
+│   │ State  │────────▶│ State  │        │
+│   │   A    │         │   B    │        │
+│   └────────┘         └────────┘        │
+│                                         │
+│   System diagrams, state machines,      │
+│   data flows, architecture sketches,    │
+│   dependency graphs, comparison tables  │
+│                                         │
+└─────────────────────────────────────────┘
+```
+
+**Surface risks and unknowns**
+- Identify what could go wrong
+- Find gaps in understanding
+- Suggest spikes or investigations
+
+---
+
+## OpenSpec Awareness
+
+You have full context of the OpenSpec system. Use it naturally, don't force it.
+
+### Check for context
+
+At the start, quickly check what exists:
+```bash
+openspec list --json
+```
+
+This tells you:
+- If there are active changes
+- Their names, schemas, and status
+- What the user might be working on
+
+If the user mentioned a specific change name, read its artifacts for context.
+
+### When no change exists
+
+Think freely. When insights crystallize, you might offer:
+
+- "This feels solid enough to start a change. Want me to create a proposal?"
+- Or keep exploring - no pressure to formalize
+
+### When a change exists
+
+If the user mentions a change or you detect one is relevant:
+
+1. **Read existing artifacts for context**
+   - `openspec/changes/<name>/proposal.md`
+   - `openspec/changes/<name>/design.md`
+   - `openspec/changes/<name>/tasks.md`
+   - etc.
+
+2. **Reference them naturally in conversation**
+   - "Your design mentions using Redis, but we just realized SQLite fits better..."
+   - "The proposal scopes this to premium users, but we're now thinking everyone..."
+
+3. **Offer to capture when decisions are made**
+
+   | Insight Type | Where to Capture |
+   |--------------|------------------|
+   | New requirement discovered | `specs/<capability>/spec.md` |
+   | Requirement changed | `specs/<capability>/spec.md` |
+   | Design decision made | `design.md` |
+   | Scope changed | `proposal.md` |
+   | New work identified | `tasks.md` |
+   | Assumption invalidated | Relevant artifact |
+
+   Example offers:
+   - "That's a design decision. Capture it in design.md?"
+   - "This is a new requirement. Add it to specs?"
+   - "This changes scope. Update the proposal?"
+
+4. **The user decides** - Offer and move on. Don't pressure. Don't auto-capture.
+
+---
+
+## What You Don't Have To Do
+
+- Follow a script
+- Ask the same questions every time
+- Produce a specific artifact
+- Reach a conclusion
+- Stay on topic if a tangent is valuable
+- Be brief (this is thinking time)
+
+---
+
+## Ending Discovery
+
+There's no required ending. Discovery might:
+
+- **Flow into a proposal**: "Ready to start? I can create a change proposal."
+- **Result in artifact updates**: "Updated design.md with these decisions"
+- **Just provide clarity**: User has what they need, moves on
+- **Continue later**: "We can pick this up anytime"
+
+When things crystallize, you might offer a summary - but it's optional. Sometimes the thinking IS the value.
+
+---
+
+## Guardrails
+
+- **Don't implement** - Never write code or implement features. Creating OpenSpec artifacts is fine, writing application code is not.
+- **Don't fake understanding** - If something is unclear, dig deeper
+- **Don't rush** - Discovery is thinking time, not task time
+- **Don't force structure** - Let patterns emerge naturally
+- **Don't auto-capture** - Offer to save insights, don't just do it
+- **Do visualize** - A good diagram is worth many paragraphs
+- **Do explore the codebase** - Ground discussions in reality
+- **Do question assumptions** - Including the user's and your own
diff --git a/.claude/commands/opsx/ff.md b/.claude/commands/opsx/ff.md
new file mode 100644
index 00000000..69f749c9
--- /dev/null
+++ b/.claude/commands/opsx/ff.md
@@ -0,0 +1,97 @@
+---
+name: "OPSX: Fast Forward"
+description: Create a change and generate all artifacts needed for implementation in one go
+category: Workflow
+tags: [workflow, artifacts, experimental]
+---
+
+Fast-forward through artifact creation - generate everything needed to start implementation.
+
+**Input**: The argument after `/opsx:ff` is the change name (kebab-case), OR a description of what the user wants to build.
+
+**Steps**
+
+1. **If no input provided, ask what they want to build**
+
+   Use the **AskUserQuestion tool** (open-ended, no preset options) to ask:
+   > "What change do you want to work on? Describe what you want to build or fix."
+
+   From their description, derive a kebab-case name (e.g., "add user authentication" → `add-user-auth`).
+
+   **IMPORTANT**: Do NOT proceed without understanding what the user wants to build.
+
+2. **Create the change directory**
+   ```bash
+   openspec new change "<name>"
+   ```
+   This creates a scaffolded change at `openspec/changes/<name>/`.
+
+3. **Get the artifact build order**
+   ```bash
+   openspec status --change "<name>" --json
+   ```
+   Parse the JSON to get:
+   - `applyRequires`: array of artifact IDs needed before implementation (e.g., `["tasks"]`)
+   - `artifacts`: list of all artifacts with their status and dependencies
+
+4. **Create artifacts in sequence until apply-ready**
+
+   Use the **TodoWrite tool** to track progress through the artifacts.
+
+   Loop through artifacts in dependency order (artifacts with no pending dependencies first):
+
+   a. **For each artifact that is `ready` (dependencies satisfied)**:
+      - Get instructions:
+        ```bash
+        openspec instructions <artifact-id> --change "<name>" --json
+        ```
+      - The instructions JSON includes:
+        - `context`: Project background (constraints for you - do NOT include in output)
+        - `rules`: Artifact-specific rules (constraints for you - do NOT include in output)
+        - `template`: The structure to use for your output file
+        - `instruction`: Schema-specific guidance for this artifact type
+        - `outputPath`: Where to write the artifact
+        - `dependencies`: Completed artifacts to read for context
+      - Read any completed dependency files for context
+      - Create the artifact file using `template` as the structure
+      - Apply `context` and `rules` as constraints - but do NOT copy them into the file
+      - Show brief progress: "✓ Created <artifact-id>"
+
+   b. **Continue until all `applyRequires` artifacts are complete**
+      - After creating each artifact, re-run `openspec status --change "<name>" --json`
+      - Check if every artifact ID in `applyRequires` has `status: "done"` in the artifacts array
+      - Stop when all `applyRequires` artifacts are done
+
+   c. **If an artifact requires user input** (unclear context):
+      - Use **AskUserQuestion tool** to clarify
+      - Then continue with creation
+
+5. **Show final status**
+   ```bash
+   openspec status --change "<name>"
+   ```
+
+**Output**
+
+After completing all artifacts, summarize:
+- Change name and location
+- List of artifacts created with brief descriptions
+- What's ready: "All artifacts created! Ready for implementation."
+- Prompt: "Run `/opsx:apply` to start implementing."
+
+**Artifact Creation Guidelines**
+
+- Follow the `instruction` field from `openspec instructions` for each artifact type
+- The schema defines what each artifact should contain - follow it
+- Read dependency artifacts for context before creating new ones
+- Use `template` as the structure for your output file - fill in its sections
+- **IMPORTANT**: `context` and `rules` are constraints for YOU, not content for the file
+  - Do NOT copy `<context>`, `<rules>`, `<project_context>` blocks into the artifact
+  - These guide what you write, but should never appear in the output
+
+**Guardrails**
+- Create ALL artifacts needed for implementation (as defined by schema's `apply.requires`)
+- Always read dependency artifacts before creating a new one
+- If context is critically unclear, ask the user - but prefer making reasonable decisions to keep momentum
+- If a change with that name already exists, ask if user wants to continue it or create a new one
+- Verify each artifact file exists after writing before proceeding to next
diff --git a/.claude/commands/opsx/propose.md b/.claude/commands/opsx/propose.md
new file mode 100644
index 00000000..05276f4d
--- /dev/null
+++ b/.claude/commands/opsx/propose.md
@@ -0,0 +1,106 @@
+---
+name: "OPSX: Propose"
+description: Propose a new change - create it and generate all artifacts in one step
+category: Workflow
+tags: [workflow, artifacts, experimental]
+---
+
+Propose a new change - create the change and generate all artifacts in one step.
+
+I'll create a change with artifacts:
+- proposal.md (what & why)
+- design.md (how)
+- tasks.md (implementation steps)
+
+When ready to implement, run /opsx:apply
+
+---
+
+**Input**: The argument after `/opsx:propose` is the change name (kebab-case), OR a description of what the user wants to build.
+
+**Steps**
+
+1. **If no input provided, ask what they want to build**
+
+   Use the **AskUserQuestion tool** (open-ended, no preset options) to ask:
+   > "What change do you want to work on? Describe what you want to build or fix."
+
+   From their description, derive a kebab-case name (e.g., "add user authentication" → `add-user-auth`).
+
+   **IMPORTANT**: Do NOT proceed without understanding what the user wants to build.
+
+2. **Create the change directory**
+   ```bash
+   openspec new change "<name>"
+   ```
+   This creates a scaffolded change at `openspec/changes/<name>/` with `.openspec.yaml`.
+
+3. **Get the artifact build order**
+   ```bash
+   openspec status --change "<name>" --json
+   ```
+   Parse the JSON to get:
+   - `applyRequires`: array of artifact IDs needed before implementation (e.g., `["tasks"]`)
+   - `artifacts`: list of all artifacts with their status and dependencies
+
+4. **Create artifacts in sequence until apply-ready**
+
+   Use the **TodoWrite tool** to track progress through the artifacts.
+
+   Loop through artifacts in dependency order (artifacts with no pending dependencies first):
+
+   a. **For each artifact that is `ready` (dependencies satisfied)**:
+      - Get instructions:
+        ```bash
+        openspec instructions <artifact-id> --change "<name>" --json
+        ```
+      - The instructions JSON includes:
+        - `context`: Project background (constraints for you - do NOT include in output)
+        - `rules`: Artifact-specific rules (constraints for you - do NOT include in output)
+        - `template`: The structure to use for your output file
+        - `instruction`: Schema-specific guidance for this artifact type
+        - `outputPath`: Where to write the artifact
+        - `dependencies`: Completed artifacts to read for context
+      - Read any completed dependency files for context
+      - Create the artifact file using `template` as the structure
+      - Apply `context` and `rules` as constraints - but do NOT copy them into the file
+      - Show brief progress: "Created <artifact-id>"
+
+   b. **Continue until all `applyRequires` artifacts are complete**
+      - After creating each artifact, re-run `openspec status --change "<name>" --json`
+      - Check if every artifact ID in `applyRequires` has `status: "done"` in the artifacts array
+      - Stop when all `applyRequires` artifacts are done
+
+   c. **If an artifact requires user input** (unclear context):
+      - Use **AskUserQuestion tool** to clarify
+      - Then continue with creation
+
+5. **Show final status**
+   ```bash
+   openspec status --change "<name>"
+   ```
+
+**Output**
+
+After completing all artifacts, summarize:
+- Change name and location
+- List of artifacts created with brief descriptions
+- What's ready: "All artifacts created! Ready for implementation."
+- Prompt: "Run `/opsx:apply` to start implementing."
+
+**Artifact Creation Guidelines**
+
+- Follow the `instruction` field from `openspec instructions` for each artifact type
+- The schema defines what each artifact should contain - follow it
+- Read dependency artifacts for context before creating new ones
+- Use `template` as the structure for your output file - fill in its sections
+- **IMPORTANT**: `context` and `rules` are constraints for YOU, not content for the file
+  - Do NOT copy `<context>`, `<rules>`, `<project_context>` blocks into the artifact
+  - These guide what you write, but should never appear in the output
+
+**Guardrails**
+- Create ALL artifacts needed for implementation (as defined by schema's `apply.requires`)
+- Always read dependency artifacts before creating a new one
+- If context is critically unclear, ask the user - but prefer making reasonable decisions to keep momentum
+- If a change with that name already exists, ask if user wants to continue it or create a new one
+- Verify each artifact file exists after writing before proceeding to next
diff --git a/.claude/skills/openspec-apply-change/SKILL.md b/.claude/skills/openspec-apply-change/SKILL.md
new file mode 100644
index 00000000..d474dc13
--- /dev/null
+++ b/.claude/skills/openspec-apply-change/SKILL.md
@@ -0,0 +1,156 @@
+---
+name: openspec-apply-change
+description: Implement tasks from an OpenSpec change. Use when the user wants to start implementing, continue implementation, or work through tasks.
+license: MIT
+compatibility: Requires openspec CLI.
+metadata:
+  author: openspec
+  version: "1.0"
+  generatedBy: "1.2.0"
+---
+
+Implement tasks from an OpenSpec change.
+
+**Input**: Optionally specify a change name. If omitted, check if it can be inferred from conversation context. If vague or ambiguous you MUST prompt for available changes.
+
+**Steps**
+
+1. **Select the change**
+
+   If a name is provided, use it. Otherwise:
+   - Infer from conversation context if the user mentioned a change
+   - Auto-select if only one active change exists
+   - If ambiguous, run `openspec list --json` to get available changes and use the **AskUserQuestion tool** to let the user select
+
+   Always announce: "Using change: <name>" and how to override (e.g., `/opsx:apply <other>`).
+
+2. **Check status to understand the schema**
+   ```bash
+   openspec status --change "<name>" --json
+   ```
+   Parse the JSON to understand:
+   - `schemaName`: The workflow being used (e.g., "spec-driven")
+   - Which artifact contains the tasks (typically "tasks" for spec-driven, check status for others)
+
+3. **Get apply instructions**
+
+   ```bash
+   openspec instructions apply --change "<name>" --json
+   ```
+
+   This returns:
+   - Context file paths (varies by schema - could be proposal/specs/design/tasks or spec/tests/implementation/docs)
+   - Progress (total, complete, remaining)
+   - Task list with status
+   - Dynamic instruction based on current state
+
+   **Handle states:**
+   - If `state: "blocked"` (missing artifacts): show message, suggest using openspec-continue-change
+   - If `state: "all_done"`: congratulate, suggest archive
+   - Otherwise: proceed to implementation
+
+4. **Read context files**
+
+   Read the files listed in `contextFiles` from the apply instructions output.
+   The files depend on the schema being used:
+   - **spec-driven**: proposal, specs, design, tasks
+   - Other schemas: follow the contextFiles from CLI output
+
+5. **Show current progress**
+
+   Display:
+   - Schema being used
+   - Progress: "N/M tasks complete"
+   - Remaining tasks overview
+   - Dynamic instruction from CLI
+
+6. **Implement tasks (loop until done or blocked)**
+
+   For each pending task:
+   - Show which task is being worked on
+   - Make the code changes required
+   - Keep changes minimal and focused
+   - Mark task complete in the tasks file: `- [ ]` → `- [x]`
+   - Continue to next task
+
+   **Pause if:**
+   - Task is unclear → ask for clarification
+   - Implementation reveals a design issue → suggest updating artifacts
+   - Error or blocker encountered → report and wait for guidance
+   - User interrupts
+
+7. **On completion or pause, show status**
+
+   Display:
+   - Tasks completed this session
+   - Overall progress: "N/M tasks complete"
+   - If all done: suggest archive
+   - If paused: explain why and wait for guidance
+
+**Output During Implementation**
+
+```
+## Implementing: <change-name> (schema: <schema-name>)
+
+Working on task 3/7: <task description>
+[...implementation happening...]
+✓ Task complete
+
+Working on task 4/7: <task description>
+[...implementation happening...]
+✓ Task complete
+```
+
+**Output On Completion**
+
+```
+## Implementation Complete
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Progress:** 7/7 tasks complete ✓
+
+### Completed This Session
+- [x] Task 1
+- [x] Task 2
+...
+
+All tasks complete! Ready to archive this change.
+```
+
+**Output On Pause (Issue Encountered)**
+
+```
+## Implementation Paused
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Progress:** 4/7 tasks complete
+
+### Issue Encountered
+<description of the issue>
+
+**Options:**
+1. <option 1>
+2. <option 2>
+3. Other approach
+
+What would you like to do?
+```
+
+**Guardrails**
+- Keep going through tasks until done or blocked
+- Always read context files before starting (from the apply instructions output)
+- If task is ambiguous, pause and ask before implementing
+- If implementation reveals issues, pause and suggest artifact updates
+- Keep code changes minimal and scoped to each task
+- Update task checkbox immediately after completing each task
+- Pause on errors, blockers, or unclear requirements - don't guess
+- Use contextFiles from CLI output, don't assume specific file names
+
+**Fluid Workflow Integration**
+
+This skill supports the "actions on a change" model:
+
+- **Can be invoked anytime**: Before all artifacts are done (if tasks exist), after partial implementation, interleaved with other actions
+- **Allows artifact updates**: If implementation reveals design issues, suggest updating artifacts - not phase-locked, work fluidly
diff --git a/.claude/skills/openspec-archive-change/SKILL.md b/.claude/skills/openspec-archive-change/SKILL.md
new file mode 100644
index 00000000..9b1f851a
--- /dev/null
+++ b/.claude/skills/openspec-archive-change/SKILL.md
@@ -0,0 +1,114 @@
+---
+name: openspec-archive-change
+description: Archive a completed change in the experimental workflow. Use when the user wants to finalize and archive a change after implementation is complete.
+license: MIT
+compatibility: Requires openspec CLI.
+metadata:
+  author: openspec
+  version: "1.0"
+  generatedBy: "1.2.0"
+---
+
+Archive a completed change in the experimental workflow.
+
+**Input**: Optionally specify a change name. If omitted, check if it can be inferred from conversation context. If vague or ambiguous you MUST prompt for available changes.
+
+**Steps**
+
+1. **If no change name provided, prompt for selection**
+
+   Run `openspec list --json` to get available changes. Use the **AskUserQuestion tool** to let the user select.
+
+   Show only active changes (not already archived).
+   Include the schema used for each change if available.
+
+   **IMPORTANT**: Do NOT guess or auto-select a change. Always let the user choose.
+
+2. **Check artifact completion status**
+
+   Run `openspec status --change "<name>" --json` to check artifact completion.
+
+   Parse the JSON to understand:
+   - `schemaName`: The workflow being used
+   - `artifacts`: List of artifacts with their status (`done` or other)
+
+   **If any artifacts are not `done`:**
+   - Display warning listing incomplete artifacts
+   - Use **AskUserQuestion tool** to confirm user wants to proceed
+   - Proceed if user confirms
+
+3. **Check task completion status**
+
+   Read the tasks file (typically `tasks.md`) to check for incomplete tasks.
+
+   Count tasks marked with `- [ ]` (incomplete) vs `- [x]` (complete).
+
+   **If incomplete tasks found:**
+   - Display warning showing count of incomplete tasks
+   - Use **AskUserQuestion tool** to confirm user wants to proceed
+   - Proceed if user confirms
+
+   **If no tasks file exists:** Proceed without task-related warning.
+
+4. **Assess delta spec sync state**
+
+   Check for delta specs at `openspec/changes/<name>/specs/`. If none exist, proceed without sync prompt.
+
+   **If delta specs exist:**
+   - Compare each delta spec with its corresponding main spec at `openspec/specs/<capability>/spec.md`
+   - Determine what changes would be applied (adds, modifications, removals, renames)
+   - Show a combined summary before prompting
+
+   **Prompt options:**
+   - If changes needed: "Sync now (recommended)", "Archive without syncing"
+   - If already synced: "Archive now", "Sync anyway", "Cancel"
+
+   If user chooses sync, use Task tool (subagent_type: "general-purpose", prompt: "Use Skill tool to invoke openspec-sync-specs for change '<name>'. Delta spec analysis: <include the analyzed delta spec summary>"). Proceed to archive regardless of choice.
+
+5. **Perform the archive**
+
+   Create the archive directory if it doesn't exist:
+   ```bash
+   mkdir -p openspec/changes/archive
+   ```
+
+   Generate target name using current date: `YYYY-MM-DD-<change-name>`
+
+   **Check if target already exists:**
+   - If yes: Fail with error, suggest renaming existing archive or using different date
+   - If no: Move the change directory to archive
+
+   ```bash
+   mv openspec/changes/<name> openspec/changes/archive/YYYY-MM-DD-<name>
+   ```
+
+6. **Display summary**
+
+   Show archive completion summary including:
+   - Change name
+   - Schema that was used
+   - Archive location
+   - Whether specs were synced (if applicable)
+   - Note about any warnings (incomplete artifacts/tasks)
+
+**Output On Success**
+
+```
+## Archive Complete
+
+**Change:** <change-name>
+**Schema:** <schema-name>
+**Archived to:** openspec/changes/archive/YYYY-MM-DD-<name>/
+**Specs:** ✓ Synced to main specs (or "No delta specs" or "Sync skipped")
+
+All artifacts complete. All tasks complete.
+```
+
+**Guardrails**
+- Always prompt for change selection if not provided
+- Use artifact graph (openspec status --json) for completion checking
+- Don't block archive on warnings - just inform and confirm
+- Preserve .openspec.yaml when moving to archive (it moves with the directory)
+- Show clear summary of what happened
+- If sync is requested, use openspec-sync-specs approach (agent-driven)
+- If delta specs exist, always run the sync assessment and show the combined summary before prompting
diff --git a/.claude/skills/openspec-explore/SKILL.md b/.claude/skills/openspec-explore/SKILL.md
new file mode 100644
index 00000000..ffa10cad
--- /dev/null
+++ b/.claude/skills/openspec-explore/SKILL.md
@@ -0,0 +1,288 @@
+---
+name: openspec-explore
+description: Enter explore mode - a thinking partner for exploring ideas, investigating problems, and clarifying requirements. Use when the user wants to think through something before or during a change.
+license: MIT
+compatibility: Requires openspec CLI.
+metadata:
+  author: openspec
+  version: "1.0"
+  generatedBy: "1.2.0"
+---
+
+Enter explore mode. Think deeply. Visualize freely. Follow the conversation wherever it goes.
+
+**IMPORTANT: Explore mode is for thinking, not implementing.** You may read files, search code, and investigate the codebase, but you must NEVER write code or implement features. If the user asks you to implement something, remind them to exit explore mode first and create a change proposal. You MAY create OpenSpec artifacts (proposals, designs, specs) if the user asks—that's capturing thinking, not implementing.
+
+**This is a stance, not a workflow.** There are no fixed steps, no required sequence, no mandatory outputs. You're a thinking partner helping the user explore.
+
+---
+
+## The Stance
+
+- **Curious, not prescriptive** - Ask questions that emerge naturally, don't follow a script
+- **Open threads, not interrogations** - Surface multiple interesting directions and let the user follow what resonates. Don't funnel them through a single path of questions.
+- **Visual** - Use ASCII diagrams liberally when they'd help clarify thinking
+- **Adaptive** - Follow interesting threads, pivot when new information emerges
+- **Patient** - Don't rush to conclusions, let the shape of the problem emerge
+- **Grounded** - Explore the actual codebase when relevant, don't just theorize
+
+---
+
+## What You Might Do
+
+Depending on what the user brings, you might:
+
+**Explore the problem space**
+- Ask clarifying questions that emerge from what they said
+- Challenge assumptions
+- Reframe the problem
+- Find analogies
+
+**Investigate the codebase**
+- Map existing architecture relevant to the discussion
+- Find integration points
+- Identify patterns already in use
+- Surface hidden complexity
+
+**Compare options**
+- Brainstorm multiple approaches
+- Build comparison tables
+- Sketch tradeoffs
+- Recommend a path (if asked)
+
+**Visualize**
+```
+┌─────────────────────────────────────────┐
+│     Use ASCII diagrams liberally        │
+├─────────────────────────────────────────┤
+│                                         │
+│   ┌────────┐         ┌────────┐        │
+│   │ State  │────────▶│ State  │        │
+│   │   A    │         │   B    │        │
+│   └────────┘         └────────┘        │
+│                                         │
+│   System diagrams, state machines,      │
+│   data flows, architecture sketches,    │
+│   dependency graphs, comparison tables  │
+│                                         │
+└─────────────────────────────────────────┘
+```
+
+**Surface risks and unknowns**
+- Identify what could go wrong
+- Find gaps in understanding
+- Suggest spikes or investigations
+
+---
+
+## OpenSpec Awareness
+
+You have full context of the OpenSpec system. Use it naturally, don't force it.
+
+### Check for context
+
+At the start, quickly check what exists:
+```bash
+openspec list --json
+```
+
+This tells you:
+- If there are active changes
+- Their names, schemas, and status
+- What the user might be working on
+
+### When no change exists
+
+Think freely. When insights crystallize, you might offer:
+
+- "This feels solid enough to start a change. Want me to create a proposal?"
+- Or keep exploring - no pressure to formalize
+
+### When a change exists
+
+If the user mentions a change or you detect one is relevant:
+
+1. **Read existing artifacts for context**
+   - `openspec/changes/<name>/proposal.md`
+   - `openspec/changes/<name>/design.md`
+   - `openspec/changes/<name>/tasks.md`
+   - etc.
+
+2. **Reference them naturally in conversation**
+   - "Your design mentions using Redis, but we just realized SQLite fits better..."
+   - "The proposal scopes this to premium users, but we're now thinking everyone..."
+
+3. **Offer to capture when decisions are made**
+
+   | Insight Type | Where to Capture |
+   |--------------|------------------|
+   | New requirement discovered | `specs/<capability>/spec.md` |
+   | Requirement changed | `specs/<capability>/spec.md` |
+   | Design decision made | `design.md` |
+   | Scope changed | `proposal.md` |
+   | New work identified | `tasks.md` |
+   | Assumption invalidated | Relevant artifact |
+
+   Example offers:
+   - "That's a design decision. Capture it in design.md?"
+   - "This is a new requirement. Add it to specs?"
+   - "This changes scope. Update the proposal?"
+
+4. **The user decides** - Offer and move on. Don't pressure. Don't auto-capture.
+
+---
+
+## What You Don't Have To Do
+
+- Follow a script
+- Ask the same questions every time
+- Produce a specific artifact
+- Reach a conclusion
+- Stay on topic if a tangent is valuable
+- Be brief (this is thinking time)
+
+---
+
+## Handling Different Entry Points
+
+**User brings a vague idea:**
+```
+User: I'm thinking about adding real-time collaboration
+
+You: Real-time collab is a big space. Let me think about this...
+
+      COLLABORATION SPECTRUM
+      ════════════════════════════════════════════
+
+      Awareness          Coordination         Sync
+          │                   │                 │
+          ▼                   ▼                 ▼
+      ┌────────┐         ┌────────┐        ┌────────┐
+      │Presence│         │Cursors │        │  CRDT  │
+      │  "3    │         │ Multi  │        │Conflict│
+      │online" │         │ select │        │  free  │
+      └────────┘         └────────┘        └────────┘
+          │                   │                 │
+       trivial            moderate           complex
+
+      Where's your head at?
+```
+
+**User brings a specific problem:**
+```
+User: The auth system is a mess
+
+You: [reads codebase]
+
+     ┌─────────────────────────────────────────────┐
+     │              CURRENT AUTH FLOW              │
+     └─────────────────────────────────────────────┘
+                        │
+          ┌─────────────┼─────────────┐
+          ▼             ▼             ▼
+     ┌─────────┐  ┌─────────┐  ┌─────────┐
+     │ Google  │  │  GitHub │  │  Email  │
+     │  OAuth  │  │  OAuth  │  │  Magic  │
+     └────┬────┘  └────┬────┘  └────┬────┘
+          │             │             │
+          └─────────────┼─────────────┘
+                        ▼
+                  ┌───────────┐
+                  │  Session  │
+                  └─────┬─────┘
+                        │
+                        ▼
+                  ┌───────────┐
+                  │   Perms   │
+                  └───────────┘
+
+     I see three tangles. Which one's burning?
+```
+
+**User is stuck mid-implementation:**
+```
+User: /opsx:explore add-auth-system
+      The OAuth integration is more complex than expected
+
+You: [reads change artifacts]
+
+     You're on task 4: "Implement OAuth flow"
+
+     Let me trace what's involved...
+
+     [draws diagram, explores options, suggests paths]
+
+     Want to update the design to reflect this?
+     Or add a spike task to investigate?
+```
+
+**User wants to compare options:**
+```
+User: Should we use Postgres or SQLite?
+
+You: Generic answer is boring. What's the context?
+
+User: A CLI tool that tracks local dev environments
+
+You: That changes everything.
+
+     ┌─────────────────────────────────────────────────┐
+     │         CLI TOOL DATA STORAGE                  │
+     └─────────────────────────────────────────────────┘
+
+     Key constraints:
+     • No daemon running
+     • Must work offline
+     • Single user
+
+                  SQLite          Postgres
+     Deployment   embedded ✓      needs server ✗
+     Offline      yes ✓           no ✗
+     Single file  yes ✓           no ✗
+
+     SQLite. Not even close.
+
+     Unless... is there a sync component?
+```
+
+---
+
+## Ending Discovery
+
+There's no required ending. Discovery might:
+
+- **Flow into a proposal**: "Ready to start? I can create a change proposal."
+- **Result in artifact updates**: "Updated design.md with these decisions"
+- **Just provide clarity**: User has what they need, moves on
+- **Continue later**: "We can pick this up anytime"
+
+When it feels like things are crystallizing, you might summarize:
+
+```
+## What We Figured Out
+
+**The problem**: [crystallized understanding]
+
+**The approach**: [if one emerged]
+
+**Open questions**: [if any remain]
+
+**Next steps** (if ready):
+- Create a change proposal
+- Keep exploring: just keep talking
+```
+
+But this summary is optional. Sometimes the thinking IS the value.
+
+---
+
+## Guardrails
+
+- **Don't implement** - Never write code or implement features. Creating OpenSpec artifacts is fine, writing application code is not.
+- **Don't fake understanding** - If something is unclear, dig deeper
+- **Don't rush** - Discovery is thinking time, not task time
+- **Don't force structure** - Let patterns emerge naturally
+- **Don't auto-capture** - Offer to save insights, don't just do it
+- **Do visualize** - A good diagram is worth many paragraphs
+- **Do explore the codebase** - Ground discussions in reality
+- **Do question assumptions** - Including the user's and your own
diff --git a/.claude/skills/openspec-ff-change/SKILL.md b/.claude/skills/openspec-ff-change/SKILL.md
new file mode 100644
index 00000000..d5f12043
--- /dev/null
+++ b/.claude/skills/openspec-ff-change/SKILL.md
@@ -0,0 +1,101 @@
+---
+name: openspec-ff-change
+description: Fast-forward through OpenSpec artifact creation. Use when the user wants to quickly create all artifacts needed for implementation without stepping through each one individually.
+license: MIT
+compatibility: Requires openspec CLI.
+metadata:
+  author: openspec
+  version: "1.0"
+  generatedBy: "1.2.0"
+---
+
+Fast-forward through artifact creation - generate everything needed to start implementation in one go.
+
+**Input**: The user's request should include a change name (kebab-case) OR a description of what they want to build.
+
+**Steps**
+
+1. **If no clear input provided, ask what they want to build**
+
+   Use the **AskUserQuestion tool** (open-ended, no preset options) to ask:
+   > "What change do you want to work on? Describe what you want to build or fix."
+
+   From their description, derive a kebab-case name (e.g., "add user authentication" → `add-user-auth`).
+
+   **IMPORTANT**: Do NOT proceed without understanding what the user wants to build.
+
+2. **Create the change directory**
+   ```bash
+   openspec new change "<name>"
+   ```
+   This creates a scaffolded change at `openspec/changes/<name>/`.
+
+3. **Get the artifact build order**
+   ```bash
+   openspec status --change "<name>" --json
+   ```
+   Parse the JSON to get:
+   - `applyRequires`: array of artifact IDs needed before implementation (e.g., `["tasks"]`)
+   - `artifacts`: list of all artifacts with their status and dependencies
+
+4. **Create artifacts in sequence until apply-ready**
+
+   Use the **TodoWrite tool** to track progress through the artifacts.
+
+   Loop through artifacts in dependency order (artifacts with no pending dependencies first):
+
+   a. **For each artifact that is `ready` (dependencies satisfied)**:
+      - Get instructions:
+        ```bash
+        openspec instructions <artifact-id> --change "<name>" --json
+        ```
+      - The instructions JSON includes:
+        - `context`: Project background (constraints for you - do NOT include in output)
+        - `rules`: Artifact-specific rules (constraints for you - do NOT include in output)
+        - `template`: The structure to use for your output file
+        - `instruction`: Schema-specific guidance for this artifact type
+        - `outputPath`: Where to write the artifact
+        - `dependencies`: Completed artifacts to read for context
+      - Read any completed dependency files for context
+      - Create the artifact file using `template` as the structure
+      - Apply `context` and `rules` as constraints - but do NOT copy them into the file
+      - Show brief progress: "✓ Created <artifact-id>"
+
+   b. **Continue until all `applyRequires` artifacts are complete**
+      - After creating each artifact, re-run `openspec status --change "<name>" --json`
+      - Check if every artifact ID in `applyRequires` has `status: "done"` in the artifacts array
+      - Stop when all `applyRequires` artifacts are done
+
+   c. **If an artifact requires user input** (unclear context):
+      - Use **AskUserQuestion tool** to clarify
+      - Then continue with creation
+
+5. **Show final status**
+   ```bash
+   openspec status --change "<name>"
+   ```
+
+**Output**
+
+After completing all artifacts, summarize:
+- Change name and location
+- List of artifacts created with brief descriptions
+- What's ready: "All artifacts created! Ready for implementation."
+- Prompt: "Run `/opsx:apply` or ask me to implement to start working on the tasks."
+
+**Artifact Creation Guidelines**
+
+- Follow the `instruction` field from `openspec instructions` for each artifact type
+- The schema defines what each artifact should contain - follow it
+- Read dependency artifacts for context before creating new ones
+- Use `template` as the structure for your output file - fill in its sections
+- **IMPORTANT**: `context` and `rules` are constraints for YOU, not content for the file
+  - Do NOT copy `<context>`, `<rules>`, `<project_context>` blocks into the artifact
+  - These guide what you write, but should never appear in the output
+
+**Guardrails**
+- Create ALL artifacts needed for implementation (as defined by schema's `apply.requires`)
+- Always read dependency artifacts before creating a new one
+- If context is critically unclear, ask the user - but prefer making reasonable decisions to keep momentum
+- If a change with that name already exists, suggest continuing that change instead
+- Verify each artifact file exists after writing before proceeding to next
diff --git a/.claude/skills/openspec-propose/SKILL.md b/.claude/skills/openspec-propose/SKILL.md
new file mode 100644
index 00000000..d27bc531
--- /dev/null
+++ b/.claude/skills/openspec-propose/SKILL.md
@@ -0,0 +1,110 @@
+---
+name: openspec-propose
+description: Propose a new change with all artifacts generated in one step. Use when the user wants to quickly describe what they want to build and get a complete proposal with design, specs, and tasks ready for implementation.
+license: MIT
+compatibility: Requires openspec CLI.
+metadata:
+  author: openspec
+  version: "1.0"
+  generatedBy: "1.2.0"
+---
+
+Propose a new change - create the change and generate all artifacts in one step.
+
+I'll create a change with artifacts:
+- proposal.md (what & why)
+- design.md (how)
+- tasks.md (implementation steps)
+
+When ready to implement, run /opsx:apply
+
+---
+
+**Input**: The user's request should include a change name (kebab-case) OR a description of what they want to build.
+
+**Steps**
+
+1. **If no clear input provided, ask what they want to build**
+
+   Use the **AskUserQuestion tool** (open-ended, no preset options) to ask:
+   > "What change do you want to work on? Describe what you want to build or fix."
+
+   From their description, derive a kebab-case name (e.g., "add user authentication" → `add-user-auth`).
+
+   **IMPORTANT**: Do NOT proceed without understanding what the user wants to build.
+
+2. **Create the change directory**
+   ```bash
+   openspec new change "<name>"
+   ```
+   This creates a scaffolded change at `openspec/changes/<name>/` with `.openspec.yaml`.
+
+3. **Get the artifact build order**
+   ```bash
+   openspec status --change "<name>" --json
+   ```
+   Parse the JSON to get:
+   - `applyRequires`: array of artifact IDs needed before implementation (e.g., `["tasks"]`)
+   - `artifacts`: list of all artifacts with their status and dependencies
+
+4. **Create artifacts in sequence until apply-ready**
+
+   Use the **TodoWrite tool** to track progress through the artifacts.
+
+   Loop through artifacts in dependency order (artifacts with no pending dependencies first):
+
+   a. **For each artifact that is `ready` (dependencies satisfied)**:
+      - Get instructions:
+        ```bash
+        openspec instructions <artifact-id> --change "<name>" --json
+        ```
+      - The instructions JSON includes:
+        - `context`: Project background (constraints for you - do NOT include in output)
+        - `rules`: Artifact-specific rules (constraints for you - do NOT include in output)
+        - `template`: The structure to use for your output file
+        - `instruction`: Schema-specific guidance for this artifact type
+        - `outputPath`: Where to write the artifact
+        - `dependencies`: Completed artifacts to read for context
+      - Read any completed dependency files for context
+      - Create the artifact file using `template` as the structure
+      - Apply `context` and `rules` as constraints - but do NOT copy them into the file
+      - Show brief progress: "Created <artifact-id>"
+
+   b. **Continue until all `applyRequires` artifacts are complete**
+      - After creating each artifact, re-run `openspec status --change "<name>" --json`
+      - Check if every artifact ID in `applyRequires` has `status: "done"` in the artifacts array
+      - Stop when all `applyRequires` artifacts are done
+
+   c. **If an artifact requires user input** (unclear context):
+      - Use **AskUserQuestion tool** to clarify
+      - Then continue with creation
+
+5. **Show final status**
+   ```bash
+   openspec status --change "<name>"
+   ```
+
+**Output**
+
+After completing all artifacts, summarize:
+- Change name and location
+- List of artifacts created with brief descriptions
+- What's ready: "All artifacts created! Ready for implementation."
+- Prompt: "Run `/opsx:apply` or ask me to implement to start working on the tasks."
+
+**Artifact Creation Guidelines**
+
+- Follow the `instruction` field from `openspec instructions` for each artifact type
+- The schema defines what each artifact should contain - follow it
+- Read dependency artifacts for context before creating new ones
+- Use `template` as the structure for your output file - fill in its sections
+- **IMPORTANT**: `context` and `rules` are constraints for YOU, not content for the file
+  - Do NOT copy `<context>`, `<rules>`, `<project_context>` blocks into the artifact
+  - These guide what you write, but should never appear in the output
+
+**Guardrails**
+- Create ALL artifacts needed for implementation (as defined by schema's `apply.requires`)
+- Always read dependency artifacts before creating a new one
+- If context is critically unclear, ask the user - but prefer making reasonable decisions to keep momentum
+- If a change with that name already exists, ask if user wants to continue it or create a new one
+- Verify each artifact file exists after writing before proceeding to next
diff --git a/.github/workflows/codeql.yml b/.github/workflows/codeql.yml
index 320ead1f..59149d6d 100644
--- a/.github/workflows/codeql.yml
+++ b/.github/workflows/codeql.yml
@@ -36,7 +36,12 @@ jobs:
         uses: github/codeql-action/analyze@v4
         with:
           category: "/language:${{ matrix.language }}"
-          # Issue #48 — Code Scanning is now enabled on this repo, so the
-          # SARIF upload (default) succeeds and findings surface in the
-          # GitHub Security tab. The previous `upload: never` workaround
-          # is removed; analysis crashes still fail the job (unchanged).
+          # votee/beever-atlas-ee is a private repo without GitHub Advanced
+          # Security, so the SARIF upload fails with "Code Security must be
+          # enabled for this repository" and blocks CI. Skip the upload
+          # until GHAS is enabled. The CodeQL queries still run — any crash
+          # during analysis will still fail the job — but findings don't
+          # surface as Code Scanning alerts on this repo. The upstream OSS
+          # repo (Beever-AI/beever-atlas) is public, has Code Scanning
+          # enabled for free, and uploads normally.
+          upload: never
diff --git a/.github/workflows/deploy.yml b/.github/workflows/deploy.yml
new file mode 100644
index 00000000..d1ae8ad9
--- /dev/null
+++ b/.github/workflows/deploy.yml
@@ -0,0 +1,38 @@
+name: deploy
+
+on:
+  push:
+    branches: [main]
+  workflow_dispatch:
+
+concurrency:
+  group: deploy-prod
+  cancel-in-progress: false
+
+jobs:
+  deploy:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+
+      - name: Setup SSH
+        run: |
+          mkdir -p ~/.ssh
+          echo "${{ secrets.EC2_SSH_KEY }}" > ~/.ssh/id_ed25519
+          chmod 600 ~/.ssh/id_ed25519
+          ssh-keyscan -H ${{ secrets.EC2_HOST }} >> ~/.ssh/known_hosts 2>/dev/null
+
+      - name: Rsync code
+        run: |
+          rsync -az --delete \
+            --exclude='.git' --exclude='node_modules' --exclude='web/node_modules' \
+            --exclude='web/dist' --exclude='.venv' --exclude='__pycache__' --exclude='*.pyc' \
+            --exclude='scripts/deploy/.state' --exclude='.omc' --exclude='memory' \
+            --exclude='.env' \
+            -e "ssh -i ~/.ssh/id_ed25519 -o StrictHostKeyChecking=no" \
+            ./ ubuntu@${{ secrets.EC2_HOST }}:/opt/beever-atlas-v2/
+
+      - name: Restart stack
+        run: |
+          ssh -i ~/.ssh/id_ed25519 -o StrictHostKeyChecking=no ubuntu@${{ secrets.EC2_HOST }} \
+            'cd /opt/beever-atlas-v2 && sudo docker compose up -d --build && sudo docker compose ps'
diff --git a/.github/workflows/trigger-docs-rebuild.yml b/.github/workflows/trigger-docs-rebuild.yml
new file mode 100644
index 00000000..eae58176
--- /dev/null
+++ b/.github/workflows/trigger-docs-rebuild.yml
@@ -0,0 +1,18 @@
+name: Trigger Docs Rebuild
+
+on:
+  push:
+    branches: [main]
+    paths:
+      - 'docs/content/**'
+
+jobs:
+  trigger:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Dispatch to docs repo
+        uses: peter-evans/repository-dispatch@v3
+        with:
+          token: ${{ secrets.DOCS_DISPATCH_TOKEN }}
+          repository: beever-ai/beever-atlas-docs
+          event-type: content-updated
diff --git a/docs/Beever_Atlas_Feature_Spec.docx b/docs/Beever_Atlas_Feature_Spec.docx
new file mode 100644
index 0000000000000000000000000000000000000000..96d04df588e9132089e75eb816ad05519fb1ad29
GIT binary patch
literal 35075
zcmZ^}Q;;uA@GLkwW83!38JlNp+qP}nwr$(CZQI6gc)$PMy&Jm^+Yi;%5!v~W5#3Q;
zo$^wkV9-GSGbYgDwEuVVe}_N+&2Dy%#`N<4zYv)J1JQFdv3B}@fRO*2OMME?4gnYl
zi0D5e-2X4o*v`ns#>Cc{&fUhE?tiw{B+tkWF(3x(kv-;cP{~7AZovp%qJ==%-^|2?
z`x)CI`FKSyHq%wo!t>3p=K`8FJ$k-XjUN8YB8W08rlC>&gCT*~VSHTZ-l-XPZSoCE
zFls;e9QYvSS;$SmU+9&;6hVmmTvVSwC}x#)fL(N1c6^xN&X20pRKzfgznV6rz;fW$
zNGjyie3xsr*b?ZkLW1~%ZKo_BH-_Xbeq0M`Y`kbPPW0^hO>EY~SI*ncFa|TQAt|tZ
z7Hl3PV8ehv^xuN#I2h>fFT=Df^a0})9gtzVS2r95MM(X01Yz_CZzQ5AMaX?JN&y7j
zfDRD3PT7ZAM;qg9o^#_br(NG@=$%+NlI$w1K;2L048s3W{QnX%F5F61*boS)_Z}1o
z>Hi4%|CaE^XUB1CwDt6xM?~#(nXZHF#cHD=XJTT!(b#H?e>1hwN0C)UNhV*y6)Yc%
zg)p-teN*iT$Z7Uu+G+ZmM>5xVwXG9}kW8}h$6D!Hc_dLB5e!?e?)RnU_i^cr2c@jC
zc@vf2x2>b~AxB=sP*LaE@9Q@0=zGO%4@zr-I3<H>Ic_f|<@>hG@0-2|qe9|R8!cr#
z2?OGLFu5^n;j?JQhH=xISafLS<_;{UyPjTj=y>c7T!q!Uq1QGkV6sGK%0&^|%5z22
z@X=-I>O#0CPCBF3&c@pI-fn1-bW*c3QAZ<N5=S;Ik3S2mGH~}ggDDN|TjO=jNh5c8
zNQLWDUbVA|StsUV8%J))VmfYb#pGcy&#<L~(EB)~i}07kyD<CAYe!5MuySChO|K%l
zrVq_MKlI9G>&fQngedFixgnE%s__ZQ)@JtxtDE8|%)f~;>jbFyl+W#;>HCV#!g1n3
z$}R*9^Q19+=Nw2nRz$r<*)_AEM0Ej<Jt@|?yaEk=T@lV)EG`Ya*nGO0@K7mcI_l5r
z9O!=IDePu;Zw+I?9oMh>kDt6Lp4x{1t!qgEv@|arykn;=a*xK}fRCmpip|b}h;#5Z
zwz<5^hZeaPXFN5RTVW?dz;wiU<!p|Htb88tQgoJ=XA`qc4*${{JNMGjlbNS$6LU@w
z3RGeePPSy5m-K1el2zWp=gW36wbwTE&`eoGrju82&+D|^>PvACLAKbO2k!xoT-MY1
z#8t<}h;(*;sR~#7szz&|weFai2^qv9Jo%hEuY>3(Yqo9n686CTMW%Cfp3u)({V)P|
zG88W+qbHG93Ng2VQxGf_mudoaVcoX3s9)XD_#Uz+K-rYvH!WUJjZ9a^ZuaQMq0bHQ
zep`Ez7cn7?#n;*K>wdNo%Wd$v^0mtR%ljI4KlxO4kfOu=z3KU+gQglTCxXFEo1$`3
z@o}rCcYE_~bVm|fwY_kwS=aqp!`-v>O)>Yzrn~#QbBo)z!`V47W2^UR%1TgI=iNM`
z!@JAjr*_*yb9b4<w@~%j*_`s;!B8Y!;zI_1{?RD?HnFT0NJt%<GCgwp4?Se^`ekkZ
z;)UQnqifgm=l!$Et*6E3r*_9mZTE!{EAfyrX+lD*dy%a`W*g~vB;3uQ?dh*pLORH9
z9P`|75*;;3@%S^<j;@$GnH2Kd@p^cY=}qS%^}`weuh4iWld<Wgu<3^T%|}#gM+>8o
z2VHwXox?|E7J$IFrfY+O%a0X*(QIiB89CPa{wyx<qf8E>yEvuv<+yK}frk~-F=O!I
zYu)b)-YSg$hJfGqEYGp@yg>Wty|%*qyY4AM@eN|_16oHtr;>f`cbxz9cl_hIu|}@$
z?Mkuiy+w}SO@^&!|4J!L`Ib52b1jdk<1m2CiI1K>^((}VH|r?jhIixB_wM8ce{*_H
z`)A$M+xUArW<x;I?#)>=Z*OaBZ*%XrHbD`~=Aj<_&UmACXhUQRwsUmvdE>3Em2GzW
zFvJX!-Fbj&HOorFF7G&Hb+qOtybG}_tVH6f7@0u4G}n?9*)IWb5AlEybD!w0v$WdY
zOX%e~F*8#n(rUUryXuV|@b8^cDFvA%Bd1yw^s(s^)WFk_1v4s5a=|AuhdN2Md~Q><
zRmg5!@?5f5^g&N_SyjVx<buFA%UN=v45PqftdvgV&&yO>R9}en3?fBGfFi&%>P1px
zhhBNoE#y3_Qau75kDN<z0b#YtmP|2)fmYSYLl7rL#nQl>Dp3G9M>3L)^I|D*siEYf
zXSGDq?C*FdiszLqNG-diRfEgj5a&0X%wQRP-yWCU?jA^<R|WVx7-xjs$nR(0Apza_
z-_}(&^i%x6Va|Ry{j#q$y1qv8myx&q=6@TM*#*~C_cl#-Ztlu46Q?h0uc8vgplmxT
z2zsR_xS8IY`s>?=d)qf)D#)|WyG5Isj=630Nfv9Xwp+~F6*mCw3Zqvr@%KAy5CMf5
z*JNu=0Ku&>(^4a#^j_lEXGY*T%FpVf;iWOTn`t}o_a>ddK<x#o55Gs;MI*-VtCreo
zWj#;Dqnc088HT`h#1AL@*K}CVw@&6|==9R)4W_mfa80&ny&|95*a`VBO{bZ9u_;e8
zulMQ2^d%L$T?sg@3+T2KQAz&WQ_K9WN%B%MkJ>@`%$PLV4qdyjk11w(_~=F|^?i|H
z3J(Hz7weG?n(er{9K4eQsnfqetwUX4z*Rv{<=~{n;;Er>3WR_q6_m*bq=LG%2xU-=
zBD8YaL;$BWNX;9!EHhH|n40}W-PloDZ(cpaSySVj|J|BBLd{x^?n@fyAPO_ntC`fu
zj)_7dXANtDL`sTQ*<f;e`I-A$OBr#+W;=A1S8a8Y&(=~rPat)^*Zf*us?X&@nktr5
zDlZrU>;97YuUpAQg*vkF?M%V(G-_6BAC;q<H5tvvs;!PCS0Jpari&*?p9L52-?fO7
zGrxDsyj*)KC9cw{%7jCR>;#C-x)dFhF;E+dIt+Vx<BvqD>s)3Z6&XKL$8wqv2{#In
zTl+DKTf=U0*Yl`R=vmQJ9IXTqo%%`QxL{dK^aMMhL?xjs$Z&W|!DQwExGaqKzXfPy
zdm{QCQ-4OMmib_6)kd`L^9Rvs%ndHkj$9{r6owM0<leo`fs@P%E2G(P0DhJGPo#YD
zm)RYet}Ye7^0&&*KF1g<!HOTS1qJodLawmC#9^a9`XMP?a9ugdOdfKq0(B(Fp114W
zRkME!$2jm(!9tZM=85DY@u`MN=yqb??2scLYLz~Zw~4bg5R^Q}quUbr7H)Y1b!!7m
zMo&8s%Rgaxw5l5MZ%}trD{$Qi9oDI11g0>XqyHyl_pJ1#<dYe18CW<{dMHK_h#TM!
zOwlKXK&#$ziHK4URS_%-lg3E6;xK_rF=s-u3bsgj30nU6IsIcI^I=xBhAt^4>MX5Q
zw_9K`$7uSznDhG~Sk+@ND1n3ycR{`&QZyo2Y#`Br>mCuBgt-i6A;8iu4$ZzP$xa|D
zkgZX}vL=NHidG@lz}qt~wkr(+yj$CPx+3+jn)FVV`1cv<ouHs+uWn@EXpj@iL7TlM
zULaC)*L7gpP5f{smt3bga@en_E_b{QH!jiQwk>qK#j|1D)^!2;MXWAjCZ>ePzHb}l
zK<_-Iu`YE&qGt5%A?)anfss8E;0CH{3Qn};G)t*q@MKZ4tECtFyoxzm@P=EmE+D6j
z2|XnTx&4bG(UdM=W@V5b1w}3Y73TdI=Z$=ym|SaPm6~X;#~nM>Eqa<z(JK+x$gNfc
zCT9@C)PUYaE@S!9EQ`ZBL+K&n@K;G4HyuvS?%tkW+n$Bkl$1UcL=GyuQ_vlSC?-WW
z565eaY+<3}8LCK}JhvnB)&j7pTm<!5MOf|czj{OgmpoUVN$!BA2z$V3%E9A}#i#kH
z=Way_x)QL~o-}oTm(Gs#$stOIHaT}sK;MgvDX8nlyO2Br5<5Xr2nhKc1mm_fEx03x
za>xlLDIOKGl_~Bnd!$Z<^4xRyslPWO)0yzb5|zI$c3uw~t*t^`XG<#;$M(BuSq1el
zU9@GIlZ_(Lv21HgwcSLL_V)hsLpV=O17;G0DB(Fyqt8<T6^({6N=AhAXL`?M$*eXY
z`fU+0n-$lEtbqTN#DP$(VT&a+&XhgDq723x>iVB5->?-#ry2m#wIwsvnF(PyAUI9P
z58w^L0!x9pIR#hX^5Ke^k+eZhGc8YKGacEhCFGIv%uDWcxmO2~i+Dz&d1gr{hqeVm
zVW}SB+1~FFZZvzk&h=C%Ajl1)j39RcfZV>hNs_YABZP_1P!LnkDL6{penH^@Ek<5G
zt6TyZ;az1j3*msrbLab{@FmM4Vhw=-CX`XW5O&zf#H~{`O(7BEDEA2~R(d#u$ux?q
zV;SrHWvgz;8U2HpYV2w0u}9U~Lwji5e!gdAvZHE`#lEj;x}Iy))}Rdu_u{lZm_j~v
zD}~($)4Yl6N2O9qv*T8=X9_W|NRG^>WiVeBgqk|pMM<9xExOpKr8K`Yf|lDcsEXz}
zxDztWsX`Z}Gl9~MTV1D#bxhpXJnEsnkU?`f^yhvlf5y9;XLyfuEzVC{A&4ERz}<nJ
z?<1Bh^q<&~rly&D8N@&@z>rxoQlsjJM%C=>U%vKGW5nWG*{<Ggm0$|j1HvGiqmY{Q
zC93GTvP2%V1u_MZz@5VHN|rBv_=ub=)#0TVr21iE+y;a9N+!iMZekgWlYrsq@7DDD
z((D@y-^=6D6g2SUX);g~^|!;p3Fjr)rd>{#?EcBm?UUME-BY%i?_neb?-O$x{}mR2
zMqjaXevFz<o3#GZBU8W(;%P^_G-38Z;eR>#f6N*E-N1}LceO$lz1B{qoR^+aT~!Nj
z&mSt(NIp_Ae6lMq<GW3R@Y0-8BQ<g>yxL{vm*9i;rmRaKtI{|8TYDROJy5vpws2!H
zUBO2@Y>%)<R#tpRR%@s|)u#CVE)To*VW^V-%tDm!w2(#5|Drwb=G4gX9e>m}FZWb}
zBES}$%9}*b2S(XCrgtgxCJ%K{Udm?}lxC{ONg@s4r)-909oTi5({Q@RINez8oSsw0
zJiZFfi*d;HZ<uJWW+_i~%GI^EW>d<NG}=a1=dEDUD&tF#o@qC!8?b2d?c%IC4AbV|
z(p^VrPXux&i$}^P$UV}yYR4nCY}Nwk;^odP{rxO8|82Y3*jTfmbt@Q)avwxxmIu$S
zg^iKQ>wV_mY_;7L`d~Y_&9!s0kf^*~Mqi<IX3#0pC1QqU24`Re#(ugGlMr;;`81V_
z#Fb8Qs-Lo5;3Pe&wU2gJUets;vSA~-1sTtOr;K~~ZF}tOGeFNnmg98YmQ$R5`B5VI
zA5?hqDmenw+g3HGi_cRU>n2zUE(-*?)>nGXlOt48zr7LzObWTA(^iT<=ao*gzilgy
zq1F^~$%N#~#-T*9uxtY9p}zW7S_N|<?Fq!IjZuq`oZhU%E<t9!kNn&?S}$}BVtLhL
z+&%i1D}=3C5`@;wrg-(V4YXpStHh4E4h9*0!v|ktW1Jbo)b5a9XF!CZ!%S$--=R_A
z*DzdqjYj>5<N3UOG}`)pzjv#$#yhJhadUO4u5M!n4;o8L={Ibj{;Sa6D4L;*ZFDT#
zicQv*bWl6@TZ2M#C|K|&YgsjPDy(7aawQU?&n?odCPS`^P%>oZkvz^N1Kh);bbi0i
zemmNQ?*#ui0Mi`Lpn=^6b+J>&`wmc=>4x}Ph;Azs82btx{QSFwAt>stFHsZpqUW`G
z6podj@qQrrTIH7X&Ami{(rIFnzr9q_K`OSgF-a&dU4l(kY*h5gf!rjrWm?UYQ*H1I
z)m^+N>lTm<SyT_~ro(H)Qw2>n!tHmgyP8M<0f*}~H{kHn?d*lmu`QQ({xd$Q?Xk;x
z)kiJz6U|SsJ2!qm@Y9p5N{tP-U>@Er_k<0X@HX*s<Uto`Bparyhvco&S{9*?sR)6X
z2I{FksamCjTtbqg8Wf>A*bgnzIvwrXAEYZp@^6>KD`fUP1J`lUVgqug3+1cG9Rp;0
z)g-QVrN<5$`Ib{RG$xSV91{WWMyZbUnX#mzHMTSPHeM(&;FqBstGcUMb;i0geJVA=
z8yq2iZvTOO$n1WC-BD&S>&w|E;n4;e``TXNoZ2bUSy<OBMc0OMi=5MyBD(+7XYHwm
zxtJqp<b0d96|U@K2fe-TleP}--B2uknpT~cTTeeD=aT&Ba5z-aRmQvj5g_MpG#7{R
zU^7zR|LK}3p1QLxq2S|E<O;hjg2S`9zX1MQ$@WezM&_C`(jgE2E{0geft^fFzcgQi
z=zQNcyK@Ei*;RI$IfKAA5zPew<srQ;-0^!e@x5`8_9yxCEv;VTUkCq|O;^bFo?*wY
zlu}!-e4*c+lZs??bo}#|GqL9_0t0=5gsw&7vYNKsA}XsAW%pG(mYZ{g57+ZTrh9}X
z71*hkSfmH%s`fxrSv!{bM~gH#4-UfadL~Ed6i35!>j)LWE3CIkat`F6m<$r@2zwb7
zTP}~^$1?zQ9@VSsv+gxdSZ2WOF7I;3B66&nDpp(BqhNVhYedr^nn?{-%N%PT+w(~p
z{X5eeEbl`awS3QXPPN|3@G*F2F`Cv#7WGm2jnnbgdGJ_ZAB>%k)n2qMT*>At3Ovn^
z?Bsr*8UY`IZpjFK+o!zBKt?Xiqhfz3!M9I8{5U9!vw>vS^7QRXm{MmO^KLd7Yc*rt
z$E?&yg%4-`4Ly6x9}SmC6hL13>9QahTuaOzT1_0$Z*-k`v4QEnffQETvlFwpYh)|O
z(AgS#!kCdnX(BMZU(d+&=JvCTRHuMRER~lfH5M|h<E>6wayUi2B20cAuXUF(Jh!@|
z;=AC0l2$3^#_tcsg{6nSII9!?OL(xoy4xMfqhl=2twQL?X&p5NX@Z{A8T7l@PM;f0
z<j!yuQ6We;EA%33Q#00eW|e*`U>i!{2$!=y<Ln3$Lk68hO)6fk*klIbvK^s(W!_O0
z0E*g3EG?L=d;7&B{_iK}&v5Mei9&qydKBygbMW{$O=lyBSi<d1mi#0G>r&*dlW+a&
z%gr+&!*w-8sro!fbpfrI6nLk8A-KBb>v2Gp=L%xQlG0+o_6vAoA`^X`D3AM&@q8(C
z)O*d)k3nJxmdvrW$hih@9`}ruR>x=Pm0fwy-q(4mH)G|uIUWrjqUR|B&;<6T^cst^
zk2uQdW*XblF|c0u$Hk@f;1V|#o4IGi*2W@lQE>!wA7liGz;5Q$YNuPTceTdqZCiA7
zn#^<>wowFk2~7{TD-*#RcnirJ$@^*JpV1h9I2}XddtG#nE=8Cls01eWKw2e=w_TvM
zk!SbfWKs<OFW^p&jg1HEw5W~{iv%+bI$8#ZYZ^vOzFTn{Z&3iKgwEvzbWTNLwBX|9
z`q{gY71j<WhTIu*@*vbP`b{T}5Nc!M^q-P*46V|dkNMd-Si~XwBi2mNiO?Ful@o16
zW2avGZ7`)6Nu_OMWgMS(&Gxl<EUA|@>(gX298xjs!vuo+Bb@v6@?p_%(ZD)$W9cUG
z{qPa71Zn8QUdd%z_+KG>?hULGqh^TSa^8F!l}lT=lAB}m3P&ZXHI`AIY~%V_+JgvW
zR^kWvg!ZlI<)QM;T=2I)mG<%RLR$yQxP3=mNHi+sqsEvEXJXinAly^rAm4DHt#|4R
zJay<k^RAx3HIMcL?0<b-z!~vO9Gm~%f*%n;CRX(-?Mi3~YU^U?dv`F0uAL+{Bk3(|
zW|yhEGiF}oUYd>;<CRv9b{mCDtiF{F&!aONB3UFY8f&EGd{#u4XOS>0e4j$%S5JCY
z(Q*jjEWf`g$crW&L=F0!1wEDVRootlCW~6J3f}z`tPLwTWjI>1jO)rW1;}VYX_R#e
z<~WSW1vvD9Hbi&@&8Q4xeNg()_P?L*y0RecyGJpxVnevGvdkw631KerG0_VEkVi~}
zS8N)yn6Sc0mnzE{RQ(B4j@EpaK&u#;^^Xfn4D&N?pQBSUrch2c$iIOFBbAA+lpB+m
zLu!(x70&4>&gqNHqq%;i1bJ&v;Q3h-6#2K{&@P53MM!||u>8eTauV(T&;&H7SR(I9
zEG302$A;kfx5KKGIn$s}RZ<z?$eQ_p6VU!qx64&UbjP)mUVCS#ENlzrI98>;cpp^|
z=|h3M^KEE$g-CHH8>>puC?Q}*zXE2XA;SFCjTy$#;p#id!JUfbfDZO3n5_`HZRbML
zQmdV0-<k-<LFvQlj9zYfdq9M5Oq<E%ILrq7I8Y`s&g8PQbJ{{1W1&^R!`~GEpEA#v
z5V?Qmw6*hUT-Zp@o}${FkLsX3k5g2yP|D-^ffKS)RC7P-iYV>-eABFgW==pF=_34a
z#e&^StZ<=}n#eh^t%>A%<Hg;uK#$!r-{lt1$~j(Z0Q6VG|GX;f{&m>+OD*72>#IoE
z+g`152D2}%DY%L{)$ivz>U8?okZ>#@24@Fbx=?PBi{};9eS*m+@Tc8YhO5=yPT5*R
z>wAL)lLYyEJ=N-GGbdFlGSd#4$+X;C8UGroRyJgZfx>Iv9>@GPv(|YX%4XgYv;0o^
z04UtZ?Ha0YLA5UUVQ`{9|M)Jx8XmQpL*R85)#vHb1d|JwD{Sk|^(MnaOYX?L2yYKR
zN{s!TBzz^_)n*`K2Wa2$d^@ojOlYdNW(q}9zxtfUqjD)Wy_(4P%z5OnQmvPw?s4AX
z?DI_>?ZW}(*)sJ{V6_CYbD-?0nR54WrSFfBeOu=c9>DNBAYMZ8ti_7k4rig~qnUzh
z?jj~Z4FZ!)$+ZYt=O5Z-y>cZiqs&q9pAF$GdN!T#B#+B+PP<UhgH5sC<>8g^{SHtM
z<FQ1R>6Q?mz1O(br0~na#f&RbU@}3lDM1U_F{po%FiWqFM_)P+d8&D(i|`y-ycI(H
zm0(0OISg50F2+U9bM}GUf2J2)&R6m!Pe|kSM>@KZ{z{Cb^Hzy`N<*&L<3VbjZ5K}&
z(~ILbw6qIzIbgtcReic7xVu7aN~#>R92-;|M1}_5hpfeW^ZUY+@;jJ4$i?2gV55lg
zF~!pEUxsEZDfBR09D`kVpWjl;H6Juvd7T?a=p@@ll|tj0TQkX9<WvxPW@fr@(;?Fv
ze}O4RQN7WYfYf7(?B^v-d(4cf*G5%0EqfFqMj_$AlmKuUG|AYhM7u|!p<#cHi!ZZY
zF>`crCa#A&x$1!hRfV_|{#XjL->YI^5)5|pXphY=g=^3!lzx=UvxZIaVlO^UM9<G@
zTB@aH5YZopTg!Ha?k_<t?{J=RC^e6utJ!qmSd%1^x`Z|smmx4wY-4N5T;HGYMD41$
zv$Gi2SB6c7fI9+kVqIdGW?6+*NOm2JuDc?Sl~6@s%1U{}X(Ni3TTyB!9_o-yfvUo~
zwInHk$-CVYRca$xNEiZ=tAgNx-U<{8-Xn8-%C?9Z&?Ut@l(SzlKp6801_CN@jK|j{
zPX;Yfr|=++eS_Xu@ce|0A*Etzr@@N2ahBm|vYP^=QYw_(Kj}E29G2dl8$}J%<tvZ^
zF}{RhoP1zB0#r8t>(41^*&hThjC1NV8;zd{ir$m)m_Ep!?1;E_+VclC*~(nXLBWG_
zuv-bgb6lA+if1tZv7lzNnR-%U`%PsXySrEg4Xf3awdeLm3R)gCV+0k$vH$2!C9cv1
zfeG}_|G{&6<9qjqgCEiz2%l3yW(<ex;2_yiiad7??iakE^8L{dr<1^@EVGHXCM5Y&
zNMuaCF9tY|KJs?Mt~l|?B@6^N)MPT3yTm>8N0~*BDqM9lV_oIYk^%9EXkPT4k$D!Q
zzIyPyVd$d;sC_UWcH1Mr9;o!N7mch)wkxRVo`wI_0C3noX2J*9IHe9XLi456&X=zo
zeD%F$uFud;k9+9K*hAvdl@F4^lJM{?YJ`G^`cv0BOvq@2B81v%dB+Y3=J%h-y%<;0
zRdaBUtw?N*VZWx-Ao2R<Rv8pBmRUW^kU&JKIlroh%Z-oa`DEYm;|4XNfkL8VjONJ?
z32H8#JnMjY{_Af{p&-7zxV*e5(^EwA<*%de-V4bxZg+VyPQWj#xf63(Qrqf;u}X#^
z*Z1u_b^l1rG$PNKePK5<_cp59nhl7@QvOJ%s;jQzpJ-gsw#<D7vZP5;Z%~|6<aLCR
zG0i<fwNu?bWtvjPQu`W2LomVL7U%KTEvlov3S3r(&iLmlLzUI=g6BP40lUahFknmC
zGQ>ZI!g&$~pMVe1dZzw4-u(lVv6kn}Jbm@a=loY(QtE+A$nNa$@-7YfM18mYdZihT
zvqoUG3}%_a@w~H<CGce%RTSb=xe#W*#Bv-TCa=){MI75&eTIbMNP4K`fSID;0>k_A
z<mQx9n&7(`sq#(0X;)X;I5Spy$!Y2f3&hOFIYzes=4$RFx?an0-RIDAHF9TY%aC4t
zIlL#l2T%`DnI!l!e-;OMG6JMcpwmV53l7ckDon2;_Z*h0)15X0`@Gk78RM@-{U3``
zOXgK!ZE}5OWf<OrL0K($4mrR~r8F_hviwd9ON@mM4ckpIu<!=5)Jpe6m!tR(JtJrl
zdhD0%8qj(dSJm?+LlPP_T(C5!v6~gU(ZsWL@G@zP%&5>DuYsXh_mYG~bpbcOs1l0c
za5{=~dRKBddF4E0^HjjA>GofjbgOt-X|8rdacSj*1;s8W20rpD*c(M^u?kUQ$CNBb
zbQk62A|yAr@UETxln?kv{@=4-<YqdqhOeq(Zc>(4W2P=#Uh1vzQEuU*0;~i?V~mz1
zL~~I;)=h7>KlN*rxlfZS)!k~QpX?2EyH&P1625n<LaJ%fC!)?W3w3Zc1gl#a*HpM{
zF?jk*;hrpl*XW2wkJy38r!TP15qxJuZVE!8LN`(5=$Dh#;OM`}IGcrs(fM6x{yF6B
z2E?4}{}pbqbE?#g5XeecQ;#5`8Rf<#xOa%L(pcfj!StG9>l~E_=4U$#3}K1;!JQ?2
zPJxDJux&=mtw>tzBdcD?1yk0nNg{pVS`|gHZ^ghuDyNB1_J!~<gk2qB{mwAku~1ax
zjPb;r5Aso#Oq!N@iW#Sk)#-=JW}jNKCFNoy@3;KQiBiYU7Cm?dk!sB)1MRRhm5SsP
z7|tbPJ~#vJY5K>3`t*NhOq8ICh+|K8T^^As!DeNsn^dPocCa6um3Z(_D?X-^RMYR_
zjorfPUN6$7QGM=VAsM1YVz7u@jlz2rRo|jw$CdR=Z8{W`BNm816E%Vh2bT#=UkpWY
zm2}YwA~Kl=Z|U^k5Nxa(S8(^Y*BzIRSX8srO{7K9a2noC1V=hKLiXm<^cmoB8x?TG
ztnOrS@Z7DjccZ!30xwGSxl(s(B}eEX=!YqCBVKB;gq~uprnnDXlyi}c+n(e~W=9k1
zh_H?Ib_uVH3Tt=^SYbK>C(~)s@kEYy+)8)ngujBCul_seK!s>D$B@ZP6<mcZE+awY
z3L=k`m1~gWUf{=5nr2jQTjq~qpC#4O%3}+|T_nzUsUOPZMhkdR$}h$_I>>k7)NF+p
z&?i`<Cgsf}uC9BvlL^D_uqvb{?(PHeY31}Jou|TTifQ61oMD8F0EQ)u=$iD^(D#E=
zs$nP^jJ?gF^!mr=tc?0c$%+E^kP5Z*pRpOuLFW6Y9tMx&$uV!=19oz;T5Yy_`KOdC
zdZzn$vC4Oc^E1(^d-0b7TAEM6XmBG&#X>35zv2hnhW-P8Q(&UH`WXD7P=@d6E^Xsc
zN!y~>x}k?M?0r*cr|x^otg-~986@uJOcQ%SIOa-iZJ+4ate?=#j6d<jKjrLv(ehr0
zr}W@=@uPx{uJ&%*m@>@?mfHl++(Rxngz0Fa(U4J^n7caTGDZ099f|+U8qhB!aXBr;
zked3_Oh0}1tTwMG$KjiWco?Y~K)+|c2X|yp(ms+D%0%ICRSu%g;0Grjg};)p@Ay9k
zQgFkgf_G%%-|+LdW%Hf+PL0J*v>halvUE5Egy=bm@i?pZ0`4Q`DfxWf-=7u^4HF?e
z1d}_<EJDprJs*X+MN#7jXpL*tk*Ab2CXkg6N7YYupM8tmKVR{<pk*2Tb0`qdi6`2R
zDuF#@R>cRL!r=-iDuy*GRXR;h)peY*PXgsrP0M?$9eO1xv?&`yHHrdxVBY?wI)_^M
z!!!5saM^JGwqONpKiE`*rer(Mm`Q_j8}d1yI<xXr*lj<k-4gl;1G)8ax-7dYII9qh
zTyxE-RGO4DI+%~#F8aJkDPcHGw|E0_AFd(vh3mz>qU*`Z$tNeRBpQ_C4zwu_t=zh!
z@r(44`<EHdAVx-etyeUIrZ!j|77>GnLx4-Iodao!lu{*~fTdc(WV2$9eDxU8zDH@G
zA0>-bIvO<(1hikd!d$`I%haFRo2mx%B%k?{q1PYjQhIjEX>mLG{dPXx-lz#3j3%AQ
z<?;3Tq58g`xW5f`@roYa&U!Vx*@by^!9<I~ri7jF6jX4_?&_BNhy214+4_U3DT+Ae
zpz#B*T=EYJtH+<*$G?a2QK)Zlufn;Hb@7<3Vo{}37>l-FVq})Rn~IhKfAO4T!KVo!
z|Kz#c4ijonJVTkK*t4qe7RL-9_C^@#F$5e|LO5S7L?X=dt|Jd39T0#Mf8|6)+!G|f
zdN9}yatpaC9hCQNv<`e7Y@?RaJ1J@LhvEvgtPSf5xP~1H46ggF+ElYDqI&3JKqfp?
z?N$@Rhz*7xu628~IE{|j=Y~+3Uu}AFkfsfSCwhgI1rGyvhcpCsGBrO*AyZ{&WaUC2
zEXG62FmV`FjNdo<zKRLjY0Zbl<5Kf1f%XQ}wv|bMwX7vgJj8_7(xKZEgpv#x5b!$(
zwfO6!1-rQv;#OYEPoS`8u=h{EO_4u|H1ch5^0*(Q3{k9%<i*Gl^4;R$MfF9l1@|W4
za>I1zjPIW-d)4eAUhaRP8<HpRb3ltsiVxy4|5<VqfJwoOioGhR(jIux;NaTNMCUs|
zpz$jYgNBI#-j|Q^X?$jKM>)1oS8R^(6z&4*Y5~}GigJ@XF_uZqA5$I}j^h{EDL3PB
zUZ_?;p~UZ&l8FdM(Y0`k-WrnwCW5wr5e6O7r_A$SsociN^!q;{r=WHAIO)^%TR^KW
zlks%!kc05Vy8a;(L7BlaXrk1K=X0H5+KSmXI;n#TsdE-Qu7|3^jCzqAA(fJoEmSeH
zxAk663H~0O=VtfY%AT-Y9Yf`uZCh*UW<6d7aD3pggFe;W<7(<i6YtQNufz7O@!b>M
zL_}T7al!%Vr2R`tCo+r+ed!W2Gn>rTj~)`Y=a??z)g*8=2xm<@WQP{6`eY@%xkmcu
zfHwQ8W#a=26RIN5JtJ25>%jgqW;Eb?c(h_PjmQbjEWE43B6KWp(ExE2*iC^0p5<uy
z$!<9ae;nUDh16&-fT&)&M%)$2_C~!W?i)RZ5U3M;>a)&9Dw-I|AOD0(2Su4vwSPM2
zZtBg3>!gz!IvbF^pHD8GP?D_7szdxp;B1`62jqo9o#yJJw-rf(%emAr2>GPfg*&=@
z*o8vN$CgcUkOW>}b2&a2CLyDEw!enveP7D2?y&!h8DunUE2>DauO$HKdoFF6Pj;f8
zzS1#EN-g1?IcR`xJ4oWUTI2sU@xUBSTh*v^vR@jdoOg*DK<8-dVjHydPj5V6yZd57
z!9Hm{5CjgdgY>vi$SXnRoCi|aP7c>J<!X#hCC*trfl#PnseNgkWLB>p%_rOCo`Os+
z!6!d1N{Q+H98lbts3Q>@hNbG<X78`R0Jq)WfWnF|;Plgy^Lm^Pk!qO61C|C#?rRBc
zHm7Xd%v~_f_qSUu6k7>|Syxd9neH6O4o4>4I(xjRj=)@yQcAyZE0h&DmIxM+wP2U<
zWn!zQsua0MD>K0afeZni=*mZ?j11LKYmiZVww2%YT*{y%<f*s>es5CEHnb{(D<>T`
z#W8cP{X)L^<cr*miqq4_kYAUBj`{<CkD>B+R~^kKmZ@+AWEC`x2U4xzoKPTLQiaT=
z^)x3yt{y|(bcr9>5|`<FqU#Tt*=at_yai(LkNYJU71UV5;Q$pCG{rEt1h(nbHI+T<
zCG)FBoR{EuX@KAJE@U7t*3%-I7OXhn8N$%Ys3@VmqdGC56N;&5i*e{?-PJY>21L}%
zsDSMgi=Wt-M>}(9LrYgH1|ah$TET=7hkagP^HvM>iaiN3WjZHkePe5yh(b^B#Zr7x
zv>Trk@I5CMydWW%gWdQxm;lB`FzsT^Idj52b<+*_)5s7Tbzjr)@980yWJy&(a#=JH
zgd$-j4;Nun7BP*pDtl{w9U}{Wz5|tf>yVqR_G673Y_}_<U~XPadM&dsXMF|s+VQCq
zy0iHw-N=;g;Zz5VA-O$a05YqD?*V8GFGZHa&Y~WVrz2_VB+T)zG9^{Itm-`~SMVxi
zT{h}L@$|5C=czuyfl|n|>1<74?Tu59l>NH0L#33`{_8MRhYH8CPYWo!kAWI@RI+W#
zjVcE}ogoM40e$e{$Wb$_Ac}995uCo!r7M-+eb~o~1%fRTa}teQo_6HU6C!S|>da8d
zK5-fr-5-sx6q=7G9@D1x;^B(dTjb{{`9<5$q4e$}6lYu-`*#6)Y4NT7&-yWK85ql@
z8g_x5S{l@hG94vNh?miU#0*xtCtX(Vf8c_qy^#rZBeYxhTxxq9A{0&MqqLZ+LO(NC
zpR1$53>WoEHkMfe-fJ40@Ar$j(woen7FcVupQ7Ja9%AIlU7Z1OD6+pmEn2IeYt{lS
zmBcxZ?$|FN5|=v!BJ5fu{4!hCRz_D70D4yjQ;O-hbZhEya+dIswm_<sq}>`2dozi3
zqoZmb;NE<)N=u0MEG@PLiOd61UZlXVlOEg3?|w{Fm?3nLY>p+Q3n{ane@ylxXg20%
zEGLPt3BiiO@c0k(wfELD1Zk&bB=MVdlE~!r?})C>$L=z?Y-G$uifU+7q)Mc{$9+=2
zi5Nm!?`Z~;toml2a?ok#BQ71prb>>kV}mY^Kd2YVde)c3B)#Bh(CaVR`&JL=SWeEg
zlRp5NFIttjRm1(goYkB}6If_C@07r&wba3CE$^~(P@`*WDrV0yvMI^~5p>Hx>4@VM
zto>tt<fP9!-T4!_=6BEuze%g*vI&L^oSjJfUPRExG+rOfYEPiQGz78=ZKa}k4(H01
zS1h|n`5>Hty3L5mv0{!9g^?SYuo>r*IO+kH+juLdNkGYWPM_QG)pYzPuxxAGWQo@`
z3FEZortgs;ao@4v<}I~)%vNXDBz+?91gk5Zrk#79kdoA53Fj&2P)}n4V~A4yOgP(X
z)&227FY9i~6#JjJajH{!q+Idol7K@$v&deLmq&=cQ|*3LYRBU*=75Y$OsT3_1=-E0
z-^Pa4hxh+XHQe9s;Bl>RL+BM%v?m{&Y;F|z1xFjS+1gVn!cSlk71Ld*O`M?k+oryd
ze4AkfevRF3b7j8b0qr~ql$OP}Q0DdUU5QSzAv}McDv@cjtO|F+wXCuj^vFB=-dR0g
z?DAqqxNmTv@M|J=M&)2|fXcpWX2XC#N|$|ZV5Kp_h_anh7CP(5G~Yyu54dzxmEUQu
zFAOV_2U-rhiAs)PkK<$&9!xE9D4>F9Pb63D+~D~+RoD6es`)r8#3KB6RcVtAI;&#q
z0$fy@0BL2Vio-MDCSK5uScX>T;33e#ZmX$~@1Xp_Om~9P%BJIx3D7ghK0|g5{3X&s
zL|6Ia6SKI0@6-Q!b2!V0m6215YntC9k38EFTmXwe#l3ylh3`(Bx2p<7jA+H39^>T8
zAVqtGQM90nWSl99jO<}|ErhFMe|m``9ZMpRY-G%wIodAm)jn1WvLwk`kwVZcB~OZ|
zR(Hir+N^2H&sqx#*mKQB=?Pyrn)utE99qcl!`2_+h8^XXFjlyte@Qus6#HQRK>Def
zW){`C+A!CxH9sa?#t$gj<pa52-mnFN;|S}7$M_N>wcm<-U~A11u~Yc&xzC-nyPFDE
zKdkWL4Ar$f>RWkie=ql%0@5p}O)919%yzEg5P8y83w^gcu{jzaggncR1_NL>Yb(HK
z!wd)l9>V{dA_m#)!VVmx$W<go4;e_~x!MuHaIU7!B*ZFdRApOG$BGS`9!Ti?t^y5=
z#%*}DRmFBO{eUJJ)JzC+%_OtT(g&7}`BBwK6S4&X+<5%O^a?1x7n4>ZfjUt>opZpq
z#_UO~H}@pZy2^3FP4n&L_&9mKMGep#XAKSu)KxIPY}f4)KddZQbYs5W@;~w<c3Fq5
zp^Sv?|Bib(BxBfc1=UVlkt7?l@bmJSMul9+EtJ~}Hu>uQU}V0+X<vb50{?;Rg^q~-
zg>n&3GTbwWz2{hplLYs4U(LvjCCMLqX5o-oJ*mIxS8}$;B~I>RyQKc$h6z1u<%x>{
zU&RPeFt;i?KoO+Lg1K}=Bj5_b?+D6eArD!90NvC13+Jv#XbaqKTk6dMlelF0fwRX%
zXuQ@;M+)3AG~=bqVCgMxPuSIInL~P@Pppqwz*G=HK|%T?plAxw31MR(x*m2w+@7~z
zsL{Ug+tK^`#0%g*3T*yPV3rn=e-Xk6n`ahc{1Q^LKo|x6@MvTW59&0(KdVo3-in7M
zV!!aN910W5`%FD27erR%#FNtyx~$sx9vvcax{f%ma2LcgZqa`EyC&PN1UIg!9P1j!
z7lrtws4fw8*Yb~rBP&n(lLDuTh-u3uWN~R6C<3-3aAtONR8(s*L>V;4Kn7R2fUkyH
zpeziiX)0_`=-!vHrxnB%w>#OeS0c>&kBcB32sg2XM6O)`E~y8iaY6z~AJeqHR{=Yu
z4+m~VQ=uKUTXt{KS7j{#R*_PfzuTS+E_(&?M)IG<c+Dd3`*Y;c=)H~jy_<0!^Ru(?
z`K5udQwY9xe5#ss@_b3eto>)DVH_Mw0IMN*Ev)U6g#4scu5U6{^|>U|l9`>EVH#i+
z&X>@Rr88<Y{J;xb)-e;606Y&m^rk^j!Lo()PdUN>-b2#b0u{!w!~A&C2G^?sND?q?
zO9Q<Dd{^WE9_HP41onSdph&J1eeNGLrB@<ek4Vbqiy*8flGj)_F59CbmdbKL6;==8
zNE`tliyXPy4@C4liAZ15PW|k~j-yL|ymXmpkg-%sL%I3Ef>u@zYqGJ{iBjp6fcT05
zpX^XlWQ@pvh@Qw&>o7k)jtJH{q>?2+y!o3CR)kN2hBeOy>2$=#q+(sAu(d%~@lQ73
zpgRPVS5Z%_Q*L1Cq-sCx!Hf$b?Kl{R8D{KLGLl+f`5TKgW0%ZSb3xmz8-_feMPaou
z_{9_pr~^zo5T&)mY7@Al;|`9v7J6?^5uLX7Ni`8|=^3}XjkTRCDL@*b0#XU+e6ub5
zDU$0i2q(--Jmzo+vQr_aFtO)>^-mdYVXKaW%wl=W8eXlauJec7xU{tKBxw`<tSTMK
z?ZEsD8oRh9&jR0m_KjZ<^A;2(_^mOPGI@SuOVGjE0d|vV1;jVfMpGH_RPmE%L3Qya
z*sQkNZ^{@H0gcznY2_s8`^fv_wL!)Sm2(GGG^jzc=$@=AbT>1Lt&p^keE+>xS5o>G
z1`eG+6Rf(7CMtW3M*uq`i)u|Pr#*k@XkTRPdMmPcUgG}+dY%+w-NMyL>lJueP_a!4
z*<;Se!xon>6L5b{c2ap*9oiwTQ(!s+GftJJ&1x6MHlh4w$4VqrnuwpcZhydP)KJWn
z`N|_d&*P}(>7Wv_3|N9STbRRi)$@DBaR@j@E2bQH#12N}eZ2rc3tRn}dTrX9y*WPy
z%9CXJN|ZKK|HB^8mWybUc5X$Bxv!NJ4ATk1TmiO0YT*|yQTz8b35O}!u{e^H4i~|F
z!rIavaP8+xf+@@jLiJApPTciXW=Nb|VbBWYkw;r!O16cH?Mw(%^K$keiN-ncTxB~(
z?Lj*}Vcf^Jw!wa`1vV8yhnE)<q40+C>HE?AhMhzuF=Tn59_md^$=O5lG+T<Dv8F(U
z&>?41v3*wFPjOFB)A4pdWT0Jd#GPG)6(3wTv?i!mp3Y7c4Mg9@N_T6+CTrE!PF03=
ze<7?z&bkmgm)?qMx@6>yCKeA6EF>)<`p2KKNa@6y*>MK{<hGrO4Y+UD?z)TZW^7ja
zOiE7%_Hl_OESGj)MF`y<a~vd=VjM(%au?}N8+f0Di)slZIg-AEW2l%&RrlrwzkAbb
zC1AyK8Hs++cl|rVw|C3S&-?!1KY!J)@BNR0)L$=ws^+>hw*rRS)KU@#p3<Ne;B9s_
z2-ETYy~DlCR*N>A**DE6bL<PPnuVH}bEfGYsL#41ThW#s{#_jIrOQTG?h7Ige3^sg
z6}(@o^8soK$hqxXNFWzp!uTmT2y04vO9&hQ$vS<$+Kw!*N)qpqGprAa4^(Aob`Vml
z-aEA)I@?ium671F9ITLG(`<V%b`m74w&pNvjyW6Ljyt!e!38q;{+XS+4bG)A3mhg7
zAK=B&eqpIS#Oemmzkg{*9;~?2<j?&I{H83+EkV3P{2?Iem&hQ8;lXt)QZFO3>m5F(
zCSFr3>yVQxoy)Hxoj|WU$w)%Hk*HduJzX}97HcFcVS=KAyYfUZ^YUzMJeah+KQkXL
zP06@1+^mH&Gv0ro_Fz_~jJL?kNeqRAU3e(xqk>3WgEMu9&fq95D4j|J)u4fSgD8hv
z6=y!mzE8vK5=~EmAD~DGWr5>Mft!nXXpZHKYX-R%*KTteXkeq^I6955!#ZzWM->g)
zy#56p>B$*gJ>sU;UiQV~huke&#s8G#eemI~#$&KvB+IR3GNP6?w0ZWP!fx1&b!x@%
zj*<)MT4EA`7KgX=hLuUrwVB~XvkUW<D`H=H{$0V8n9|xDgqvHsmuDq`&-vMxG1kDe
zV70sS9XoEkQ0GA@RvzGS9Owy({MO#u$~U4isXSe$O?Wj(zmE@kLC`+;pHacCgDOp`
z0Rca-sll(7!YJ{P28hDFuzsB)J5DWEI_^Gs^IsC~=AKqufGf(ir~Ls0b+H~_uL(Ji
zk*uRe?mZo`QZ12?s%1O&BNh)n^38LJ731mXlldb(`s}M_N@Uoy6VmxcV}PIPm?ci>
z0`cpdIVsS<g3&d@j2pt^=9=&X?OTIH;;@Az$Bc@C)0!NNRTtMGAinB`#_E`aQ1J3J
zM=*^WEU&!bP8K`8QSy?Sf#$GMS=2F`B>!j7w*0kdg>TAo57jWN!!}mdIaOni)*+f|
zA3>*x2Q$dWlDP2I!P`7+ZS=W`Jn$&@i3Tw<i)an;NBZ_@a<(dO0+Un!l&AUjKW7LP
zF_nLS6W!A+37#uFE`8!jDHk&FIf97*1E*bKJlTfsA$yRftyx1Ga`6)4Tfh!Q2sKN2
z{#d)`Q902zss^1*z<U6~An&2}?*SLrd~Xycty9k;mLloGMoFDAl4kq(c+r(QbeH+V
zcTf>jx<~W{@x-Na4`P@&v3(CJc77IC)T*4~vA2-eby23_9Gc_-$8>VN(v$KzVctbL
zhcu`~ufaCNlS6h@W8oVGb*lCb((bNEpzjM=TI_ES9l1--hBJF;S<AfC;$y_@d}K}h
zh%85|H$b{iWK8;HvW{aiwADr@J8{#Pf6EK2sdF%^)lLoVFw>fGZf<7k8ltz#l-5W5
zMQAoErSxs6LmhAJm&kQY=y8_2w=BnNdj~jfbJc@|G4&=nW2M*1IM4i<VOvtqgS_wr
ztMvmhJB8PCdRAQu&aAMEqW3)qd$dPiHfnY;f{2R84Tq!!d6v<ia(qRHk=QFkF}6&+
zQ$iwfY*nu0T|6lMLd<h;kk7&C>)i04K*uJ75i1tVvx0}|SpK*<ze%Gr>d`w;scI)W
z6;^v(PcguA+HPU21B)k)LiFoUK=%AsK6pW8T7x-0)VRW^OY(TW1vo5Ip^=}|(lQGn
zg*rdHgBP3}rgSWGT`Y5OsQYMLbb#1=M1w{;(U<t*ob%Y7FzEa-dZHG9`<=%9sMi{d
zW0xJ>bLEISZXk~SZa*Qk#)dGOkg<Vhki!0x0X1b9)Wkp_iI2VKl<`6btyOweHQwPc
z8=|6N=b~?|!~o+rfSL|Dx?Kj$$oF^+Api2eep=G5N@Q77OlOF?Q_BzHqC&#~EXwhz
zi73W~rDQ#;6xPHV)Hk^`cfK57Twd+C8HOAcmX9%H&^Pqovx8NRm-AU;yG=(t{>Ao%
zY;f{5fS$qa#5ZlsD@e)3<~eSc)@}`mt%k|{j%@~8+|mD6E+O^sJ|4k1>Q|l^{V`Aa
z4e}9c<ag{lL}Gw=Rj9V;liX6}_Gb3lHmP%0%ZL;;8WXau%XC2CK9-NNIU&H1ZTSgV
z3nY%GkUzc0^537FRJt~Wvr^Zr%tJ7>-Ci#WcNm1aavRtCxg+T_exTj{XP4G_JnmmK
z?ptQF$9xK-x0_1b4)|hUZm;4@OqAoHNmKQjM}4}^fP;-3)JN=q_?icnc3hK@uBvzQ
zt47!0%ET72Xy&D>0WL_r3-LYPk?uXyzX-!>H!V@I12gpzo@MhKLU~m>e!c;ar?4<2
z>EEedJrnh2X~4E-T(+>Vki)F0`WdMP>H<^3<FKCg<P3Hp+0hZ^WM(#D1$SKlwqfgW
z%!OARN&(LdF49cs+EzU)c0HK6Bmz~VJ{TTkL5|<UUMptpX#RzctDVi<O$05Rx7#Ue
zuKVrtRkk+rTbs{GudU3^wwtbb{u%Q1+=hy_v#sSl7{(Uow&x{V2eNdRcbKpQR}d-I
zN;w4E{S08vn&6FI;|f;2np~eC=98IMSdS*&H8^~y9%AH|o@qFN0a6i8V{VwE<FY_d
z-WknAvC{%V$H^}7Gpaft#NE#~t1R2x%Y@2HA`CdgmtjdE5@(m0{b#}Ml~%6gl4JXh
zuG+#`Ufjl|YK5v9F9?gL;%`=V0*<~|rTHwXfnMtl&m?udA!J$Q^rlgwEQwhp3%yY^
zXEcCJ$7<g1lV5_NT~y)2n{4K6_6p*fQ!34VO%_6WHn1y&X)S}uRvMmlO}j61Q{HhK
z#ftG~s?(I%`CDy!ky_WY!In3ZLOrN!{z_#s@3G(@6#hs_ulrS3E=cVim`;)%qq#<;
zkO75g!>~(t{7=r6+)lr_=^>oX5!lA>L%Yej3mRTK+~Zrqr|+_bStAj0t0!FJgNgRF
z=+WG{nAZk6m*3&XS0sf;+b~8>#X+<7Q#T9vX(}^=UVyt&vop-Va<qO{gXNtxl<5{m
zEf63&2f*djfWSXQB2V%vuQLhE?&`fw$QoeV6zIH!JCSIjeO^@HFwE53fPzgBnk4D>
z$Xj$xGHZT$T1l*4apPYCuxw6q$k+qT;(5Mv(wBKQvh+nh6D6<ikbv-=70)?CjzAY<
zg4^94m7*xiH^spK7Mw*4F(BK`44*R`ZF&!{(1SgX;A+W=UW}5@qf*F(!t8G%h-UP{
z2CvlQNMK@YShGtkX@JM)zS<WWn1sW?gA0;EzfQr8n!%Xq|7U#rF|E5RoA&ThLbKnX
zZ6YcaV{cYa$j-HvIC$J>qlD3nmbS``S4080;bUux7`j&!%i=JKudiZ>mD?|*sAqiu
z+f0~SYDJ^B<qqDMQxgON$K~KT0fZhpd8`5+hPl=Z#5K69BRP3}MC{U;O!Qc2BaaxC
z861+T204agoq``o&UL)D>i9*3t`G0wHX4-@!$_i9Px2Q6LbuJTzR%V1>w-5eW%EB$
z%v9mzh{Dx_mPa=KS95O}Raeuji{cjC-GW1KcMt9oJh;2NySux)ySux)6ChY{2y)45
z$(!%n=ic*c-yUN!=2)z!R&`f(byfB3S*j2v{-h6zF>8t}S3FN9cD2%W9&rh;_Zp8X
zJ1SuwPiGIig)i1KfHac9E1EB((7i1gqv-D~P%itqt&@s1?bCNFheRju1tBdT+3~D|
zR4glAqdmu7qi>G%)iO2D7xSgQnPz&NrI@*NFBZZUuNflVRq_bf^aG0@xM|K}yp$W-
za@Mqyx^AAgw`~ZdJ|vIgt%gOGWj51sc!eDug>iQ_Hg?u`zG|@Iuq^r4JCTk$;7m}b
z(D0jBJf3`4%l0a2?teE4Z*k}&U&gXnv%x()+W4rSbFre?T~7aCLMU^;khFcXzn~9_
zf0wqNgdgPSJmy<46|pOX-{SOTiBdU&KX6$~L*X6)uP5M(R_gGy(@}m;Qg*~)GvQLg
z1EggKL&PL!ITH0~#$HjUHr*FxRn9?K`U05!fuaRYHh8Zg32#0|h$(Lu=bEHmeR{EU
z=&s=a&08=|4bj1XVO(v@EK0pX;!!KuC9d70>NEAMvjcJsp}WpAzx|fjw6EbUq4^qc
z?R8(5S>#vz*()>{Tj_gva#D`SWc{h?n*qJM8dcWo2UjKnRuPTKKl-m0Nj{*8h5%O}
z@3R>{1Ig-E7@pT*N2)M?GmuYdA=+**EV@mBs29EEl2HKmpKFJNyKR&+A3HgjAh`5R
zQNlW!P<Te<g5y|Q8<|)TF(8!$#SiuP=*!HvedG(7UW`zqjT~U;#KP6*Taqwy73;WA
zfTBQ-@Ntpt<B1Ki`Kjn`^i<WEJb!Dk*C|tjH~h!%!Wo+wf$hl?UFDQWu~<!{Y1kt@
zkx<6Lk{Xpn?qGhJy6OTHgIbLAH}_&_&&zGNk2m8pt8r>!>+}qfkrkTiB(LWjfdX&G
zmu3BBh&q9+?7kw}@g{kKBgzVcU!-55HgfO!CW<pD_^p#n)sk!!nJsNi5*DUa6BIaA
z5yPn}Vw914B~r;I>wxPSWEiyNgHHvv^vt*mtw{`d@Ia+96(O&d!VC6b(t3e_R2!{b
zpJ#a@8BRljC()i3iX?orexyifv=1g}kp_Oh(b|{JN9!WNjVnf}*wTb~olYf^+x1B<
zQ&Fxr+7DJQC0<j5sAKZohfbj+uPs9&i9I^5R*c0gr}mO=_3lQ_e(<bN73u_50}_IS
z+We8J_a}ij^aQ%*0^0x@siiPpE8<kOa<l@;92^~k%Ak4!(7AT1L^ln-FM@V4EKz(I
zNKWjFPQ0m^hX@;EOxcWv_m_RO_hp^#lGnxu86@QjIdVV>k1IM|tyGI7HC|-pThY`=
zCxn_{4|^i|YjtPBtVP(i+KIx_T&Q|bKIu4%qnw47)9xKjCP(l&S%m$)8rMYE`i);x
zZaJ`#kOK$t*nx73W3Avz3)Z55748eB05$Dxoj*_P2s@h*7USNFjHT9vj@Y)zUo~A@
zsLVDBtJH|N0H=4a(RXPweLrw2Ds{@Sx(1i90Jvq2qM!v<@=?LkGxqzya75l8*a;A_
zw>$O`jifCgOoXA08_N}!jEbV@l7N5fBTaa~5^4rV^sYziKo_3guB&xCh+|I{%xLg;
z-nucQ<sRsjh*-IRiZ&zgYJ~xav1psq9ybt@^1PNsd5gm~VsV|r-uHu?APTCzURS^f
zL(tVGm#jQAfUvMLDzWMPK`6r2H6rZ(qxl+u_%g`2dqhxG{lhVez1ioL?4C~g9cJG)
zeYo!`E2VBxhNf-JR{>hyF<;+@%T-HkL)XIyD2t&vT<FGH%;dCHjqC**1?HU2ks~ov
zS72K2+hk6}v(fhw%??ft!1`u?LsZ_WmlE9KF$X%td5Zt$hA^hS$K%R$DN?E{-#DCO
z1!+vFtWGi1fBa2DI|glri<Hc`3*@-Io{7<mjXe^W)Xb*AGV1f=r3URg3Wp8?`zxso
zi9LM(z6>B+w4swVIuFDLjQ4&1a`xPe`u#{F;KEuwoSlA?FJDX3jxLl+TrzmjA}B6x
zL53CVi5;yQ`Ko~t*;`876*i;=aAVT4KN|Hu*~G&!FbtscTt<IBrw-R~p=heMx)~bx
zwQomnC7JvRl)cN0(9pCp5}Sjc)$f>(gTup8Rrbi%q<vB6=*kKU7O%U0k+UHvhi+T^
z?rRK*Jh&f*&DbjT2m~(~ZV?@r2&V4st>ClwBv5t^5E8!ZGOqu04|xs%o%jyXD65|8
zko>wM=24PgNa|-0opXnn=+ffk5p-!~Jz*j$FV&r#o#JOHDDV$G#3eS1Z@Z;%DKCN7
zRrqKK68fV;Qs%+KFB6;TR@N1W8-B#l($2<q(|0;PwKeZG8AyG<<51S_nsH7TQ&%uQ
z7CqKu4M`o4vGs8RAvWHbd=bEkw#8P&HZ=`jL#Pv!l0s}xIHInd(z8gBQACUjNCO&q
zBB~20B`9K~4}wyb7>sv1@tuc>l?VM&Ouf<_FMCfFgSj#vpJITQM2+@PwDb&?d5bX^
zl$Fas6lu#1K(|p>@M@fSV=32C=f`^gC>ErjdxL}Rlkox4Tgiht9p~ET%-AyCigkj7
zzSU#MUR!aDfe%*OUw^nycHMux^HC~Ce`wh%2$rhCg3T+rxxTs@>NSCgheA%D?kuSG
zd4Ojzj|&Pu``Hh(J%Ahxlu!;EQ_WkhVn6s>w=4J3GVw?~INfrXt{lX?Fm}iTWZeUL
zw2Gne?pM~z5qxYBJg|4>XQoFvVJbu#NDd*XRsQ9D{uS{D@U+S8&)$VzR&<NOQs!r9
zylYI@gB#Mt@DadydOSMX2-o8srj>$%y5o21QqU`AR1w@gABeb}7=0)yF|ARPY6ael
zfG56U8K3bG2|m=+v&q1>g5rH{+`6(XP=gn|07sdnQmN+GoFy0{?8D^(F$Y(kb*B;O
z#lOB5<fLcKszXz2K!0qNyppmv%SI|NE4a-_=LOfQ{qjYN?ZGWqducPs-rm6|15a-x
zor{Z6m~;SoJI;0$g2s!+f0<^QAdoH=z5WX&HQjl>ey)vfSJJ~tE@Axx@PMTe543S!
zXx!8gU~gB++_omm<Mu-Rk!Grq(U)@%^4QpeQn<l+fVKwil|zHT`HX>LoaC@6uL^ZF
z4a1%3+(i5^z01u{mwt;=ZZ9#6-=lfhJhcNH=Aw5B6sk~giSu!lENV(<h<9H2>jJ&H
z+mQH+&rZ%|sx2fdcMilu5hPwxIqAC8xDkR{tE6Iqvq%H@+u_HjK9QV3ssnKvvp`2_
zWIO)xsHQY+m!GJ-<Yn8;I4`+l1(YGo_@xA5+K^sW&;6=Gt2!|FC(hyZMGOeMBmE9Z
zm*nv;SRUPlnjSSbb_=6w<p?$zVFhL#M=*(U_q#o#wpR0XXbddM<D<{(Un(=SzI>Z}
z>O%oYdUK=*Xm{U#%?&L!QPmBRc7!QO86Pb+VfKZwp;MG(?G;jBN)4~J0#qDtTflsE
zOqBgt)a7G003BVs!GC75Rc{s9724H`>0}mK>7u3{r)RT)>djXp3?d9Nu@A!t_Pu{d
z%~J7-%VaB=(c+y5YU8V!r=jN^3GG@F6Ew>iV!%~XHp330?L;T^#GbNDGlH?Wf1TFk
z04@obKYNE_)TvYDCsmt`ay)qlNZ_@0<6Ve}d?3<~!|X_W`9%Si5uHJ0qN5W+61g7X
z@Gazi@Y!bxB4~WlHh%5l6}C%i?iFIFpCx(SDKc<Tj088sr@z#2;&>B1PizBH7{fr|
zIch#*$Llf9bmUQ(UwO->3s<ez&IrM{k<huZGA7%uFpb$ZNM<yArD}V7vX$f<D>mG9
z#n_50Li3q3)RL28$LK@@PdmGg_;@iWJ5}c}Xf$sES-3n{fPsIOP`DUk5e=2eKyW!+
z_EtM*^F@KUYF4v7%!PJER}0>^V!_tEW`==c0-}W`N7E1)DK=vKn0r=Gu#j=*n|Obq
z_%*qLZeVwjUv99Y^3Y{Df^5Dp#QFl7kgntx^^Y(N>T`;VZVsm`HHB-~zMpK6`5bMh
z+UZ5Zqse5^gw;+)2_w}&ay!CkzZ0t3m%!8?S;5HD=R$?Ewi#mMB9~_PB!;(8#((fg
z!$bCfJDrA$_<GD{fR5MN@Zw0Kn2eGcci&Ha5mYK|Ay|#216SlLN?_WzWLeNUz+{;u
zVpgg+|CO9&p2YEv2Oq9yl1r0|2}8v#B9LU3G{jajn*u40Nj*%o0t|w+A1h3ZTG_Lb
zSYG|%${7ga<r3HSY<fSwl&jOtO2aCllcD%Bg66btbHyw4{EI$o%VuCGI7{<OMDPla
zH`-49h<$ed!U|G#rYHwI<yd@m>M*<FPWT*Y{Zau7@nw93&z-FL7nM|qs=8etUQ;^i
z?hmzr1&5{lY(Rw=;!?`{i{FQk{Y#9xGY6P5ss~)1mGe2__Wkd%Ai=C#P6xU|+9oI@
zcYxE3xAAEPnyA)-$tc%2?bCf_*MzuXwN9^!h-9uGPRKni-2fQm$1~3Hh&U?jw*6TX
z3fZxI1p*X>m7^ojYnnuj`WbYkj<Z40I6fiwmRU}-K((dl3QT$pJCO`3Fdd~;@*iCS
zm?wM~8}6nGaAPj+^Z7&}uOPej!pD&~of{&rk>cYkS*Yssn$89HZG<Q(sr{=t6f|X2
z7F;0Y;?;-F^eCb{E!x%rMJ0|0Ua`q&d`WmbNCV&Uz;QE{nn*k2<SS8yOOUZyI|&#~
z?l9UZp>UCsrR{7_coCc;Flb^c0T!xHetfreyTu&x(Ru}7ph}b9l48ZOC3{IyqK&Bo
zCpdL)NfNB;MC^)oz54P*N8S3G6_=0H6J+(CUA@XYQ?WLYPdDdo`y+@>CZT3N5N&UA
zsC-T+9qwvTB9)kj>UgjyxMcc<d+X1HVc^YPa1o6yO(Wv69o8{KcWZ7FR&U*Gd|Z8Q
z*!gs?O2gB<by$dXI={gttD^T6vM|()>k6V>gRRZUW~g3g?X)0xD0HYbgVM_Mgezhx
zbg4C9?tS3bn{8`94Zrn7zjasNq2+SfJE9!w91D**I?pE7ozLf|#F}}z;Y+D_u9b0^
zgw+oET)l^w{l3*H2VnT>B_=6%F{4;(*8@3n9;%DVR@Uz?5}=%<7GLU<U7wU$GIuFi
zF%g`^LEGMrSK=Y)A?V$$^^?4l_;6xC!q(g_(b|E9wcVg#Wjiv5ynJelwCJezejx$M
zw9@{j;7k*+;jnbzVA%AtNnyoU2@SRq`C|J=IL4l%0;@+9lu2MoBZ^qSfk<Ir9v||@
zPy}wRNhFD54yZEfL>lyLyTurz;<Z+YOyg3KG9nVilh*MRZoKqHbmk_S>t{|$HT3mX
zh^FtLYaZ$Pj(+v~^qcP&aw5_BZcn{WWMVGI>=2Mb5KP|@OeKuWhvB!XMG28Ig0WP1
z=WB%bWm8cdj0=9D2RcHPrMD7R2BuTcr#>IpP^Dhw-Z8FW9NjCKd62k_gvf!&p)rV{
zNt{I6Mrv<{$ZVCuu3<uE2&9v^5TZ0v?U{}5RsOzHnK5^bh%L-Q!+#z@q{X=Ue&`HM
z&(c0(OA$mBWF?VIC_F;1PsNVq{SYqwP!1j@F!o~bbZJ>56s~jpqcKUZT;|Ae%(mQg
zZ&+4Hhpyjbqq(5az2LpzN1C25D;Lha12We;gg$5Q_@;$I`Mg`dL=Owwm`$5$P2j$u
zwl@Yfw)SB;g`PBt<dmXSJ8bfm@WY;Vt!b7B<q75Wb87c3D-SMzB(7DQ8*mhCoKuWn
z=ica0hc)b5yGbolY!2xjdPL;H)R;v-Y68o1p!0Gm5`%W5=^U^c3d#fb$@5(Pw7RtP
zbd~uk#lw=Dj^}Nst5tork~i1Op`8}Ss;Y{8ns|K6{9yCojlkm=p1FT{MCfra5P3Iy
zk(<h-%wlKQ*KG7~^iIE0uI@H<-0|@}_4geNEq3?EZCR!Xg{E{Cv4U%T?Z@5bN)nJ+
zEsc2wV)yagWM!M1F;O1ntFcS&jOw<?FKQd(;q{UP+%N+=(j>suvD5RY7Wi8oWAu~}
zU<GOWWRumXku<8I;l=e!D;-CwV}2{Q&_(8x^{9molK0w+IvQ;QR8Z+bjh36^VyEh$
z_2wOe5@^a?r_N_b3%#xW4G>lO$q7QldW&=MU4jqROkMh=lfjruI%dam3StMQJ6VXq
zR8RsucKK@@!Q_YU6GX?3>?oq^75SBQ@XN`Zc-s|A;OBWI)3fDN0vLi2ge91@2<74r
z74l;Z#lma)wf3f#@F=Y1lBWB%k82xHxN;IhI!y{k4oyp2@=TAu8`j%;K_&!5YSjy{
zdq!&Z%ZRtwvXjZ0jsokm@TNw!nTBX8?Vh2b4JL((XYz$5!7t#1@}-NrT>kMYFeuNN
z+DT$A%g4>$3#Y6B!VmV9?7nFTnX)s6tHliMG1vRZV;t8oTrDBqQB36WE_4LdtY;7^
zd28=zT9R5p$6(@U^JMG^7_=elgKKu)9>fBFj<_4(Zy5n$*D8n^F%Lk{2$Cql{DQ1>
z95?xuWdKooUXn)5QrNqqRW_?<vY_1x_s%D4aWi0l6(5APSjb2WY%9el>bgze^hMJe
z%k!a~c^=NM)tkbLV`HK#AYW@Ob-cNOnOU%+Oh}m%k~11wOn!>%##_0&VISiuP#UQ5
zY?SK=1=5XXhpXhrsk_7QgLeR?0Q0b8Lp4T&)(zBekKT9TGt1_4E9>|lC33W_RwdGJ
z7;clC<P!JXzn$x7WA5b7cSlFr*`4^>-+xWIf6chv60;1+rHI<mpHxqiZq9p1&J;{X
z>N`AWRU;pzLvq6T@}}c23VVLBfaZW4Qvxw@%{E}7m2vLr<fI1PuS#XH@m_(hF0fNk
z2Rffp>ZB;1!8Ib8RK{HcsKYjo3_{kjZ(H=05Yq-&FOcWF^*p067@II-@3vH&Uj04p
z70(T7)wjQtQ?894T>ED?SL{Mgvc5p^I1P|e-}xB2@uDTfZKQtn+{3%_)N^-PUMO7r
zZeJ|_v{IPaN|LI1aZ@aCApMTJ)8nLv!}+DBG2?^6g$_MI`yc5&&se|f+ED>)H8te`
z)(j1Vou!o)!anP5rOhVs#b;FZ{ruLu#yp4$nyL(Urgm#3gW7fVC{^?rbKV%PLS+#|
zQj+tTbd_iw2c-%d`H21y#XRjHR76jETO3?_)z-8z+#1PGTcdf533GrC>CI2!79`!|
zy=b2bm58v2&sNQtWEsZ{;5qf=3<_jCtkB9cDh!GPocU#Y6OGvh6Vf&wChyT2JyN%=
zACS^rv(#2+6DB3%HjIl*XZf~-8ylLLtSwiSH&z$s+Re4VHpCsaUShwvV$n?UJG%;B
zn7K}DEx*0n&D2p`+bUH20R;IRmZ-sNsk^3o44I=CLuqI07+~XpX)}qDfu=UMezsq$
zFJgm9h(F?;Xjls&WOmwB^bmj1q=3`>g+z-k{pFnKAV#7^?d#jI%CoNlX9GP#_AG2A
z;-UBkC<y8fgO>H>u*Aum@ReskZwIKQ**V60tg(LVE7;n<?4-b@sa2hm^4TX7wrk<^
z;eL`%Z16E4gNo-vx-q$x8W75v*>YmV$=;^!mB*I`>QqM)7OyS}TH=*;97gox|3E1#
z0CPcqwuD62ca3N!qj4w}t5ldTp`MrRw`H@<g0@sto-?@Zi(Pko*6589tS~#{6?mz@
z6q1rA+<E{TENvhhDV7pj>PH(Zh0GJ7u)?=EL@#tLn|W+P<ua1%aXB4b`(asdmRNMQ
zW@~DKmzcD)<rA(-3vnMuQ4E;`4bd0u8A(@J6AYp7>T!bGz9|mTeP6z`W$Rdg)|t1*
zuBggJucJh=6zTRMKYE812jo*5q^yf~o}JH&Kr=pzo*|v<tLv#IO6|prV3x?GDj+W{
zaWN6bT3z)QxUwe`-UZ<*x3JS<$AwXH)^Y&i5>%YrtvDxeH%K+yhAfH<QDf-)R(_%=
z1YBHb<8~pMnSpLUiPxZ~%>oP@X~M~k;)F5)TNleEU1krK)=ywAh3JTpU@3DPBC@(}
zl(O#^Pb<lwh*EV>m5>4t5%AG<Mqbh-csaw)D(a~S6Sa@OqFR=c;}SiFS%FT7MM$P2
z9pahNJbgRZ(9KI2KW|jmq;J@K3Ncu2gYr=anlWD7;Mna`ihQOFHX`r4d)SLAKYq5D
zY(tqAJ4unH)>C3BI~yGrF04dXrBjyS#+4YF0{8|y)FdICL?8?Q5xWRd_%b><+aQ-S
z5NWR)I7+4k3B9m>#SEbp30mSNLhI$k6jNC@hFn(S^tpKq)Y{^YK9WjRDp+kwYO~IE
z3By=(#2@;hfYUy;-~$>(Q)<j>?medQ!Y{^XM%Cr%0g&0KmY29``?JPBr)a<<H@nC0
zoe9*DHw=?opgn!*1+GA2qt>^fw}sL1`MxS)#bc4eP@uWt;?X^tnAbackh)Q2c0K5y
z_+==8r^BOg=zj=r6NyGwL6+@f_fs3sz#$BqVm-XX_oldC9Ro3a`X->YYq?v5ZAIT*
zLo`D)v$+WwqfouC-l1Hv;PN9InhWX*>S_l1oVKwE&35YS)x_PxoY@bf@QsjVU5GH=
zwDAC^cb+(PlXym8|CGE{Rk66W;@%<u!a0N^Di$FcAv%#@W<D<W{>y?{7>mpZf|)od
zb5)(0253W(lWya8Cfa?Q0n4U?fLNRo_~Hgp^YXVHt9==BddxU^fX{Pf2sSMKYYSk*
z)udRmT5k~`GQTtgUuq&g8AZ2|$9nGFa66}f0*Jp!RuSC$$Ft@8zy-G!0vnCF`P0F+
z56QP-Uaip)?rj8RViM89fea}Yj=-fya&+DE;?t#JO)d4xqB|yP&=MNL>*(5cqv&5z
zXhSGyR@59L+A>p7nUaY{m~YyOo(&y>=YJ%%k(uHuK89mxJ9k#OnF`Bh3P-Ke3~fvi
zYZhC4dylwI1vx#wz#?FG_3jzD(lp%$p#bI2ca6#|i(x0=Gl&=n?^YOnyJudu*hoSk
zTys~nCRK_c7cj!6Z!Zo|8nj}t0w0ynm9H|~FMt=&xarzyr%JC-%W8KnFQJ%<XzPt}
zXYJHmRFBL*Y+%>ffGhA4GGua>XQJ2PZTVbJp${jq;=K(>RWCL*Y)b4hql7a!UCa#!
z8C)H2Zcv-G+FPN7h%F$ZZp;AzQ{Fjmyh#Tma*tt}W6x{E0M}h=y57siq>|1-i1U~-
zBL#11<yhld{l29hXIJzwQ9OOD+pNVe5-jc=jIx`&TI~UOVA7{?TUa(t3+c?)Z$@8-
zXJuk$6>Dl<kOs3_CwsNCEF<Ow6N_y(&?>ECcN9va(fQ(NF(jPOCC<7^A?37NFlX#5
zny(sH*hy}JPLiL9Ig`~d6rD-Nw2eK0Wi=}pRMk=8BuoxO&VW%w)hAS-Y`lbBP|+YD
zvH>O$3R1(chdoZ6p)W|Y>+RCcGZr<T@}+fpW^R}y&c$_Oqpzye3>7s-Bu*6H-c6}U
z4x2|e?CG}>wu8z;-yhCBtS_|0E}oV;%}+*KQ=a{`#cM@bh@PaqlA#8rq$UD{p8?IW
zQp8A~&!!tNOG=)(&cyjcPm=Edh|G9Vil=G7{L?WL4_8pfuv~y`Tt;HqBSBKFo;Fm5
z2tuEZ%&nex@<^mvGEmF=iJH9;)aVoaf+DivTQjykS{dQIy~^sz)u1Te?CGc|UitVC
z7-~s87I)AJ9PJoERVfp_-KI5$gzxdj2l6nc4qKdwSRlN`0Z!;hQhj`U{Hg^wbTZ3P
z-%g=WSBoio!YOQ|;iN(R_D*-@E;a%z+!ETjk#=@eVIuumscwpi!rzWrFPgw6d?)tX
zg=IY#35l`g>F#73ZyFYezo+TmRl#oWz>85_*y2?3pEs*G2Fgok$r)4&xe4fG2caqr
zu6oz-k{XE<UEDFoUNrso_Q#Ow)X6b*{;cd8DR_Jn4wL@igKwZY)JbXb?zjf*(<odx
ze((~((ahhX$1ZIo{R^vYo$#gRmI4~?i#n-;Agd+}pBaRnq2U+(bEgY=xG)22dIYD4
zFO?dH)r1y48-jR-H0+FVM4BmWlBCYZh9}o4t|m%aw{V#&elL&iCZ=y1)2_GPUV`qf
z@kr;-IQ<%l8d{`gFb5Nu|4?Szpds3?5!QJ#T~BFZi4UZ!xGbj_kHZ=&2Zam$)wfKG
z4@`LJ1Io$EV5eD`xtSSP@XXmkRj5}A@n9Q-t>|Y+=I>-vgf$i)N-~5*QBd~K1aY8L
z^QzcCqEhBn<e-OEeMjuESYXW|JIT3hD@LbOL=~nNl11gSXW^%KjF4sW5S?pR@~1$^
z*C2IXfb$fgMhKaq$~pI#29E|OG=KaS*6ih$E{^OYTf70;#jVqk8*9L6N*hual%J|&
z+^HZ|MnsQMbbb6lTJe<IJVwKCasG7FNGje<P3hjUDOcso5A8nc7C37||8e|2He6<g
zD)~^@%%VxIIyhu-L^6%<=ujq`;YJJB7ye2s!@O!>NU~d_Z|8K?G0mjGV#cTk=Y;Ew
zMoB0WH4En)D(3C+mQU+6Qz7K<=D2g7KgHWBWD0Er2M%$3yHDLtzYq+Lu-M7{ks4vZ
zGcALDVx|OU$sB*Xc^Ph30&CdBG_BJKUue@gXh5|Fuou)2T<0uWjTbn&hdG}c?)1J;
z(;K83NU`AOSs%B+0ci4W3T0KJNNX=4W_5mZJvln+MHzTjqKx<0snPcZPOqgtV(Zhh
zA<)MQTt=}I#yd<g)ktti0QgkRQx!6fjG*_maI<TbhzVn$f3wv+3IIE&H_xBv2p>w%
zRx=DpzYV`DQ8S#0Hn$h;Dqgm8mkkqAbkq9GMb<}YUqa6?rf4k+Cx`5cVCm@Hs6q1e
zF8Q6dadZsd$2c*AQqI!=3&gjTLaAXFLqd__Wx{&{htEM%HrSh_HHw=hRt_I(w5YZc
zZnPKLOCt9v^hcU?W7wxpo+4&!u9&txgTh#pgRrtztWv2Yil{}nfOA{gha_#-hkmwj
zGj>G>RHVfl5X8}#&#y45qu$;z9`zGx`PSDU5At_4IO4J?66Fs0`IZj3vPlv_n(eBr
z6BC_-Yp1?r>lU62&1!MDZFA4ueBbfV#sf8?Qo=Rl(_*ONEM7Q*I)s6RUX>;$#Y<93
z>5~p3O{oXmqQ=US-HEabDoYshLVtvxNWfn%1*18$CQ-mhPv51(NviPG8L`7>4$I5~
zR#3vLP8*iAQ1yulXl+nJJ|Nl?tiZybfx;vdkAj%#CViTMPusnoQDp)jU1$03#T_kF
zJe0&~-?jkivMwJ#gzTQ9B5XO6$<;EVU^nQ4s)hOp_sWXHkP0Gw5QQph?^iy;Zr!P4
zEe?7td!ezrcWuN_qdQAS(@<JvEuAPGhuP&-fu~;cv|dnQ)$FD%56+CO=M=IvUm*UK
zqO5W}S>m19<r4*Y__oaVBPeEu_k0$EkR~pNDl=%tHHi;chiw3ZN#gGGEhcK)h!k?T
zMFttZGRLXzeTKnK&dlJ02|^@;*HXPLqHsn;g9$QOTMD0H-o2dg7FaBhY-c{WSk_=S
zR>+0sQ*0U(k6eu|l^j?>WoUKcJFEaLyEMpXcIzqVSz3CUdcKm#MjIx6hkXmW9$_ad
zE^rze-u_Up9VdPR)m)8f=@-n2EVB?XQpxYaf}Y_O;V7-ykX}U7PFyazqwGBrjsb*|
z_n}rC-s*HvDXA0<2Sh5=BZ=uS)$GB!a>7QZEC+2M;U+I+7p<r3rEzirUZyAoWVs;r
z%gpMlwtGt0HN}O6%*F@2&E2!ig6!C!_mIQDuq+!OQXi+fQ6l;JQe3YiKO=Q0Y^l07
zXZp0Gxa#Qg2zuwdYRO^yG99`ZBYaCi4>`(~{cyd1r#8tv!FR<hs%V&x6ZCrLIeBK~
zqE9m)Hc}p5UdUj%ID_h-MZ2{!8Cg;&#%zJrUExwQQF1Q$z+cGYp^Tg7VR{o*eDK|C
zv9|YR8c#FO?BNS@)@CNBZD*?7tlZ3a^sb_qhCf_1EJE4DXRKBMm76+GcBgtzfyR2z
zW-MT)5AR1c9IgyIEAA)5FYM31J|&&`zKTf#wf3d%2DXSD`l!_sQ*E$Q6X0CYTt{P)
zXL>%r{=4J^Ge1m-KQI6Q0xAFi+}|Z9Z0(%Q^lg9C?AZUbDz{1t|BMp3*}do&ML=te
z56mDjDwD)bS(9cIMXVeHdHt30A)zmk1U``dR6a-J!D%vM-6*~`G%*N$GU(md;gDT~
z`7AGT4Z!Tfkegc)E{ueL2rn=%y3qS86v(TcCo?wW%XoZkK`SSQ!Z2S9*^T^NEb&4^
zOsMz}ZIF)#{3A|ADnQgSAboV92#OO?-ylTl_iQPFp2&kE@k4d+zd<8WX?<91_znua
z3F_}p5O5gNO~kKIFsX)(;6GIKY3mcR(Wu5$LKrxmAv|O@Jwh}z4Ny`7ZG;wZ2<@Ik
zl?*N=y|W;BoXcAEtR;d9;PnRB7AgSyGoWuG83<%FKg|QqWa0zGa}qs;e>FA`ARTTp
z67NNruDnXt(kDXvy-%O3wAI6kKe>*~L`5MC#GhlpD}Io_o8uV9-ci=JU+<wuNM)vs
z94i^XN-Q-~_E%RmffDN&C>hY{>Q3`@%GQ<8VCf!m>x#)@WIz@ya<Vj#>`eyr_OYzH
zaGTeO9TS=kHLH0|D?rgM&$kn*U96lPGc#qJy>Q643iq<$C>$VkM3+jGV2-n^lzO6R
zFNx9gjnPybzLT+2BX7Zzm^PQ0KIIpxjL&hT4Vjqh<iwaGdxSxLh9!F}M}DRz-?8&1
z&3OT?o3WFoXjh_Wha(%}@;z?^+kEwsbG5wKBsR3ue8<J&h<-=aEB5tEd-sM8!MT_;
zlffyI(wTX6rH&aKFf}rET#zMHg<IO^M+N1!YsPji9*I%-8}%r2i3^;;u+retC0Cae
z*Fzxi!$F|wkN)$j5Dt~+l9#^Z<Y6ubBNOYVkd_AQ`Vw*^(_Byp%D~3YMsQa+U^I<(
zLK;~RTD$nXcH$&+YgB_ElcSBng2VXugVk$=EKs*_B_pfB;-~9MW-TO8-`P(D^W9%L
z`yLdKfr~sX=khQph!qu$*pKH$@R;^@xThUKqC^r%Nj=+`)tVi8qxBtSrHOA$ltN(w
zHnhuXYHiu-iCb!QIP_d9-i~fr2rLJ9RxjI}g|{nBdDxbgd%qZ_E9q-W8lRo#O)lhr
zS2w>^wsbPD-)S-S9+Mer3Yo^+15qKgpI+mRy;QKIN(Fw(tm00a7nZQ(L|3+C@bD$`
zK+C~?lv*JPncal%w^b9e<}N4G^d-~$@iFQ;MMyhA$k)b@9S<m3%-2BI-uBLWk_k`X
zzh~@!R_=j(o3U^8VPtG9t$r`cBMRd2x%sUqkK~WZ{p;6sEp7CFOzrqsNsG6s?O#bh
zmBSGn6U!bzL1`q{L^cHgddw)~2*MT(8H>ypaq6@vaP7w>D%!7PMwCij?0hvoiN&-i
zE!>OhF4LZFs3>dTrF1Q~Y>}}}gZ4n>%ey94;Tn2n7D>=*fwgTi?fP`!8km6$7KDvu
zvVE`|GBh6sTFu`pfPlgbhUaR@+Fy8{w5v1Dp%+_|M(^84Oge!`>-e6I13w=<+G6(x
zAj&6O*f#;FG$f?m2(lg0k_0*$DPOkE@xI_19eZAyFT7{nLivOm%=n~Jhv&S*15bE7
zUNRLisXiQ;iF{^h1ApMO=i>}#*XE`teuabuNH-9hIcE=t#Txj1b`0Q!b1o6H*_age
zc5cnx`=e2K9uG;VU_PgPGQ^eD4c17?IzDp*=sVNALt;U2Kwa$&g{gdYUal#q?Xv#*
zQ%6kwz$~ilg_6)OUR#WXd!Ol$c|Igb5s=Jiv?YJ^zckp~SY5O6t<kR36+N@&2wPS1
z)p276|DF*j0BL=CQ!6?+)Z2erxKC(Gy04d#nCN`tp(1rWYOFCHQhGFBIk$na3u*!@
z`NU%Hz|Y0Ml6`fhmGun%w>tb;Ge|2mKcE;003a0}008lC18!k&uA^^bY+?9A(DR>$
ze+c>+#hW*>8g=xY9~s{&zJI6%?<`yv@e0Wp3=wgE<A&>Dc{3{4b?+lFvA(!w&0K;h
zFWdCqSlb<IFM)Zc)`r~9o(RR53iMM1ICdXmh7NOhcVg<-S^}%M04je!24r7Qxvk5c
zwxCbKgCdwLfh5yLBTas~I@nM=c2J#PkbB^~b*GX;prZRe7t{x-D-$yaW*^I>iB#MR
zIwDGGa3^npG=nA<V>xvwdmzS;ijeny!>qwBfr{ApxC=z5n`R}N%`DGmPabzAk%nOh
z$b_rwbJO$L^%*K)5I>eml`n1H$4sOSxel`gq8aX-elLU+Lr5YqtU|EQXiH+e>XT(I
zc?#VGMbuNV<X&8v_*UTH{FYcx5435Q`(EKFzF5LYrtoz;7ljVNagaNanXQr-S)ZBY
z8W|JT@QrIAAV@2Kj*GhWhDb&*6DBvTdEWt3g83tGm6FiLS_-?=*cS|Bh$Tw!i1-BB
zX*~L4vvJ|#<`%DZ2g2YnbK=vTO9vOWX_1Z?hzZ@TcrwQKn4JP_ge>p8p=Dr$Wt-5&
z(>fZNb<Df!JBUaJ08_h#6G2w$vXAz$Rm8r^Un<hR&`+pGEY@ja8~WjEyx+)LUAg{p
z=Gy+kT&x6{QnuDlY79Omr&~KphYq47%IIsRsyVaXVd%adU#q5b8geK7{$(|L=b6?l
zi<>q05swE>5%2yP?6~y<8*8`K60iGGOssAnv&^RH@GxYnWJ|+lOziAcirs0}hhuQb
z7KbIon`f)e3j0RQK2Pq%?khLy&KH-7$^CPAu0(Lu&ee^bLb7h;7;vQfp7XS#$GfY+
zk+$mN)_p0lk4F<E&?6tOC|B}8&55!%E(6SQ_AUJ5ZEpdbFIb)L_V(5XhkakA&n%wG
zt-hiCt=Ru*=fK_+`@dBl!^@3HIC%Tt#M=b;QKIb6@BCS4EM`&+m<~Qri+HS^V^v8o
z&zMH;P%@93SH!^4cN~vIzTDE1EJW#eMIbWy>0p#a0JT(LsaSbuM}CghTvZ0y+=lxW
zGo$hAFiu%QDCr7DaWMV8x`KJIP%nPWIYKB7S<+0NMD>PUuy3@>9&T+lO#x|I{pp8G
zX|Oc<`2pjt@Km3b4#O6|^p*)V7ZC?MvXSJsiUJhNR?<1!AF2iJR4gb^FWSaWY*IT(
z`+0TJ5PX^GB<U9|P^I_Pidr}qY}iN!{0<h?;c)zN-m_Hb+2As?G#5m#qls>3qTn!S
zi8)=;CF1=(WPjR%8ONta^S5DeeY>#Uu0M*p{W&H^`r3N>HgrGi!5_mT8O8?-L;x*R
z$=~7_LLEleJ7xj<`5FL52_M#9Mu)S5O5>&N?xoG+%7S~15yjdnjS3xAD?`-BchH?f
z{j<ga*7;muHk~5VdjxxU*Ggr<n`nk31&UGp${G3_I<+pb7fJ7LdZ!qxui#sk|14nt
zuUr~fTH5{pgv(|t?K+e<wF-j(0C+R4e~#rZqnMvCZj(w2Kd2>?D`45+MHC@!W{7DR
zbM%2h(pA~tY%VJ@8ibYWhTzf9Jto9*P><e=Mm1__^F_H8v#y2f0z^*%i=-d((H0Jf
zA}B0rsdD@7OW-o%hJ}(d`bW$rX@5vtp#|CoC4UGEMEQvEG=`k~Xbuk2nD@=Abo{c^
z_I^N<cO#_v#MuL^XQ-jjSf6&CGD%QSEUZbH1{d0oxpqEv=9wHELaE)x5|4GTH#8d2
z+Q7(v=)t5j1x2COCN5$Q55z@mv}sNx+E-YjC!Arl#>}Iolh?qMH@2X&z+e6-s>kVV
z4*4{CWnxuneI@~pO~_M{<|Ie{nsn$2Od*G_8cBe-F<=&FH71DWG_!zhUg60E3!erw
z_tLh*X_8zLF))$qjzaT6d`h;gWoWQC`Kr$?wAw@(pAFh~-d*{E8X>s4;d+YhX*!DX
z8so$Ftj|D0#OU|#bNxnhKqw_HS>Ek#KsF^VdES!lVlJf3{fu%`q#DxiHLgWRjq@})
z_FsIejMO7p0zud&-$!6LE~qoiqhF1rXYCZ0j%<sf9iRa^CvEG&Fs%-v=8QwthtA7z
zF(^c+$*U3~6=O~IuiIB$e{TNl!BYEDlf0bLXj|mJLRdLPCZVn*CNE$wynPTM=jaHa
zW#$8OfA@vPX@*`b7!(e9RYCp&($*f>qiV+m5LX^`8pwpwE}oCA<!k=|&%dW5<XI8#
z^qUJ-`)0oWQK4T{yggq0$HE*?NoZv$k0`eYPX!)43$_ee<O&dgDJszi9#9u|t+Rfo
zm!A=#)@P+thsAB!@en8wTsmr~0?L$yqMA>X5=aElZ%?8S9do`~SxelkUYSDVi0&k>
zA3Jan7dMgYCpRh{H8zpIRa8E)o@Od)pYaa1ZaQh2HLS-%PsX-t!Inj0ek=x%^tHek
zc*ODmli1cH1T-Ex)4|~2D1hx7KE<{sCx?0%dU^~_i5~A5317MR%sb4MX<+H#)z&qm
zj@T@9v}i&FO<$!3WP{9skH#Lg*aSD#L?$ZH#URuD&P+{w3-=aqoI)N89K$8b9%<<Q
zqzZXL61asTtYK^v(Go@OTW~nzrM=<Y?Qrm~khJ~I{mnw{46Z4xkq(6!Bndu_)d7T^
zQsWt<2WrF6QMZ9QEeE)(XVKu2a<vr1Yo2Iceih1gE2pU*WA^#hjGF465sBKSYect$
zO_(1B<HNbsbCCw>ABYxc&*1v-KXt8)witqflqY`;f0ey%8(}6k<?A*2uG1C)PWHrf
z*5+RW+j#MmFj2Zn^TCpygTMYWIz2eoJ<NNVWwLLt8BXv|j~F0}056bG6=#pe*aM-y
z6jNs{Q)eiuuVDWczr2hxQpKD2S>L>Lq&EwqCSYk{_f`v2OTpPn-&W&?DXv%Qe$xQx
zXO!MW0;QH-8E8?LrG?!UfjhtuL5>wF*{H+1=kIb3I$auR2B!855BK*Of>0O72g42h
zxcrQV3;T-oRPU^y>aE9Ho^GsKo2V_BO9Ktzs<-6O7WX5DJW&Wsa6VyBUT>in(|r=f
z4iI)Kei;Jm`?7x-Dsu>d{ekR;jHUEkX0#%IXEjQ?d@;Yec%!v|%7fF|r7Bj(I^o-D
zZ`=&&OY#+}zUi}H^v+e-z#Nir$Rxu~$3x#8tG2a80&%-#cub$a&iQpI8x<@=5m24G
z<QfLvE8=HH&Af}0q@7p00DE&}BZYK*G5^(;P<5Ecgj5MZ8Kj#<he0gNuz<TmPMs<4
zEFW~|W^=Xhp!fk87F!zN=ePvdPzCCNm|p9*6H?b}{G8^WyEceD)j|9LQK?P{qS&$r
z!Ok(l2~j-uycN+PPxWWqk3NP7tdPJw5;|L?+s{<gszE%tyDYs=n->fhfi<?+6=p4k
z%oxUi`^b1+3rzTu;_3v>SPJUT49I{QVRT@=C2X_Wv&M|QRYu}~&*V1?3wA>BJvM0;
zuS!<yx|2WkAOGafW$cslDq+l<SohwpAL9Kd4%4-_wX-z;A>Q-}KB%6z5No?MZh2J!
zjWFMsWoJdm9&g4N%gBJA<iqW)FbLe%<+t8J)0RZ75SUz65}3gnAE4pOEu)!>L(9N-
z$T83`2HN~E98h-VwvpJhB!O~zUV^=8az*io9JwrV2NKO;*zKau1ESwp*+Z2ZN<WGn
zD<~T3D~q1MZ4Y)YmNs6~uvI;`#Ts@7o)q80{98kJp3+xh-&p_QLj5pM|8zmy%Ie?z
z`<<AVyo;Wx!?HfjX@MKAgJEPKKoWO#eeVNq^>DwvTmOuTTFtK`U9?|ak`GX~$0?=C
z9k9Ah2x{m%LFJdV4@>pEqMbA(ss<btly|{qf{j_MI-LN$LTo@zSgd=Gem2htyWO91
zSs7sLSYgo+R1M5-3-%lMNaa%5gVtR1^V9B^s==M&v)?+97Can1@ixWE-z@Au!Oss1
zYw_ks{E@(#f2%jU`(v3VzHGOb15_1~Wb@)(gI{+C*kuj(Kdm++&9>}}?UQVwDuLp%
z4(>GYxXMgk`nu{^9;3uc(~V4|RzQRhAV5A|f$!<TL*)_O2W}u%&=ur!;!oO`YM-bK
z(9H|a&FAyY+0dEq7~V{(_)GoN7)|{iET@9@FrR!8#W3h;SG--k!4YjOY|{m9?>gNq
zy)>cJt28hrG_KMX<GUt1j*?_o(-WYh1keYHcxYF>LKbjn<UDJV$e@0rBFEv6eAcW^
zO}0A`)OTH2A0*HT<w%iD0Cr7{zc6-)_x3&l7)ujRn+O%4>M1AQY;9D-24XlxJlWXI
zm?(IpB;_00<$aXE{UUPDa=;gCFf-=eshE<xW3Z?NT*eJ_a#nM|$!>Ha5t}%jF}B)w
zOmmC&sd{*@_+hlq%+%l;m`MiDDKzVCi3~ymRd)&Lv!!H2W?J58??#S4S^^tt7s6VM
zJ|F7xXQRN$+OOc27=sW-T_yMz4u$R2G42zkrN-Z8t_f6-yr#xeq|SCpCmOXY9}hps
zgE|M9XbKWzcO(<C3-6k6M|?-Jh~dJS;$ui}=ax+Lg`8pRrL28Ua(P6ee`WNQS7cSa
zAJ_V~`u^(f;=HNvFYfNYbl`v6;pGvF2=9K=(wH|b{gI0KgHL@6y?>N+JZ{xu7au-A
ztAB^PJECQNBxE)eSE;=;WgR>VeXyttzw7qk)X@!MqCqi^nWdesFxm1YnXb(8#U#G2
z#~HE!f0>>*m=;^i*a7t6>bv&0*fp~S4QdHve2C1f_W#eCx!pKcHx>(7xH;Pe<r|2?
zIu7(!K+xns6RVZCMW3%jgt!-X`RE1^7}Pjzn6!@&>R97ac8U?3bB)dVV<N{^I4TJ=
zLNO8)lh?QUI$KPw{B3P_ue!n*qk6(>(B*l3t|r^EUTv_9_np}3b(ry)BnC$qI7bN6
zyV#JggPdg35-DsKiDI@2YbS$W07@GHT3gH5I<XlY3Wi6GA`h5q!tRkxO)jlXUg+so
zQ&if#Q@%uU07tZwDfoR5uR0ceRg29<BR#sTYBbvN*N*_T0%_D!i4tO7THe0ed+b&l
zpaa0HqJU^yGqx<#%e*|Mpw39;CNu?tZ&8^&<n}0dU_MdAYu7KqzspTBl!XjD&rNzF
zQA3a}JcS6&*#*Qwo;|#lEy0$ED!{-^I$f=sJ<q%ee3VxBobi49H^1*!z5J~n_ODX^
z-&RY>%#o4oH?_2VE8G2#TH5N{*}dhB{;-|-aqB<46MwD#DR;uwPodpKYIc+hH#$o~
zjo_b&mr)62eTY^^S0f{Kn|OGm8I@!BzHxla>;JmyxuDk$CCgFvsd_@CK!&xrx_D`m
zhet_E1r;d^WLHMftdy{|y%~3;NG?Z!?mfVO%|(umxmd>@XNZzYkZGxq)%3jahqV2Q
zih3D0E56InwjoCjyfhg)#6ynHaM7pMw_M}S{>Wt-RZ)iwjBET$b{;}@aH2e>xw8pN
z5A$KxC|`jRT>Fkc8%dCp3Vhd}fEYZKBEOTeQ_@}`zs!PmwnA4sO*C~&5bkyexW?1S
zX$-(om2qRM1iu3WQ$^jPyWNk|8{d1<)rrQNjz+K^Vjbu)9bXOx4G+C26uZi4>fbl^
z3T*_~Oj}F~++y#Xx^}MnxQ6{{x0Tzw-tuy^wEt?1vwrIcR8^Ln`4ET`%SQ0SXwmWV
zy;`mHjtEY>_bCf}YJiEiP2#{{)G^9N_S1)bY0#Jg73Mguq>%Z~IXbNjvvHpsq-OzA
zrBM^Z8Xyw#v|!@2q)O=K{E@`C#V&<nX|67R4>v*s*!ue4<T>ybvHl}ZT}$&H_1FKD
z=iIu@4n8{AmYOxIa?B@rmf1Se^wVpn)puE5J&aG#S<|gra>SyE=~6x)kBK&86BoOB
zXGIx9-!mmps`$FadxOS*LZH55a`OD%zUJd^FM>EnCTeO`fXd0mU5BE?FLQL#13aU2
zuK;|IR)JraB<p=5W@Q$q$kBLt$6vo{EkQt6bi+(l^8~9C5SB|ih4o;j2khBx4vhjS
zlhI8WS$tzw;-^sp(IW2SRwA;JC;F<Qo=k#LZI?+>u4|l1B2*dMg^<S>|FR>6;hpws
zd@F;0<vb-0XS#O}ZrQ})%0LP1+w>$MvRCNBS2k?6jSD4YR7=7q%fyPVqVXm1*$u6|
zWlW2w$1l0&Wpt>vC(we!vm6_9<(@jOF3fawF}dU+lnQ?^6V)17vZ-?Zq3?8tLZGFi
zrqXhSOrrb-hg+)zyNmo=<#Xsk%#$JAB+1izoM~x^;EU&Eg5V3w?Tf8E6XE>>B)BL5
zWCR%<_HSKq1Ggr;oF`Kg+enSVRwEnRdV6{P7=Q_^JqF}LhbRuULj(a-SfO^BMUs>0
z{YCHFJI&v>q1#XS4p`1=2I`i_U!f+&zcy5ynrpw%=8=0vdIJ8>OW<#&=2yx8Z<`w3
zQ_TgOH`Nq)+l=_dPqVO7(AF{gV>cpwdflQ29~t~YnP82wJ~`aDY=cVNp_2;$s9q4Z
zaC>X65wE<#bSHcr1^)7N-_7h!+3I8vq~?&aL)9P+qf$oAv+VwEaZslhL+Gg9?ohJd
zGT`2$u|1CIQG_a=D?W>2qi-pJKgtAJL{-Ps?XisR?UX-81dtwU`l>&_k$xTcSF^TY
z&QYkB&pnh-3-yWzIP(}`k?xuxyD>o-n5#2c8k*D8=yq5#7&&uvO|9TFkI^daTzsDu
zd3)HY<ryMFTqydLRXl<3z{|b>J|M}E*`^zs;Rc8d7wg_|7NX;nl1{vaBm?B-S{RT&
z&?-;s>&J!X8BnA+3zdI716vQ{jnDiT$%m^RGqa$giv3mFAyeJ4>O1|?6cAeL|Bc)8
z%aHzW2ptdz835|d#re+^?%N{sSNjoZ{Aho;K{De1Gvr4`_s{kN^8W4m#l!mv{3{$A
zg|UQtQ!#?KANny9em04>+h2fhL5}?ollV_6^mj^;L8elIIsgD27r=km(Vvv!-{}8A
z=~o)R!++mV{|SfC`wRYafBh%;KS%7x=H8#}NB3I{e~Hh2!v5_1@0)#p0|5ZGe}(<M
z_4hmc_dTSa@L}V>SpL_J(og(<a{cDx{?&ePon!WQQvN6APnLgA-T#exTfhFn{Gq|W
z1Aq0dek8enw)wZ*_b-C}C0YA_NyydWUxa>1+5J~Szo&|SaztSH7olHx`Mp2CX9<6z
z^{oFw|H8oU=--n}Khe6j|25_GJAvQR7(WT{+yA0*Ka(226Zk!e@RI<j!!HE>N-6w~
z{yl>Ki5_@E{~^tP#d5#nf7ON`G2qYE=ky;!{3qc0uVVZ?O#4a7%=s@;zq;@LCr<ut
zfYjgiV}8-gUy1xzMt*n9e=-8>_7|mJ-SpoX`P~ov37>WU3;wHb@H_n15&d^R20j1P
zk6(%W*M9uI>i=ZK!s{<e25%|Re|&)d6DR+cZL9acIQbWqzgPkN!w_KtU;>1^-Rb)L
H`0@V-Pdc9m

literal 0
HcmV?d00001

diff --git a/docs/qa/tool-audit-2026-Q2.md b/docs/qa/tool-audit-2026-Q2.md
new file mode 100644
index 00000000..5aa38874
--- /dev/null
+++ b/docs/qa/tool-audit-2026-Q2.md
@@ -0,0 +1,407 @@
+# QA Tool Audit — 2026 Q2
+
+## Summary
+
+The 10-tool QA pipeline is foundationally sound but uneven in documentation quality. Wiki tools (get_wiki_page, get_topic_overview) are well-specified; memory tools document latency and search semantics clearly. Graph tools lack concrete return-shape examples and have inconsistent branching behavior (dict vs list returns). External tools handle fallbacks gracefully. Priority: standardize graph tool docstrings and snapshot-test the list/dict branching in the citation decorator before Stream 3b refactors.
+
+## Scoring key
+
+- **Description clarity (0-5)**: How well the docstring tells the LLM when/why to use this tool, with concrete use cases
+- **Example count**: Number of explicit usage examples in the docstring (inline calls, mock returns, scenarios)
+- **Return-schema completeness (0-5)**: How thoroughly the return shape is specified (typed fields, optional keys, sentinel values)
+- **Latency profile**: Rough p50 from docstring or implementation (wiki <50ms cache; memory ~100-200ms Weaviate; graph ~500ms Neo4j; external ~1s Tavily)
+- **Known failure modes**: 1-3 documented or inferred edge cases
+
+## Scored tools table
+
+| Tool | Module | Desc (0-5) | Examples | Schema (0-5) | Latency | Failure modes |
+|------|--------|-----------|----------|-------------|---------|---------------|
+| get_wiki_page | wiki_tools | 4 | 0 | 4 | <50ms | stale cache sentinel; page_type validation; missing channel |
+| get_topic_overview | wiki_tools | 4 | 0 | 4 | <50ms | tier fallback logic; topic fuzzy-match; no clusters |
+| search_qa_history | memory_tools | 4 | 0 | 4 | <100ms | embedding failure fallback; qa_history_negative_filter; empty channel |
+| search_channel_facts | memory_tools | 5 | 0 | 5 | <200ms | hybrid search fallback; time_scope cutoff; MMR diversity tuning |
+| search_media_references | memory_tools | 4 | 0 | 4 | <200ms | media_type filtering; empty results; link/pdf detection |
+| get_recent_activity | memory_tools | 4 | 0 | 4 | <200ms | timestamp parsing; time window cutoff; topic optional filter |
+| search_relationships | graph_tools | 3 | 0 | 2 | ~500ms | returns dict vs list (branching); entity fuzzy-match; empty graph |
+| trace_decision_history | graph_tools | 3 | 0 | 2 | ~500ms | returns list vs dict (inconsistent); SUPERSEDES edge detection; exception path returns dict not list |
+| find_experts | graph_tools | 3 | 0 | 2 | ~500ms | list_relationships semantic scoring; _empty sentinel; token filtering |
+| search_external_knowledge | external_tools | 5 | 0 | 5 | ~1s | tavily_unavailable (env var); tavily not installed; search timeout |
+
+## Per-tool detailed findings
+
+### Wiki tools
+
+#### get_wiki_page
+
+**Docstring** (lines 17-28):
+```
+Retrieve a pre-compiled wiki page from MongoDB wiki_cache.
+
+Cost: $0. Target latency: <50ms (cache read only, no Weaviate/Neo4j queries).
+
+Args:
+    channel_id: The channel to look up.
+    page_type: One of: overview, faq, decisions, people, glossary, activity, topics.
+
+Returns:
+    Dict with page_type, content (markdown), and summary — or None if unavailable.
+```
+
+**What's good:**
+- Clear cost and latency expectations
+- Comprehensive page_type enumeration (7 types defined in SUPPORTED_PAGE_TYPES, line 11)
+- Graceful None return for missing pages
+- Smart fallback: detects stale activity sentinel and calls get_recent_activity (line 46-66)
+
+**What's weak:**
+- Docstring does not document the stale activity sentinel retry logic — callers won't know this tool can invoke memory_tools
+- Return dict keys undocumented: `content`, `summary`, `text` fields all exposed but not in docstring
+- `_cite_tool_output(kind="wiki_page")` decorator dependency not mentioned
+
+**Proposed improvements:**
+- Expand Returns section: `Dict with page_type, channel_id, content (markdown), summary (text), text (excerpt for citation decorator). For stale activity pages, returns synthesized result from get_recent_activity.`
+- Note that `text` field is the citation decorator's grounding field (line 75)
+
+#### get_topic_overview
+
+**Docstring** (lines 85-98):
+```
+Retrieve channel-level summary (Tier 0) or a topic cluster summary (Tier 1).
+
+Cost: $0 (cached). Target latency: <50ms.
+
+Args:
+    channel_id: The channel to look up.
+    topic_name: Optional topic to narrow to a matching Tier 1 cluster.
+
+Returns:
+    Dict with tier, summary, and metadata — or None if unavailable.
+```
+
+**What's good:**
+- Tier 0/1 distinction is clear and well-scoped
+- Explicit Tier 0 vs Tier 1 branching (lines 104-116 vs 118-140)
+- Fuzzy-match logic handles topic_name lookups gracefully (line 123)
+
+**What's weak:**
+- Return schema is sparse: mentions tier/summary/metadata but actual keys are tier, channel_id, page_type, summary, text, cluster_count, fact_count, slug, cluster_id, topic_tags, member_count
+- Fallback to clusters[0] when topic_name doesn't match (line 127) is not documented
+
+**Proposed improvements:**
+- Expand Returns: `Tier 0: {tier: "summary", channel_id, page_type: "overview", summary, text, cluster_count, fact_count}. Tier 1: {tier: "topic", channel_id, page_type: "topics", slug, cluster_id, summary, text, topic_tags, member_count}.`
+
+### Memory tools
+
+#### search_qa_history
+
+**Docstring** (lines 54-67):
+```
+Search past Q&A pairs semantically for similar questions in this channel.
+
+Cost: $0. Target latency: <100ms.
+
+Args:
+    channel_id: Scope search to this channel.
+    query: Search query.
+    limit: Max results.
+
+Returns:
+    List of past Q&A entries with question, answer, citations, timestamp.
+```
+
+**What's good:**
+- Cost/latency explicit
+- Embedding failure fallback documented in code (line 78) with bm25 fallback
+- QA_HISTORY_NEGATIVE_FILTER applied post-search (line 84)
+
+**What's weak:**
+- Return dict shape is vague ("question, answer, citations, timestamp") — actual shape from store.search_qa_history() is not described
+- Citation decorator dependency hidden
+
+**Proposed improvements:**
+- Note: `Returns list of dicts from QAHistoryStore.search_qa_history(), decorated by cite_tool_output(kind="qa_history"). If embedding fails, falls back to BM25. Results filtered by qa_history_negative_filter if configured.`
+
+#### search_channel_facts
+
+**Docstring** (lines 150-171):
+```
+BM25 keyword search over atomic facts (Weaviate Tier 2 / tier=atomic).
+
+Cost: ~$0.001. Target latency: <200ms.
+
+Results are MMR re-ranked (λ≈0.6) to improve diversity when multiple
+paraphrased queries hit the same top facts.
+
+Args:
+    channel_id: Scope to this channel.
+    query: Search query.
+    time_scope: "recent" (last 30 days) or "any".
+    limit: Max results.
+
+Returns:
+    Ranked facts with author, channel, timestamp, permalink, confidence.
+```
+
+**What's good:**
+- Excellent docstring: explains cost, latency, MMR algorithm (λ=0.6), and time_scope semantics
+- Hybrid search with vector fallback documented in code (lines 181-196)
+- Rich return dict: author, author_id, channel_id, channel_name, platform, timestamp, permalink, importance, confidence, fact_id, topic_tags, media_urls, link_urls (lines 212-230)
+
+**What's weak:**
+- Return docstring says "Ranked facts" but doesn't detail the 14 dict keys
+- MMR re-rank is described in docstring but not all keys returned by the internal store are documented
+
+**Proposed improvements:**
+- Expand Returns: `List[{text, author, author_id, channel_id, channel_name, platform, message_ts, timestamp (ISO), permalink, importance, confidence (0-1), fact_id, topic_tags, media_urls, link_urls}]. Over-fetched (k*3, capped 30), then MMR re-ranked down to limit.`
+
+#### search_media_references
+
+**Docstring** (lines 248-266):
+```
+Search for images, PDFs, and links shared in the channel.
+
+Cost: ~$0.001. Target latency: <200ms.
+
+Args:
+    channel_id: Scope to this channel.
+    query: Search query.
+    media_type: "image", "pdf", "link", or None for all.
+    limit: Max results.
+
+Returns:
+    Media items with URL, type, and surrounding message context.
+```
+
+**What's good:**
+- Clear media_type enum and None behavior
+- Hybrid search with fallback (lines 272-288)
+- Post-filter by media_type (lines 296-303)
+
+**What's weak:**
+- Return dict keys not listed (actual: text, media_urls, link_urls, link_titles, author, channel_id, channel_name, platform, message_ts, timestamp, media_type, fact_id)
+- PDF detection is heuristic-based (line 294: `.pdf` substring match) — not documented
+
+**Proposed improvements:**
+- Expand Returns: `List[{text, media_urls, link_urls, link_titles, author, channel_id, channel_name, platform, message_ts, timestamp (ISO), media_type, fact_id}]. Filtered by media_type if specified.`
+
+#### get_recent_activity
+
+**Docstring** (lines 328-346):
+```
+Return recent facts from the channel, optionally filtered by topic.
+
+Cost: $0. Target latency: <200ms.
+
+Args:
+    channel_id: Scope to this channel.
+    days: How many days back to look.
+    topic: Optional topic filter.
+    limit: Max results.
+
+Returns:
+    Facts from the last N days ordered by timestamp descending.
+```
+
+**What's good:**
+- Clear time window (days parameter, default 7)
+- Optional topic filter
+- Explicit sort order (timestamp descending, line 395)
+
+**What's weak:**
+- Return dict schema not documented (actual: text, author, author_id, channel_id, channel_name, platform, message_ts, timestamp, importance, topic_tags, fact_id; lines 379-391)
+- Hybrid search with fallback not obvious from docstring
+
+**Proposed improvements:**
+- Expand Returns: `List[{text, author, author_id, channel_id, channel_name, platform, message_ts, timestamp (ISO), importance, topic_tags, fact_id}] ordered by timestamp descending.`
+
+### Graph tools
+
+#### search_relationships
+
+**Docstring** (lines 14-30):
+```
+Traverse Neo4j graph for relationships between named entities.
+
+Cost: ~$0.005. Target latency: ~500ms.
+
+Args:
+    channel_id: Scope traversal context (used for logging/filtering).
+    entities: List of entity names to resolve and traverse from.
+    hops: Number of graph hops (default 2).
+
+Returns:
+    Dict with nodes, edges, and entities_searched.
+```
+
+**What's good:**
+- Clear hops semantics
+- Fuzzy-match fallback (line 42)
+
+**What's weak:**
+- Return type is documented as "Dict" but actual shape is inconsistent: returns dict on success (lines 87-97) but list[dict] on empty graph (line 80)
+- Empty-graph case returns `[{"_empty": True, ...}]` (a list), violating the docstring contract
+- Keys in success dict not itemized (actual: entities_searched, nodes, edges, text, subject_id, predicate, object_id, channel_id)
+- Node shape (`{name, type}`) and edge shape (`{source, target, type, confidence, context}`) not documented
+
+**Proposed improvements (non-breaking):**
+- Docstring: `Returns dict {entities_searched: List[str], nodes: List[{name, type}], edges: List[{source, target, type, confidence, context}], text (excerpt), subject_id, predicate, object_id, channel_id}. On empty graph, returns dict with empty nodes/edges lists (not a list).`
+- Consider removing the `[{"_empty": True}]` branch (line 80) to guarantee dict return type.
+
+#### trace_decision_history
+
+**Docstring** (lines 104-115):
+```
+Trace temporal evolution of decisions about a topic via Neo4j SUPERSEDES chain.
+
+Cost: ~$0.005. Target latency: ~500ms.
+
+Args:
+    channel_id: Scope context (for logging).
+    topic: Topic or entity name to trace.
+
+Returns:
+    List of decision nodes and SUPERSEDES relationships, ordered by traversal.
+```
+
+**What's good:**
+- Clear relationship type (SUPERSEDES)
+- Topic fuzzy-match (line 122)
+
+**What's weak:**
+- Return type is documented as `List[dict]` but exception handler at line 166 returns `{"result": [], "error": "graph_unavailable"}` — a dict, not a list
+- No example of success shape (actual: entity, superseded_by, relationship, confidence, context, text, decision_id, channel_id, topic)
+- Sentinel return (line 157: `[{"_empty": True, ...}]`) is inconsistent with normal list returns
+
+**Proposed improvements (critical):**
+- Fix exception return (line 166) to return `[]` or `[{"_empty": True, ...}]` for consistency
+- Docstring: `Returns List[{entity, superseded_by, relationship: "SUPERSEDES", confidence, context, text (excerpt), decision_id, channel_id, topic}]. Empty graph returns [{"_empty": True, ...}].`
+
+#### find_experts
+
+**Docstring** (lines 173-185):
+```
+Find top contributors for a topic by Neo4j expertise ranking.
+
+Cost: ~$0.005. Target latency: ~500ms.
+
+Args:
+    channel_id: Scope to this channel.
+    topic: Topic to rank expertise for.
+    limit: Max people to return (default 5).
+
+Returns:
+    List of {handle, expertise_score, fact_count} ordered by expertise_score desc.
+```
+
+**What's good:**
+- Clear ranking order (expertise_score desc, line 209)
+- Fallback for no results (line 211)
+
+**What's weak:**
+- Return shape incomplete: actual dict also includes text, subject_id, predicate, object_id, channel_id (lines 215-219)
+- Semantic scoring logic (lines 199-207) is opaque: scores any Person endpoint connected to a topic-containing node — not described in docstring
+
+**Proposed improvements:**
+- Expand Returns: `List[{handle, expertise_score, fact_count, text (excerpt), subject_id, predicate: "EXPERT_IN", object_id, channel_id}] ordered by expertise_score desc. Empty results return [{"_empty": True, ...}].`
+
+### Citation decorator touchpoints
+
+The citation decorator (`_citation_decorator.py:68-89`) has three branching paths:
+
+1. **List return (line 68-75)**: `isinstance(result, list)` — iterates each dict, annotates with `_cite` and `_src_id`
+   - Tools: `search_qa_history`, `search_channel_facts`, `search_media_references`, `get_recent_activity`, `trace_decision_history` (mostly), `find_experts`
+   - Sentinel handling: skips dicts with `_empty: True` (line 72)
+
+2. **Dict envelope (line 77-86)**: `isinstance(result, dict)` → checks for `results`, `items`, or `data` key containing a list
+   - Tools: `search_external_knowledge` (returns dict with `results` key; line 74 in external_tools.py)
+   - Unwraps, annotates inner list, re-wraps
+
+3. **Single-source dict (line 87-89)**: Falls through to annotate whole dict
+   - Tools: `search_relationships`, `get_wiki_page`, `get_topic_overview` (both return single dict per call)
+
+**Inconsistencies requiring snapshot tests before refactor:**
+
+- `search_relationships` returns dict on success but `[{"_empty": True}]` (list) on empty graph → inconsistent with decorator's list-vs-dict detection
+- `trace_decision_history` returns `list[dict]` on success but `dict` (error shape) on ConnectionError (line 166) → requires snapshot to verify decorator doesn't crash
+- `get_wiki_page` and `get_topic_overview` return single dict but decorator treats them as single-source (path 3), not envelope-wrapped — correct, but not obvious
+
+**Recommendation:** Generate snapshot tests for all three paths before Stream 3b refactors the decorator or return shapes.
+
+## Proposed TypedDict shapes for graph_tools.py (Stream 3b input)
+
+### search_relationships → dict (return type, not list)
+
+```python
+class RelationshipNode(TypedDict):
+    name: str
+    type: str | None
+
+class RelationshipEdge(TypedDict):
+    source: str
+    target: str
+    type: str
+    confidence: float
+    context: str
+
+class RelationshipSearchResult(TypedDict):
+    entities_searched: list[str]
+    nodes: list[RelationshipNode]
+    edges: list[RelationshipEdge]
+    text: str  # citation decorator field
+    subject_id: str  # citation decorator field
+    predicate: str  # citation decorator field
+    object_id: str  # citation decorator field
+    channel_id: str  # citation decorator field
+```
+
+### trace_decision_history → list[dict]
+
+```python
+class DecisionEvent(TypedDict):
+    entity: str
+    superseded_by: str
+    relationship: str  # always "SUPERSEDES"
+    confidence: float
+    context: str
+    text: str  # citation decorator field
+    decision_id: str  # citation decorator field
+    channel_id: str  # citation decorator field
+    topic: str
+```
+
+### find_experts → list[dict]
+
+```python
+class ExpertHit(TypedDict):
+    handle: str
+    expertise_score: float
+    fact_count: int
+    text: str  # citation decorator field
+    subject_id: str  # citation decorator field
+    predicate: str  # "EXPERT_IN"
+    object_id: str  # citation decorator field
+    channel_id: str  # citation decorator field
+```
+
+## Refactor recommendations deferred (non-graph modules)
+
+**wiki_tools.py**: Docstring enhancements only (no code changes). The stale activity sentinel and fallback to get_recent_activity (line 46-66) should be documented in the Returns section and hinted in the Args as "may invoke memory_tools if page is stale."
+
+**memory_tools.py**: Expand all return dict schemas in docstrings to list actual keys. The _mmr_rerank helper (line 92-143) is well-commented; no changes needed. The embedding fallback patterns (search_qa_history:78, search_channel_facts:189) are consistent and should be noted in docstrings.
+
+**external_tools.py**: Already well-documented. The envelope-wrapped result (lines 59-77) is appropriate for Tavily API responses. No changes needed.
+
+## Golden query set for Stream 3b snapshot tests
+
+Use these representative queries to generate JSON snapshots before refactoring graph_tools or citation decorator:
+
+1. **search_relationships, single entity**: `search_relationships(channel_id="C001", entities=["Alice"], hops=2)` → should return dict with 5+ edges, verify `text` field is present
+2. **search_relationships, multi-entity**: `search_relationships(channel_id="C001", entities=["Alice", "Bob"], hops=1)` → should merge subgraphs, verify no duplicate nodes
+3. **search_relationships, empty**: `search_relationships(channel_id="C001", entities=["Nonexistent"], hops=2)` → should return dict with empty nodes/edges (not list)
+4. **trace_decision_history, with SUPERSEDES**: `trace_decision_history(channel_id="C001", topic="Architecture v2")` → should return list of 2-5 events, verify decision_id format
+5. **trace_decision_history, no decisions**: `trace_decision_history(channel_id="C001", topic="Nonexistent")` → should return list or empty sentinel, verify not dict
+6. **find_experts, high scoring**: `find_experts(channel_id="C001", topic="Database", limit=3)` → should return 1-3 dicts, verify expertise_score is numeric
+7. **find_experts, no experts**: `find_experts(channel_id="C001", topic="Obscure XYZ", limit=5)` → should return empty list or sentinel
+8. **search_channel_facts, MMR rerank**: `search_channel_facts(channel_id="C001", query="deployment", limit=5)` → verify over-fetch (k*3) is applied and results are diversity-ranked
+9. **get_wiki_page, stale activity**: `get_wiki_page(channel_id="C001", page_type="activity")` → if cached page contains stale sentinel, verify fresh fallback is invoked
+10. **search_external_knowledge, Tavily envelope**: `search_external_knowledge(query="Python best practices", mode="best_practices")` → verify `results` key is unwrapped by decorator before annotation
diff --git a/docs/v1-archive/ARCHITECTURE_OVERVIEW.md b/docs/v1-archive/ARCHITECTURE_OVERVIEW.md
new file mode 100644
index 00000000..a38e3b08
--- /dev/null
+++ b/docs/v1-archive/ARCHITECTURE_OVERVIEW.md
@@ -0,0 +1,788 @@
+# Beever Atlas: Comprehensive Architecture Overview
+
+> **For**: Development Team, Product Team, Stakeholders
+> **Purpose**: Understand how Beever Atlas works and what makes it different from competitors
+
+---
+
+## TL;DR: What Makes Beever Atlas Different?
+
+```
+┌─────────────────────────────────────────────────────────────────────────────────┐
+│                     BEEVER ATLAS vs COMPETITORS AT A GLANCE                      │
+├─────────────────────────────────────────────────────────────────────────────────┤
+│                                                                                  │
+│  TRADITIONAL MEMORY SYSTEMS (memU, Mem0, Zep):                                  │
+│  ┌──────────┐     ┌──────────┐     ┌──────────┐                                 │
+│  │  Query   │ ──▶ │ Retrieve │ ──▶ │   LLM    │ ──▶  $0.05/query               │
+│  └──────────┘     └──────────┘     └──────────┘                                 │
+│       Every query hits LLM = HIGH COST, text-only, no free exploration          │
+│                                                                                  │
+│  ─────────────────────────────────────────────────────────────────────────────  │
+│                                                                                  │
+│  BEEVER ATLAS (Wiki-First + Multimodal):                                        │
+│  ┌──────────┐     ┌──────────┐                                                  │
+│  │  Query   │ ──▶ │   Wiki   │ ──▶  FREE (80% of queries)                      │
+│  └──────────┘     └──────────┘                                                  │
+│       │                                                                          │
+│       │ (only if needed)                                                         │
+│       ▼                                                                          │
+│  ┌──────────┐     ┌──────────┐                                                  │
+│  │ Retrieve │ ──▶ │   LLM    │ ──▶  $0.05 (20% of queries)                     │
+│  └──────────┘     └──────────┘                                                  │
+│       AVERAGE COST: $0.01/query (5x cheaper)                                    │
+│       + True multimodal (text, image, video, PDF)                               │
+│       + Cross-modal search ("find auth diagrams" returns images)                │
+│       + Intelligent forgetting (memories decay like human brain)                │
+│                                                                                  │
+└─────────────────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Part 1: The Complete User Journey
+
+The following diagram shows how a user interacts with Beever Atlas from start to finish, with the underlying technical components at each step.
+
+```mermaid
+flowchart TB
+    subgraph UserJourney["👤 USER JOURNEY"]
+        direction LR
+        U1["1. Connect<br/>Sources"] --> U2["2. Auto-Sync<br/>& Process"]
+        U2 --> U3["3. Browse<br/>Wiki (FREE)"]
+        U3 --> U4["4. Search<br/>& Ask"]
+        U4 --> U5["5. Export<br/>& Integrate"]
+    end
+
+    subgraph Sources["📥 STEP 1: CONNECT SOURCES"]
+        direction TB
+        S1["Slack"]
+        S2["Notion"]
+        S3["GitHub"]
+        S4["Local Files"]
+        S5["Web/URLs"]
+        S6["Calendar<br/>(Meetings)"]
+    end
+
+    subgraph Pipeline["⚙️ STEP 2: PROCESSING PIPELINE"]
+        direction TB
+        P1["INGEST<br/>Fetch raw content"]
+        P2["PREPROCESS<br/>Modality detection<br/>Image/PDF/Video parsing"]
+        P3["EXTRACT<br/>Facts + Narrative<br/>(LLM: Gemini Flash)"]
+        P4["CLASSIFY<br/>Domain/Entity/Action tags<br/>Knowledge type"]
+        P5["EMBED<br/>Jina v4 (2048-dim)<br/>Unified multimodal space"]
+        P6["CLUSTER<br/>Auto-topic grouping<br/>Label propagation"]
+        P7["PERSIST<br/>Weaviate + MongoDB"]
+
+        P1 --> P2 --> P3 --> P4 --> P5 --> P6 --> P7
+    end
+
+    subgraph Storage["💾 STEP 3: HIERARCHICAL STORAGE"]
+        direction TB
+        T0["TIER 0: Collection Summary<br/>• Overall overview<br/>• Key themes & decisions<br/>• Updated weekly"]
+        T1["TIER 1: Topic Clusters<br/>• Auto-grouped by theme<br/>• Cluster summary + members<br/>• Updated on new content"]
+        T2["TIER 2: Atomic Memories<br/>• facts[] + narrative<br/>• Full metadata & tags<br/>• Multimodal vectors"]
+
+        T0 --> T1 --> T2
+    end
+
+    subgraph Retrieval["🔍 STEP 4: DUAL RETRIEVAL"]
+        direction TB
+        R1{"Query<br/>Complexity?"}
+        R2["WIKI PATH (FREE)<br/>• Cached markdown<br/>• Topic tree<br/>• Decision log"]
+        R3["RAG PATH (Fast)<br/>• BM25 + Vector hybrid<br/>• RRF fusion<br/>• < 100ms"]
+        R4["LLM PATH (Deep)<br/>• Semantic ranking<br/>• CoT decomposition<br/>• Complex queries"]
+        R5["SUFFICIENCY CHECK<br/>Stop early if enough<br/>Expand if needed"]
+
+        R1 -->|"Simple/Browse"| R2
+        R1 -->|"Keyword/Fact"| R3
+        R1 -->|"Complex/Why"| R4
+        R3 --> R5
+        R4 --> R5
+    end
+
+    subgraph Output["📤 STEP 5: OUTPUT FORMATS"]
+        direction TB
+        O1["WIKI<br/>• FREE reads<br/>• Auto-updated<br/>• Topics/Decisions"]
+        O2["SEARCH<br/>• Progressive disclosure<br/>• Index → Full → Source<br/>• Multimodal results"]
+        O3["GROUNDED RESPONSE<br/>• Answer + Citations<br/>• Source permalinks<br/>• Confidence score"]
+        O4["TRAINING DATA<br/>• Instruction pairs<br/>• Trajectories<br/>• Quality filtered"]
+        O5["MCP SERVER<br/>• Agent integration<br/>• Tool interface<br/>• SDK"]
+    end
+
+    subgraph Lifecycle["🔄 BACKGROUND: MEMORY LIFECYCLE"]
+        direction TB
+        L1["NOVELTY DETECTION<br/>Skip duplicates<br/>Reinforce similar"]
+        L2["SELF-EVOLUTION<br/>Auto-update summaries<br/>on CRUD events"]
+        L3["FORGETTING<br/>Ebbinghaus curve<br/>Source-aware decay"]
+        L4["CONFLICT DETECTION<br/>Find contradictions<br/>Temporal supersession"]
+    end
+
+    %% Connections
+    Sources --> Pipeline
+    Pipeline --> Storage
+    Storage --> Retrieval
+    Retrieval --> Output
+    Storage <--> Lifecycle
+
+    style UserJourney fill:#e1f5fe
+    style Sources fill:#fff3e0
+    style Pipeline fill:#f3e5f5
+    style Storage fill:#e8f5e9
+    style Retrieval fill:#fce4ec
+    style Output fill:#e0f2f1
+    style Lifecycle fill:#fff8e1
+```
+
+---
+
+## Part 2: Why Wiki-First Architecture Matters
+
+This is the **#1 differentiator** from competitors. Most users don't need LLM for every query.
+
+```mermaid
+flowchart LR
+    subgraph Traditional["❌ TRADITIONAL: memU, Mem0, Zep"]
+        direction TB
+        TQ["Every Query"] --> TR["Vector Retrieve"]
+        TR --> TL["LLM Generate"]
+        TL --> TC["$0.05 per query"]
+
+        TN["100 queries/day<br/>= $5.00/day<br/>= $150/month"]
+    end
+
+    subgraph WikiFirst["✅ BEEVER ATLAS: Wiki-First"]
+        direction TB
+        WQ["Query"]
+        WD{"What type?"}
+
+        WQ --> WD
+        WD -->|"80% Browse/Explore"| WW["Wiki<br/>(Cached)"]
+        WD -->|"15% Search"| WS["Hybrid Search<br/>(Embedding only)"]
+        WD -->|"5% Complex"| WL["LLM Generate"]
+
+        WW --> WC1["FREE"]
+        WS --> WC2["$0.001"]
+        WL --> WC3["$0.05"]
+
+        WN["100 queries/day<br/>= $1.35/day<br/>= $40/month"]
+    end
+
+    subgraph Savings["💰 SAVINGS"]
+        direction TB
+        S1["3.7x CHEAPER"]
+        S2["Better UX<br/>(instant wiki)"]
+        S3["Exploration<br/>encouraged"]
+    end
+
+    Traditional -.->|"vs"| WikiFirst
+    WikiFirst --> Savings
+
+    style Traditional fill:#ffebee
+    style WikiFirst fill:#e8f5e9
+    style Savings fill:#e3f2fd
+```
+
+### Wiki Content Structure
+
+```
+┌─────────────────────────────────────────────────────────────────────────┐
+│  📖 WIKI: Engineering Knowledge Base                                     │
+├─────────────────────────────────────────────────────────────────────────┤
+│                                                                          │
+│  📄 OVERVIEW (Tier 0)                                                    │
+│  ├── "Our engineering team owns 12 services focused on..."              │
+│  ├── Key Themes: [Auth, Payments, Data Pipeline, Infrastructure]        │
+│  └── Recent: "Migrated to Kubernetes (Jan 2025)"                        │
+│                                                                          │
+│  📁 TOPICS (Tier 1)                                                      │
+│  ├── 🔐 Authentication (23 memories)                                     │
+│  │   └── "OAuth2 + JWT, migrated from sessions in Q3 2024"              │
+│  ├── 💳 Payments (18 memories)                                           │
+│  │   └── "Stripe integration with retry logic"                          │
+│  ├── 🗄️ Database (31 memories)                                          │
+│  │   └── "PostgreSQL + Redis, considering CockroachDB"                  │
+│  └── 🚀 Infrastructure (15 memories)                                     │
+│      └── "AWS EKS, Terraform, ArgoCD"                                   │
+│                                                                          │
+│  📋 DECISIONS (Extracted from Tier 2)                                    │
+│  ├── 2025-01-15: "Chose Prisma over TypeORM - better DX"                │
+│  ├── 2025-01-10: "Added Redis for session caching"                      │
+│  └── 2025-01-05: "Delayed K8s migration by 2 weeks"                     │
+│                                                                          │
+│  👥 PEOPLE (Entity extraction)                                           │
+│  ├── Alice: [auth, security]                                            │
+│  ├── Bob: [payments, infrastructure]                                     │
+│  └── Carol: [database, performance]                                      │
+│                                                                          │
+└─────────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Part 3: True Multimodal Architecture
+
+Unlike competitors that only support text, Beever Atlas uses **unified embedding space** for cross-modal search.
+
+```mermaid
+flowchart TB
+    subgraph Input["📥 MULTIMODAL INPUT"]
+        I1["📝 Text<br/>Slack, Notion, docs"]
+        I2["🖼️ Images<br/>Diagrams, screenshots"]
+        I3["📄 PDFs<br/>Specs, contracts"]
+        I4["🎥 Videos<br/>Meetings, demos"]
+        I5["🎤 Audio<br/>Voice memos"]
+    end
+
+    subgraph Processing["⚙️ MODALITY-SPECIFIC PROCESSING"]
+        P1["Text Extraction"]
+        P2["Vision Analysis<br/>(Gemini Vision)"]
+        P3["Document Parsing<br/>(PyMuPDF)"]
+        P4["Frame Extraction<br/>+ Transcription<br/>(Whisper)"]
+        P5["Transcription<br/>(Whisper)"]
+    end
+
+    subgraph Unified["🎯 UNIFIED EMBEDDING SPACE"]
+        direction TB
+        U1["Jina v4 Multimodal<br/>2048 dimensions"]
+
+        subgraph Vectors["Named Vectors in Weaviate"]
+            V1["text_vector"]
+            V2["image_vector"]
+            V3["doc_vector"]
+        end
+
+        U1 --> Vectors
+
+        Note1["Same query can match<br/>across ALL modalities"]
+    end
+
+    subgraph Search["🔍 CROSS-MODAL SEARCH"]
+        S1["Query: 'auth flow diagram'"]
+        S2["Results:"]
+        S3["• 🖼️ OAuth2 diagram (0.95)"]
+        S4["• 📄 Auth spec PDF (0.88)"]
+        S5["• 📝 Slack discussion (0.82)"]
+        S6["• 🎥 Video frame (0.78)"]
+    end
+
+    I1 --> P1
+    I2 --> P2
+    I3 --> P3
+    I4 --> P4
+    I5 --> P5
+
+    P1 & P2 & P3 & P4 & P5 --> Unified
+    Unified --> Search
+
+    style Input fill:#e3f2fd
+    style Processing fill:#f3e5f5
+    style Unified fill:#e8f5e9
+    style Search fill:#fff3e0
+```
+
+### Competitor Comparison: Multimodal Support
+
+| Capability | memU | Mem0 | MemOS | Zep/Graphiti | **Beever Atlas** |
+|------------|------|------|-------|--------------|------------------|
+| Text | ✅ | ✅ | ✅ | ✅ | ✅ |
+| Images | ✅ | ❌ | ❌ | ❌ | ✅ |
+| PDFs | ✅ | ❌ | ✅ | ❌ | ✅ |
+| Video | ✅ | ❌ | ❌ | ❌ | ✅ |
+| Audio | ✅ | ❌ | ❌ | ❌ | ✅ |
+| Cross-modal search | ❌ | ❌ | ❌ | ❌ | **✅** |
+| Unified embeddings | ❌ | ❌ | ❌ | ❌ | **✅** |
+
+---
+
+## Part 4: Memory Lifecycle Management
+
+Beever Atlas doesn't just store memories - it **evolves** them intelligently.
+
+```mermaid
+flowchart TB
+    subgraph Ingest["📥 NEW CONTENT ARRIVES"]
+        I1["New message<br/>from Slack"]
+    end
+
+    subgraph Novelty["🔍 NOVELTY DETECTION"]
+        N1{"Check similarity<br/>to existing"}
+        N2["≥ 95%: SKIP<br/>(exact duplicate)"]
+        N3["≥ 85%: REINFORCE<br/>(similar exists)"]
+        N4["≥ 70%: LINK<br/>(related content)"]
+        N5["< 70%: ADD<br/>(novel content)"]
+    end
+
+    subgraph Evolution["🔄 SELF-EVOLUTION"]
+        E1["Memory Added/Updated/Deleted"]
+        E2["Find affected clusters"]
+        E3["LLM patches summaries"]
+        E4["Update Tier 0 if significant"]
+        E5["Wiki auto-refreshes"]
+    end
+
+    subgraph Forgetting["⏳ INTELLIGENT FORGETTING (Ebbinghaus)"]
+        direction TB
+        F1["Retention Formula:<br/>R(t) = e^(-t/S) × source_multiplier"]
+
+        subgraph Multipliers["Source Credibility"]
+            M1["📚 Docs: 2.0x<br/>(authoritative)"]
+            M2["💬 Internal: 1.5x<br/>(important)"]
+            M3["🌐 Web: 0.5x<br/>(ephemeral)"]
+            M4["📱 Social: 0.3x<br/>(very ephemeral)"]
+        end
+
+        F2["Low retention → Archive/Prune"]
+    end
+
+    subgraph Reinforce["💪 SPACED REPETITION"]
+        R1["Memory Retrieved"]
+        R2["Stability increases"]
+        R3["Decay slows down"]
+        R4["Important memories persist"]
+    end
+
+    subgraph Conflict["⚠️ CONTRADICTION DETECTION"]
+        C1["Find similar facts"]
+        C2["LLM checks contradiction"]
+        C3{"Contradicts?"}
+        C4["Set old.invalid_at = new.valid_at"]
+        C5["Track supersession chain"]
+        C6["Keep both for history"]
+    end
+
+    Ingest --> Novelty
+    N1 --> N2 & N3 & N4 & N5
+    N5 --> Evolution
+    N3 --> Reinforce
+
+    Evolution --> E1 --> E2 --> E3 --> E4 --> E5
+
+    Forgetting
+    Reinforce --> R1 --> R2 --> R3 --> R4
+
+    N5 --> Conflict
+    C1 --> C2 --> C3
+    C3 -->|Yes| C4 --> C5
+    C3 -->|No| C6
+
+    style Ingest fill:#e3f2fd
+    style Novelty fill:#fff3e0
+    style Evolution fill:#e8f5e9
+    style Forgetting fill:#ffebee
+    style Reinforce fill:#f3e5f5
+    style Conflict fill:#fce4ec
+```
+
+---
+
+## Part 5: Dual Retrieval System
+
+Beever Atlas uses **two retrieval modes** that automatically select based on query complexity.
+
+```mermaid
+flowchart TB
+    subgraph Query["🔍 QUERY ARRIVES"]
+        Q1["'What is our auth system?'<br/>or<br/>'Why did we choose JWT over sessions<br/>and how does it relate to the<br/>mobile app security requirements?'"]
+    end
+
+    subgraph Classifier["🧠 QUERY CLASSIFIER"]
+        C1{"Analyze Query"}
+        C2["SIMPLE:<br/>• Factual lookup<br/>• Keyword search<br/>• Overview request"]
+        C3["COMPLEX:<br/>• 'Why' questions<br/>• Comparison<br/>• Multi-hop reasoning<br/>• Temporal analysis"]
+    end
+
+    subgraph RAG["⚡ RAG PATH (Fast)"]
+        direction TB
+        R1["Hierarchical Routing"]
+        R2["Query Depth:<br/>OVERVIEW → Tier 0<br/>TOPIC → Tier 1<br/>DETAIL → Tier 2"]
+        R3["Hybrid Search<br/>BM25 + Vector"]
+        R4["RRF Fusion<br/>(k=60)"]
+        R5["< 100ms latency"]
+    end
+
+    subgraph LLM["🧠 LLM PATH (Deep)"]
+        direction TB
+        L1["CoT Decomposition"]
+        L2["Break into sub-questions:<br/>1. What is our auth system?<br/>2. Why JWT vs sessions?<br/>3. Mobile security reqs?"]
+        L3["Parallel retrieval"]
+        L4["LLM synthesizes"]
+        L5["< 3s latency"]
+    end
+
+    subgraph Sufficiency["✅ SUFFICIENCY CHECK"]
+        S1{"Enough info?"}
+        S2["RETURN results"]
+        S3["EXPAND search<br/>• Drill down tier<br/>• Broaden query<br/>• Include related"]
+    end
+
+    subgraph Response["📤 GROUNDED RESPONSE"]
+        Resp1["Answer with citations"]
+        Resp2["Source permalinks"]
+        Resp3["Confidence score"]
+    end
+
+    Query --> Classifier
+    C1 --> C2 & C3
+    C2 --> RAG
+    C3 --> LLM
+
+    RAG --> R1 --> R2 --> R3 --> R4 --> R5 --> Sufficiency
+    LLM --> L1 --> L2 --> L3 --> L4 --> L5 --> Sufficiency
+
+    S1 -->|Yes| S2 --> Response
+    S1 -->|No| S3 --> R3
+
+    style Query fill:#e3f2fd
+    style Classifier fill:#fff3e0
+    style RAG fill:#e8f5e9
+    style LLM fill:#f3e5f5
+    style Sufficiency fill:#fce4ec
+    style Response fill:#e0f2f1
+```
+
+---
+
+## Part 6: Technical Stack Mapping
+
+How each technology serves the user journey.
+
+```mermaid
+flowchart TB
+    subgraph UserLayer["👤 USER INTERFACE LAYER"]
+        UI1["Web UI<br/>(React + Vite)"]
+        UI2["REST API<br/>(FastAPI)"]
+        UI3["MCP Server<br/>(Claude/Agent integration)"]
+    end
+
+    subgraph AppLayer["⚙️ APPLICATION LAYER"]
+        A1["Ingestion Service<br/>Source adapters, pipeline"]
+        A2["Retrieval Service<br/>Dual retrieval, sufficiency"]
+        A3["Wiki Service<br/>Generation, caching"]
+        A4["Lifecycle Service<br/>Decay, evolution, conflicts"]
+    end
+
+    subgraph MLLayer["🧠 ML/AI LAYER"]
+        ML1["Gemini Flash Lite<br/>Metadata extraction<br/>$0.30/1M tokens"]
+        ML2["Gemini Flash<br/>Response generation<br/>$0.60/1M tokens"]
+        ML3["Jina v4<br/>Multimodal embeddings<br/>2048-dim unified"]
+        ML4["Whisper API<br/>Audio transcription"]
+    end
+
+    subgraph DataLayer["💾 DATA LAYER"]
+        D1["Weaviate Cloud<br/>• Vector storage (HNSW)<br/>• Named vectors<br/>• BM25 index<br/>• Cross-references"]
+        D2["MongoDB<br/>• State management<br/>• Sync status<br/>• Relationships<br/>• Conflict log"]
+        D3["Redis (optional)<br/>• Wiki cache<br/>• Session state"]
+    end
+
+    subgraph InfraLayer["🏗️ INFRASTRUCTURE"]
+        I1["Docker Compose<br/>(local dev)"]
+        I2["Kubernetes<br/>(production)"]
+        I3["Celery/Dramatiq<br/>(background jobs)"]
+    end
+
+    UserLayer --> AppLayer
+    AppLayer --> MLLayer
+    AppLayer --> DataLayer
+    DataLayer --> InfraLayer
+
+    style UserLayer fill:#e3f2fd
+    style AppLayer fill:#f3e5f5
+    style MLLayer fill:#fff3e0
+    style DataLayer fill:#e8f5e9
+    style InfraLayer fill:#eceff1
+```
+
+### Technology Decision Matrix
+
+| Component | Choice | Why (vs alternatives) |
+|-----------|--------|----------------------|
+| **Vector DB** | Weaviate | Named vectors for multimodal, built-in BM25, production-ready (vs Qdrant, Pinecone) |
+| **Embeddings** | Jina v4 | 2048-dim unified multimodal space (vs OpenAI 1536-dim text-only) |
+| **State DB** | MongoDB | Flexible schema for relationships, async via Motor (vs PostgreSQL rigidity) |
+| **LLM (cheap)** | Gemini Flash Lite | $0.30/1M tokens, fast (vs GPT-4o-mini at $0.60) |
+| **LLM (quality)** | Gemini Flash | $0.60/1M tokens, good quality (vs GPT-4o at $2.50) |
+| **Backend** | FastAPI | Async-first, MCP support, Python ecosystem (vs Node.js) |
+| **Frontend** | React + Vite | Fast dev, component ecosystem (vs Next.js complexity for MVP) |
+
+---
+
+## Part 7: Competitive Feature Matrix
+
+### Feature-by-Feature Comparison
+
+```mermaid
+flowchart TB
+    subgraph Features["📊 FEATURE COMPARISON"]
+        direction TB
+
+        subgraph Cost["💰 COST MODEL"]
+            C1["memU/Mem0/Zep:<br/>Every query = LLM call<br/>~$0.05/query"]
+            C2["Beever Atlas:<br/>Wiki-first = FREE reads<br/>~$0.01/query average"]
+        end
+
+        subgraph Modal["🎨 MULTIMODAL"]
+            M1["memU: Text, Image, Video<br/>(separate spaces)"]
+            M2["Mem0/Zep: Text only"]
+            M3["Beever Atlas:<br/>Unified cross-modal<br/>(text query → image results)"]
+        end
+
+        subgraph Forget["⏳ MEMORY DECAY"]
+            F1["memU/Mem0: None"]
+            F2["MemOS: FIFO only"]
+            F3["Beever Atlas:<br/>Ebbinghaus curve +<br/>source-aware multipliers"]
+        end
+
+        subgraph Graph["🔗 RELATIONSHIPS"]
+            G1["memU: Category only"]
+            G2["Mem0: Basic graph"]
+            G3["MemOS: Rich edges"]
+            G4["Beever Atlas:<br/>Rich edges +<br/>temporal supersession"]
+        end
+    end
+
+    style Features fill:#f5f5f5
+```
+
+### Summary Table
+
+| Feature | memU | Mem0 | MemOS | Zep/Graphiti | **Beever Atlas** |
+|---------|------|------|-------|--------------|------------------|
+| **Wiki-First (FREE reads)** | ❌ | ❌ | ❌ | ❌ | ✅ |
+| **True Cross-Modal Search** | ❌ | ❌ | ❌ | ❌ | ✅ |
+| **Unified Embedding Space** | ❌ | ❌ | ❌ | ❌ | ✅ |
+| **Ebbinghaus Forgetting** | ❌ | ❌ | ❌ | ❌ | ✅ |
+| **Source Credibility Decay** | ❌ | ❌ | ❌ | ❌ | ✅ |
+| **Bi-Temporal Model** | ❌ | ❌ | ❌ | ✅ | ✅ |
+| **Contradiction Detection** | ❌ | ❌ | ✅ | ✅ | ✅ |
+| **Self-Evolving Summaries** | ✅ | ❌ | Partial | ❌ | ✅ |
+| **Dual Retrieval (RAG+LLM)** | ✅ | ❌ | ✅ | ✅ | ✅ |
+| **CoT Query Decomposition** | ❌ | ❌ | ✅ | ❌ | ✅ |
+| **Graph Relationships** | Category | Basic | Rich | Rich | Rich |
+| **Training Data Export** | ❌ | ❌ | ❌ | ❌ | ✅ |
+
+---
+
+## Part 8: Data Flow - Complete Picture
+
+```mermaid
+flowchart TB
+    subgraph Sources["📥 DATA SOURCES"]
+        Slack["Slack"]
+        Notion["Notion"]
+        GitHub["GitHub"]
+        Files["Local Files"]
+        Web["Web/URLs"]
+        Calendar["Calendar"]
+    end
+
+    subgraph Adapters["🔌 SOURCE ADAPTERS"]
+        A1["SlackAdapter"]
+        A2["NotionAdapter"]
+        A3["GitHubAdapter"]
+        A4["FileSystemAdapter"]
+        A5["WebScraperAdapter"]
+        A6["CalendarAdapter"]
+    end
+
+    subgraph Normalize["📋 NORMALIZED INPUT"]
+        N1["RawContent<br/>• content: str<br/>• source_type: str<br/>• source_id: str<br/>• source_url: str<br/>• timestamp: datetime"]
+    end
+
+    subgraph Pipeline["⚙️ PROCESSING PIPELINE"]
+        P1["1. PREPROCESS<br/>Detect modality<br/>Parse PDF/Image/Video"]
+        P2["2. EXTRACT<br/>facts[] + narrative<br/>(Gemini Flash Lite)"]
+        P3["3. CLASSIFY<br/>domain_tags<br/>entity_tags<br/>action_tags<br/>knowledge_type"]
+        P4["4. EMBED<br/>Jina v4 (2048-dim)<br/>text/image/doc vectors"]
+        P5["5. DEDUPE<br/>Novelty detection<br/>Skip/Reinforce/Link"]
+        P6["6. CLUSTER<br/>Topic assignment<br/>Label propagation"]
+        P7["7. PERSIST<br/>Weaviate + MongoDB"]
+    end
+
+    subgraph Memory["💾 ATOMIC MEMORY"]
+        M1["AtomicMemory<br/>├── id<br/>├── facts: list[str]<br/>├── narrative: str<br/>├── source_*<br/>├── *_tags<br/>├── knowledge_type<br/>├── text_vector<br/>├── image_vector<br/>├── stability<br/>├── valid_at<br/>├── invalid_at<br/>├── cluster_id<br/>└── collection_id"]
+    end
+
+    subgraph Hierarchy["📚 HIERARCHICAL STORAGE"]
+        H0["Tier 0: Collection Summary"]
+        H1["Tier 1: Topic Clusters"]
+        H2["Tier 2: Atomic Memories"]
+
+        H0 --> H1 --> H2
+    end
+
+    subgraph Lifecycle["🔄 LIFECYCLE SERVICES"]
+        L1["Self-Evolution<br/>Auto-update summaries"]
+        L2["Forgetting<br/>Ebbinghaus decay"]
+        L3["Conflict Detection<br/>Temporal supersession"]
+    end
+
+    subgraph Output["📤 OUTPUT INTERFACES"]
+        O1["Wiki API<br/>FREE reads"]
+        O2["Search API<br/>Hybrid RAG"]
+        O3["Ask API<br/>Grounded responses"]
+        O4["Export API<br/>Training data"]
+        O5["MCP Server<br/>Agent tools"]
+    end
+
+    Sources --> Adapters
+    Adapters --> Normalize
+    Normalize --> Pipeline
+    P1 --> P2 --> P3 --> P4 --> P5 --> P6 --> P7
+    P7 --> Memory
+    Memory --> Hierarchy
+    Hierarchy <--> Lifecycle
+    Hierarchy --> Output
+
+    style Sources fill:#e3f2fd
+    style Adapters fill:#fff3e0
+    style Pipeline fill:#f3e5f5
+    style Memory fill:#e8f5e9
+    style Hierarchy fill:#e0f2f1
+    style Lifecycle fill:#fff8e1
+    style Output fill:#fce4ec
+```
+
+---
+
+## Part 9: Key Innovations Explained
+
+### Innovation 1: Wiki-First Pattern
+
+```
+PROBLEM: Every query costs money (LLM calls)
+SOLUTION: Pre-generate browsable wiki from memories
+
+HOW IT WORKS:
+1. Tier 0/1 summaries are generated on content change
+2. Wiki markdown is cached and served statically
+3. 80% of user interactions are just browsing
+4. Only "Ask" queries hit the LLM
+
+RESULT: 5x cost reduction vs competitors
+```
+
+### Innovation 2: Unified Multimodal Space
+
+```
+PROBLEM: Can't search for "auth diagram" and find images
+SOLUTION: Jina v4 embeds all modalities in same 2048-dim space
+
+HOW IT WORKS:
+1. Text, images, PDFs → same vector space
+2. Semantic similarity works across modalities
+3. Named vectors in Weaviate allow modality-specific indexes
+4. Query can match any modality
+
+RESULT: "Find deployment architecture" returns diagrams, docs, and discussions
+```
+
+### Innovation 3: Intelligent Forgetting
+
+```
+PROBLEM: Memory grows forever, old info clutters results
+SOLUTION: Ebbinghaus curve + source-aware decay
+
+HOW IT WORKS:
+1. Retention = e^(-time/Stability) × source_multiplier
+2. Documentation decays slowly (2.0x multiplier)
+3. Social media decays fast (0.3x multiplier)
+4. Frequently accessed memories gain stability
+5. Low-retention memories are archived
+
+RESULT: Fresh, relevant memories; authoritative sources persist
+```
+
+### Innovation 4: Temporal Supersession
+
+```
+PROBLEM: Facts change over time, old info misleads
+SOLUTION: Bi-temporal model with contradiction detection
+
+HOW IT WORKS:
+1. Each memory has valid_at, invalid_at, created_at, expired_at
+2. On new fact, find similar existing facts
+3. LLM detects contradictions
+4. Old fact gets invalid_at = new fact's valid_at
+5. Supersession chain tracked for history
+
+RESULT: "What was true on Jan 15?" queries work correctly
+```
+
+---
+
+## Part 10: Implementation Phases
+
+```mermaid
+gantt
+    title Beever Atlas Implementation Roadmap
+    dateFormat  YYYY-MM-DD
+
+    section Phase 1: MVP
+    Foundation (FastAPI, Weaviate, Models)    :p1a, 2025-01-20, 7d
+    Local File + GitHub Adapters              :p1b, after p1a, 5d
+    Memory Extraction + Embedding             :p1c, after p1a, 7d
+    Hierarchical Storage (3-tier)             :p1d, after p1c, 5d
+    Hybrid Search (BM25 + Vector)             :p1e, after p1d, 3d
+    Wiki Generation + Caching                 :p1f, after p1e, 5d
+    Ask with Citations                        :p1g, after p1f, 4d
+    Web UI (Browse, Search, Ask)              :p1h, after p1f, 10d
+
+    section Phase 2: Core Enhancements
+    Dual Retrieval (RAG + LLM)                :p2a, after p1h, 7d
+    Self-Evolving Summaries                   :p2b, after p2a, 7d
+    Sufficiency Checking                      :p2c, after p2a, 4d
+    Novelty Detection                         :p2d, after p2b, 5d
+    Forgetting (Ebbinghaus)                   :p2e, after p2d, 5d
+    Slack + Notion Adapters                   :p2f, after p1h, 10d
+
+    section Phase 3: Advanced Features
+    Graph Relationships                       :p3a, after p2e, 10d
+    Contradiction Detection                   :p3b, after p3a, 7d
+    Bi-Temporal Model                         :p3c, after p3b, 7d
+    CoT Query Decomposition                   :p3d, after p2c, 7d
+    Memory Scheduling                         :p3e, after p3c, 10d
+    MCP Server                                :p3f, after p2b, 7d
+
+    section Phase 4: Extended Use Cases
+    Meeting Minutes Processing                :p4a, after p3a, 10d
+    Video/Audio Processing                    :p4b, after p4a, 14d
+    Training Data Export                      :p4c, after p3c, 7d
+    Multi-Tenancy                             :p4d, after p3e, 14d
+```
+
+---
+
+## Quick Reference: When to Use What
+
+| User Intent | Path | Cost | Latency |
+|-------------|------|------|---------|
+| "Show me the overview" | Wiki → Tier 0 | FREE | ~50ms |
+| "What topics do we have?" | Wiki → Tier 1 list | FREE | ~50ms |
+| "Tell me about authentication" | Wiki → Tier 1 detail | FREE | ~50ms |
+| "Find messages about Redis" | Search → Hybrid RAG | ~$0.001 | ~100ms |
+| "What did Alice say about caching?" | Search → Hybrid RAG | ~$0.001 | ~100ms |
+| "Why did we choose PostgreSQL?" | Ask → LLM Path | ~$0.02 | ~2s |
+| "Compare our auth approaches over time" | Ask → CoT + LLM | ~$0.05 | ~3s |
+
+---
+
+## Appendix: Glossary
+
+| Term | Definition |
+|------|------------|
+| **Atomic Memory** | Single unit of knowledge with facts[], narrative, and metadata |
+| **Topic Cluster** | Group of related memories with summary (Tier 1) |
+| **Collection Summary** | High-level overview of all knowledge (Tier 0) |
+| **Wiki-First** | Pattern where reads are cached, LLM only for complex queries |
+| **Dual Retrieval** | Automatic selection between RAG (fast) and LLM (deep) |
+| **Sufficiency Check** | Stop retrieval early when enough context found |
+| **RRF** | Reciprocal Rank Fusion - combining multiple search results |
+| **Ebbinghaus Curve** | Memory decay formula: R(t) = e^(-t/S) |
+| **Temporal Supersession** | When new fact invalidates old fact |
+| **Cross-Modal Search** | Text query finding images/videos/docs |
+| **Named Vectors** | Weaviate feature for modality-specific indexes |
+
+---
+
+*This document provides a comprehensive overview of Beever Atlas architecture. For implementation details, see PIVOT_PLAN.md and MVP_PLAN.md.*
diff --git a/docs/v1-archive/ARCHITECTURE_OVERVIEW_V2_MONOLITH.md b/docs/v1-archive/ARCHITECTURE_OVERVIEW_V2_MONOLITH.md
new file mode 100644
index 00000000..a3d0302f
--- /dev/null
+++ b/docs/v1-archive/ARCHITECTURE_OVERVIEW_V2_MONOLITH.md
@@ -0,0 +1,1288 @@
+# Beever Atlas v2: Comprehensive Architecture Overview
+
+> **For**: Development Team, Product Team, Stakeholders
+> **Purpose**: Complete technical reference for the v2 dual-memory architecture
+> **Related**: `TECHNICAL_PROPOSAL.md` (design decisions), `WEAKNESS_RESOLUTION_MAP.md` (v1 fixes), `REFERENCE_PAPERS.md` (research basis)
+
+---
+
+## TL;DR: What Changed from v1 to v2?
+
+```
+┌──────────────────────────────────────────────────────────────────────────────┐
+│                          v1 (DEMO) vs v2 (PRODUCTION)                        │
+├──────────────────────────────────────────────────────────────────────────────┤
+│                                                                               │
+│  v1: Single Memory System (Weaviate only)                                    │
+│  ┌──────────┐     ┌──────────┐     ┌──────────┐                             │
+│  │  Query   │ ──▶ │ Weaviate │ ──▶ │   LLM    │                             │
+│  └──────────┘     │ (broken  │     └──────────┘                             │
+│                    │ clusters)│                                               │
+│                    └──────────┘                                               │
+│  Problems: cluster linking no-op, regex classifier, no temporal decay,      │
+│  no relational queries, Slack only, 5.25/10 memory quality                  │
+│                                                                               │
+│  ─────────────────────────────────────────────────────────────────────────── │
+│                                                                               │
+│  v2: Dual Memory System (Weaviate + Neo4j)                                  │
+│  ┌──────────┐     ┌──────────────┐     ┌──────────┐                         │
+│  │  Query   │ ──▶ │ Smart Router │ ──▶ │ Response │                         │
+│  └──────────┘     └──────┬───────┘     └──────────┘                         │
+│                     ┌────┴─────┐                                             │
+│                     ▼          ▼                                             │
+│               ┌──────────┐ ┌──────────┐                                     │
+│               │ Weaviate │ │  Neo4j   │                                     │
+│               │ (fixed   │ │ (graph   │                                     │
+│               │ 3-tier)  │ │ memory)  │                                     │
+│               └──────────┘ └──────────┘                                     │
+│  Adds: graph relationships, temporal evolution, flexible entities,          │
+│  multi-platform, LLM query understanding, quality gates, all v1 fixes      │
+│                                                                               │
+└──────────────────────────────────────────────────────────────────────────────┘
+```
+
+**Key differentiators from competitors (memU, Mem0, Zep, MemGPT):**
+
+| Capability | Competitors | Beever Atlas v2 |
+|------------|------------|-----------------|
+| Wiki-first (FREE reads) | Every query = LLM call | 80% free via cached wiki |
+| Dual memory (semantic + graph) | Single memory model | Weaviate for facts + Neo4j for relationships |
+| Cross-modal search | Text only (mostly) | Text query → finds images, PDFs, videos |
+| Temporal evolution | Limited | Bi-temporal + SUPERSEDES chains in graph |
+| Flexible entity types | Fixed or none | LLM creates any entity/relationship type |
+| Multi-platform | Single platform | Slack, Teams, Discord via adapter layer |
+| Quality-gated ingestion | Accept everything | Reject < 0.5 quality score |
+
+---
+
+## Part 1: The Complete System — Full Picture
+
+```mermaid
+flowchart TB
+    subgraph Sources["📥 DATA SOURCES"]
+        direction LR
+        S1["Slack"]
+        S2["Teams"]
+        S3["Discord"]
+    end
+
+    subgraph Adapters["🔌 PLATFORM ADAPTERS"]
+        direction LR
+        A1["SlackAdapter<br/>(slack-sdk)"]
+        A2["TeamsAdapter<br/>(MS Graph API)"]
+        A3["DiscordAdapter<br/>(discord.py)"]
+    end
+
+    subgraph Normalize["📋 NORMALIZED MESSAGE"]
+        N1["NormalizedMessage<br/>content, author, platform<br/>channel, timestamp<br/>attachments, thread"]
+    end
+
+    subgraph Pipeline["⚙️ INGESTION PIPELINE (6 stages)"]
+        direction TB
+        P1["1. PREPROCESS<br/>Modality detection<br/>Thread assembly"]
+        P2["2. EXTRACT + QUALITY GATE<br/>LLM fact extraction<br/>Reject quality < 0.5<br/>Max 2 facts/message"]
+        P3["3. ENTITY EXTRACTION<br/>People, Decisions, Projects<br/>+ flexible extensions<br/>Relationships + temporal"]
+        P4["4. CLASSIFY + TAG<br/>Topic, entity, action tags<br/>Importance scoring"]
+        P5["5. EMBED<br/>Jina v4 (2048-dim)<br/>text/image/doc vectors"]
+        P6["6. NOVELTY + PERSIST<br/>Dedup → Write to both stores"]
+
+        P1 --> P2 --> P3 --> P4 --> P5 --> P6
+    end
+
+    subgraph SemanticMem["💾 SEMANTIC MEMORY (Weaviate)"]
+        direction TB
+        W0["Tier 0: Channel Summary"]
+        W1["Tier 1: Topic Clusters<br/>(FIXED: actual linking)"]
+        W2["Tier 2: Atomic Facts<br/>+ multimodal vectors"]
+        W0 --> W1 --> W2
+    end
+
+    subgraph GraphMem["🔗 GRAPH MEMORY (Neo4j)"]
+        direction TB
+        G1["Flexible Entities<br/>Person, Decision, Project<br/>Technology, Team, ..."]
+        G2["Flexible Relationships<br/>DECIDED, WORKS_ON<br/>BLOCKED_BY, SUPERSEDES, ..."]
+        G3["Event Nodes<br/>(episodic links<br/>to Weaviate)"]
+        G1 --- G2
+        G2 --- G3
+    end
+
+    subgraph State["📊 STATE (MongoDB)"]
+        direction TB
+        M1["Sync state"]
+        M2["Wiki cache"]
+        M3["Quality logs"]
+    end
+
+    subgraph Decompose["🔀 QUERY DECOMPOSITION"]
+        direction TB
+        QD["QueryDecomposer<br/>(LLM flash-lite)<br/>Complex → parallel sub-queries"]
+    end
+
+    subgraph Router["🧠 SMART QUERY ROUTER"]
+        direction TB
+        QU["Query Understanding<br/>(LLM flash-lite)"]
+        RD{"Route?"}
+        QU --> RD
+    end
+
+    subgraph Sys1["⚡ SYSTEM-1: Semantic Retrieval"]
+        direction TB
+        S1R["3-tier hierarchical search<br/>Topic-first → scoped atomic<br/>Bidirectional expansion<br/>Temporal decay + quality boost"]
+    end
+
+    subgraph Sys2["🔗 SYSTEM-2: Graph Retrieval"]
+        direction TB
+        S2R["Entity resolution<br/>Multi-hop traversal<br/>Temporal chain following<br/>Episodic enrichment from Weaviate"]
+    end
+
+    subgraph ExtSearch["🌐 EXTERNAL SEARCH"]
+        direction TB
+        ES["Tavily API<br/>Web search + doc lookup<br/>Content extraction"]
+    end
+
+    subgraph Response["📤 RESPONSE GENERATION"]
+        direction TB
+        RG["Merge + dedup + rank<br/>Grounded answer<br/>Citations + permalinks<br/>Gemini Flash"]
+    end
+
+    subgraph Wiki["📖 WIKI SYSTEM"]
+        direction TB
+        WK["Overview (Tier 0)<br/>Topics (Tier 1)<br/>People (Neo4j)<br/>Decisions (Neo4j)<br/>Recent Activity"]
+    end
+
+    subgraph Lifecycle["🔄 BACKGROUND SERVICES"]
+        direction TB
+        LC1["Consolidation<br/>(cluster building)"]
+        LC2["Wiki refresh"]
+        LC3["Temporal decay<br/>+ contradiction check"]
+    end
+
+    Sources --> Adapters --> Normalize --> Pipeline
+    P6 --> SemanticMem
+    P6 --> GraphMem
+    P6 --> State
+
+    Decompose --> Router
+    Router --> |"semantic"| Sys1
+    Router --> |"graph"| Sys2
+    Router --> |"both"| Sys1 & Sys2
+    Decompose --> |"external_queries"| ExtSearch
+
+    Sys1 --> Response
+    Sys2 --> Response
+    ExtSearch --> Response
+
+    SemanticMem --> Sys1
+    GraphMem --> Sys2
+    SemanticMem --> Wiki
+    GraphMem --> Wiki
+    State --> Wiki
+
+    SemanticMem <--> Lifecycle
+    GraphMem <--> Lifecycle
+
+    style Sources fill:#e3f2fd,color:#333
+    style Adapters fill:#fff3e0,color:#333
+    style Pipeline fill:#f3e5f5,color:#333
+    style SemanticMem fill:#e8f5e9,color:#333
+    style GraphMem fill:#e1f5fe,color:#333
+    style State fill:#eceff1,color:#333
+    style Decompose fill:#fce4ec,color:#333
+    style Router fill:#fff8e1,color:#333
+    style Sys1 fill:#e8f5e9,color:#333
+    style Sys2 fill:#e1f5fe,color:#333
+    style ExtSearch fill:#fff3e0,color:#333
+    style Response fill:#e0f2f1,color:#333
+    style Wiki fill:#fce4ec,color:#333
+    style Lifecycle fill:#fff8e1,color:#333
+```
+
+---
+
+## Part 2: Multi-Platform Ingestion
+
+### How Messages Enter the System
+
+```mermaid
+flowchart LR
+    subgraph Platforms["Communication Platforms"]
+        Slack["Slack<br/>slack-sdk (Python)"]
+        Teams["Microsoft Teams<br/>MS Graph API"]
+        Discord["Discord<br/>discord.py"]
+    end
+
+    subgraph Mode1["MODE 1: Batch History (Primary)"]
+        PY["Python Adapters<br/>fetch_history()<br/>fetch_thread()"]
+    end
+
+    subgraph Mode2["MODE 2: Real-Time (Phase 2)"]
+        CS["Chat SDK Bridge<br/>(TypeScript)<br/>Webhook receiver"]
+    end
+
+    subgraph Norm["Normalized Message"]
+        NM["content: str<br/>author: AuthorInfo<br/>platform: slack|teams|discord<br/>channel_id: str<br/>timestamp: datetime<br/>thread_id: str?<br/>attachments: list<br/>reactions: list"]
+    end
+
+    Slack --> PY
+    Teams --> PY
+    Discord --> PY
+
+    Slack -.-> CS
+    Teams -.-> CS
+    Discord -.-> CS
+
+    PY --> Norm
+    CS -.->|"POST /api/ingest"| Norm
+
+    style Mode1 fill:#e8f5e9,color:#333
+    style Mode2 fill:#fff3e0,color:#333,stroke-dasharray: 5 5
+```
+
+**Mode 1 (Python Adapters)** is the primary ingestion path. Each adapter fetches message history via platform-specific APIs and normalizes to `NormalizedMessage`.
+
+**Mode 2 (Chat SDK)** is optional for real-time. The [Vercel Chat SDK](https://chat-sdk.dev/) is TypeScript-only and can receive webhooks but cannot fetch history. It runs as a separate Docker service that forwards normalized events to the Python backend.
+
+### Adapter Data Model
+
+```python
+@dataclass
+class NormalizedMessage:
+    content: str                            # Message text
+    author: AuthorInfo                      # id, name, email, role, platform
+    platform: Platform                      # slack | teams | discord
+    channel_id: str                         # Platform channel ID
+    channel_name: str                       # Human-readable name
+    message_id: str                         # Platform message ID
+    timestamp: datetime                     # When sent
+    thread_id: str | None = None            # Parent thread (if reply)
+    attachments: list[Attachment] = []      # Files: id, name, type, url
+    reactions: list[str] = []               # Emoji reactions
+    reply_count: int = 0                    # Thread reply count
+    raw_metadata: dict = {}                 # Platform-specific extras
+```
+
+---
+
+## Part 3: The Ingestion Pipeline (How Memories Are Created)
+
+Every `NormalizedMessage` passes through 6 stages. Stages 2 and 3 use LLM. The pipeline writes to **both** Weaviate and Neo4j.
+
+```
+NormalizedMessage
+     │
+     ▼
+┌─────────────────────────────────────────────────────────────────────┐
+│  STAGE 1: PREPROCESS                                                │
+│                                                                      │
+│  • Detect modality: text, image, PDF, video, audio                  │
+│  • Parse attachments:                                                │
+│    - Images → Gemini Vision analysis                                │
+│    - PDFs → page-by-page image conversion + analysis                │
+│    - Videos → key frame extraction + transcription                  │
+│  • Assemble thread context (if reply, include parent + siblings)    │
+│  • Resolve user identity across platforms                            │
+│                                                                      │
+│  Input:  NormalizedMessage                                           │
+│  Output: PreprocessedContent (text + modality metadata)              │
+│  Cost:   ~$0 (no LLM for text; Gemini Vision for images/PDFs)      │
+└──────────────────────┬──────────────────────────────────────────────┘
+                       ▼
+┌─────────────────────────────────────────────────────────────────────┐
+│  STAGE 2: EXTRACT FACTS + QUALITY GATE                              │
+│                                                                      │
+│  LLM (Gemini Flash Lite — $0.30/1M tokens):                        │
+│  "Extract the 1-2 MOST IMPORTANT facts from this message.           │
+│   Each fact must be self-contained and specific."                    │
+│                                                                      │
+│  Quality Gate:                                                       │
+│  ┌───────────────────────────────────────────────────┐              │
+│  │  For each extracted fact:                          │              │
+│  │  • Length < 40 chars?          → score -0.3       │              │
+│  │  • Contains "the user", "it was"? → score -0.2   │              │
+│  │  • Has named entity or number? → score +0.1      │              │
+│  │  • Starts with "It ", "This "? → score -0.15     │              │
+│  │                                                    │              │
+│  │  REJECT if score < 0.5                            │              │
+│  │  KEEP max 2 highest-scoring facts                 │              │
+│  └───────────────────────────────────────────────────┘              │
+│                                                                      │
+│  Input:  PreprocessedContent                                         │
+│  Output: list[ScoredFact] (max 2, each quality ≥ 0.5)              │
+│  Cost:   ~$0.001/message                                            │
+└──────────────────────┬──────────────────────────────────────────────┘
+                       ▼
+┌─────────────────────────────────────────────────────────────────────┐
+│  STAGE 3: ENTITY EXTRACTION (for Graph Memory)                      │
+│                                                                      │
+│  LLM (Gemini Flash Lite):                                           │
+│  "Extract entities and relationships from this message.              │
+│   Core types: Person, Decision, Project, Technology.                 │
+│   Extension types: Team, Meeting, Budget, Constraint, ...            │
+│   Relationships: DECIDED, WORKS_ON, BLOCKED_BY, ... (any verb)"     │
+│                                                                      │
+│  Example input:                                                      │
+│  "Alice decided to use RS256 for JWT — blocked by Carol's review"   │
+│                                                                      │
+│  Example output:                                                     │
+│  entities:                                                           │
+│    Person(Alice), Person(Carol)                                      │
+│    Decision(Use RS256), Technology(JWT)                               │
+│  relationships:                                                      │
+│    Alice ──DECIDED──▶ Use RS256                                     │
+│    Use RS256 ──USES──▶ JWT                                          │
+│    Use RS256 ──BLOCKED_BY──▶ Carol                                  │
+│  confidence: 0.85                                                    │
+│                                                                      │
+│  Dedup: Compare against existing Neo4j entities (fuzzy name match)  │
+│  Temporal: Mark as "current" or "supersedes:<old_decision>"          │
+│                                                                      │
+│  Input:  PreprocessedContent + existing graph entities               │
+│  Output: list[Entity], list[Relationship]                            │
+│  Cost:   ~$0.001/message                                            │
+└──────────────────────┬──────────────────────────────────────────────┘
+                       ▼
+┌─────────────────────────────────────────────────────────────────────┐
+│  STAGE 4: CLASSIFY + TAG                                            │
+│                                                                      │
+│  LLM (Gemini Flash Lite):                                           │
+│  • topic_tags: ["authentication", "security"]                       │
+│  • entity_tags: ["Alice", "JWT", "RS256"]                           │
+│  • action_tags: ["decision", "blocker"]                             │
+│  • importance: "high" | "medium" | "low"                            │
+│                                                                      │
+│  Input:  ScoredFacts + Entities                                      │
+│  Output: TaggedFacts                                                 │
+│  Cost:   ~$0.0005/message (can batch with Stage 2)                  │
+└──────────────────────┬──────────────────────────────────────────────┘
+                       ▼
+┌─────────────────────────────────────────────────────────────────────┐
+│  STAGE 5: EMBED                                                     │
+│                                                                      │
+│  Jina v4 (2048-dim, multimodal unified space):                      │
+│  • text_vector: embed fact text (prefix: "Passage")                 │
+│  • image_vector: embed image content (if attachment)                │
+│  • doc_vector: embed document page (if PDF)                         │
+│                                                                      │
+│  Same embedding space → cross-modal search works:                   │
+│  text query "auth diagram" → finds image of OAuth2 flowchart        │
+│                                                                      │
+│  Input:  TaggedFacts + attachments                                   │
+│  Output: EmbeddedFacts (with named vectors)                          │
+│  Cost:   Jina API pricing                                            │
+└──────────────────────┬──────────────────────────────────────────────┘
+                       ▼
+┌─────────────────────────────────────────────────────────────────────┐
+│  STAGE 6: NOVELTY CHECK + PERSIST                                   │
+│                                                                      │
+│  Novelty detection (cosine similarity vs existing):                 │
+│  • ≥ 95% similarity → SKIP (exact duplicate)                       │
+│  • ≥ 85% similarity → REINFORCE (boost existing memory's stability)│
+│  • < 85% similarity → INSERT (novel content)                       │
+│                                                                      │
+│  WRITE TO ALL THREE STORES:                                         │
+│                                                                      │
+│  1. WEAVIATE — Atomic fact (Tier 2)                                 │
+│     memory text, vectors, tags, quality_score, citations            │
+│     graph_entity_ids → links to Neo4j entities                      │
+│                                                                      │
+│  2. NEO4J — Entities + relationships                                │
+│     MERGE entities (no duplicates)                                  │
+│     CREATE relationships with temporal properties                   │
+│     CREATE Event node with weaviate_id → links back to Weaviate    │
+│                                                                      │
+│  3. MONGODB — Sync state update                                     │
+│     message_count++, last_sync_at, processing status                │
+│                                                                      │
+│  Input:  EmbeddedFacts + Entities + Relationships                    │
+│  Output: Persisted to all stores                                     │
+│  Cost:   Database writes only                                        │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+### Cost Per Message (Total Pipeline)
+
+| Stage | LLM Model | Cost/Message |
+|-------|-----------|-------------|
+| 1. Preprocess | None (text) / Gemini Vision (media) | ~$0 / ~$0.005 |
+| 2. Extract + Quality | Gemini Flash Lite | ~$0.001 |
+| 3. Entity Extraction | Gemini Flash Lite | ~$0.001 |
+| 4. Classify + Tag | Gemini Flash Lite (batched with 2) | ~$0.0005 |
+| 5. Embed | Jina v4 | ~$0.0001 |
+| 6. Persist | None (DB writes) | ~$0 |
+| **Total (text message)** | | **~$0.0025** |
+| **Total (with media)** | | **~$0.008** |
+| **10K messages bulk sync** | | **~$25** |
+
+---
+
+## Part 4: Semantic Memory — Weaviate (3-Tier Hierarchy)
+
+### How Semantic Memory Is Structured
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│              SEMANTIC MEMORY: WEAVIATE                                │
+│                                                                      │
+│  Purpose: Store and search FACTS — "what was said"                  │
+│  Strength: BM25 + vector hybrid search, multimodal, fast            │
+│  Query types: factual, topical, overview, cross-modal               │
+│                                                                      │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 0: Channel Summary                                      │  │
+│  │                                                                │  │
+│  │  "The #backend channel focuses on authentication, database     │  │
+│  │   migration, and deployment. Key themes include JWT adoption,  │  │
+│  │   Kubernetes migration, and API design standards."             │  │
+│  │                                                                │  │
+│  │  Created by: Consolidation service (scheduled)                │  │
+│  │  Updated: Weekly or after significant new content              │  │
+│  │  Used for: Wiki overview, "what's happening?" queries          │  │
+│  │  Access cost: FREE (cached, no search needed)                  │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                                                                      │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 1: Topic Clusters (FIXED in v2)                         │  │
+│  │                                                                │  │
+│  │  "authentication" cluster:                                     │  │
+│  │    summary: "Team discussed JWT with RS256..."                │  │
+│  │    member_ids: [uuid1, uuid2, ..., uuid23]  ← ACTUALLY LINKED│  │
+│  │    topic_tags: ["authentication"]                              │  │
+│  │                                                                │  │
+│  │  "deployment" cluster:                                         │  │
+│  │    summary: "Kubernetes migration using ArgoCD..."            │  │
+│  │    member_ids: [uuid30, uuid31, ..., uuid45]                  │  │
+│  │    topic_tags: ["deployment", "infrastructure"]                │  │
+│  │                                                                │  │
+│  │  Created by: Consolidation service                             │  │
+│  │  v2 fix: _link_memories_to_cluster() writes cluster_id        │  │
+│  │  v2 fix: Existing cluster lookup prevents duplicates          │  │
+│  │  Used for: Topic-level wiki sections, topic scoping            │  │
+│  │  Access cost: FREE (cached, no LLM)                            │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                                                                      │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 2: Atomic Facts                                         │  │
+│  │                                                                │  │
+│  │  {                                                             │  │
+│  │    memory: "Alice decided to use RS256 algorithm for JWT",    │  │
+│  │    quality_score: 0.85,                                        │  │
+│  │    topic_tags: ["authentication"],                             │  │
+│  │    action_tags: ["decision"],                                  │  │
+│  │    importance: "high",                                         │  │
+│  │    cluster_id: "uuid-cluster-auth",  ← linked to Tier 1      │  │
+│  │    graph_entity_ids: ["neo4j-alice", "neo4j-rs256-decision"], │  │
+│  │    user_name: "Alice",                                         │  │
+│  │    message_ts: "1711234567.000100",                            │  │
+│  │    valid_at: "2026-03-20",                                     │  │
+│  │    text_vector: [0.12, -0.34, ...],  ← 2048-dim Jina v4     │  │
+│  │    image_vector: null,                                         │  │
+│  │    doc_vector: null,                                           │  │
+│  │  }                                                             │  │
+│  │                                                                │  │
+│  │  Used for: Detailed search, cross-modal, citation retrieval   │  │
+│  │  Access cost: ~$0.001 (embedding search)                       │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+### How Weaviate Searches Work
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                HYBRID SEARCH (BM25 + Vector)                         │
+│                                                                      │
+│  Query: "JWT authentication"                                        │
+│                                                                      │
+│  ┌──────────────────┐      ┌──────────────────┐                    │
+│  │  BM25 (keyword)  │      │  Vector (semantic)│                    │
+│  │                  │      │                  │                    │
+│  │  Exact match:    │      │  Meaning match:  │                    │
+│  │  "JWT" → score 5 │      │  "auth flow" →   │                    │
+│  │  "auth" → score 3│      │  score 0.87      │                    │
+│  └────────┬─────────┘      └────────┬─────────┘                    │
+│           │                         │                               │
+│           └──────────┬──────────────┘                               │
+│                      ▼                                               │
+│           ┌──────────────────┐                                      │
+│           │  Adaptive Alpha  │                                      │
+│           │                  │                                      │
+│           │  Short query     │                                      │
+│           │  → alpha=0.2     │  (favor BM25 for keywords)          │
+│           │  Medium query    │                                      │
+│           │  → alpha=0.5     │  (balanced)                          │
+│           │  Long query      │                                      │
+│           │  → alpha=0.7     │  (favor vector for meaning)         │
+│           └────────┬─────────┘                                      │
+│                    ▼                                                 │
+│           ┌──────────────────┐                                      │
+│           │  Fused Results   │                                      │
+│           │  Ranked by       │                                      │
+│           │  combined score  │                                      │
+│           └──────────────────┘                                      │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Part 5: Graph Memory — Neo4j (Flexible Knowledge Graph)
+
+### How Graph Memory Is Structured
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│              GRAPH MEMORY: Neo4j                                     │
+│                                                                      │
+│  Purpose: Capture RELATIONSHIPS — "who did what, when, why"         │
+│  Strength: Multi-hop traversal, temporal chains, flexible schema    │
+│  Query types: relational, temporal, cross-channel entity lookup     │
+│                                                                      │
+│  EXAMPLE GRAPH (from a #backend channel):                           │
+│                                                                      │
+│                    ┌─────────┐                                      │
+│         ┌─DECIDED─▶│Use RS256│◀─SUPERSEDES─┐                       │
+│         │          │(Decision│              │                       │
+│         │          │ active) │         ┌────┴────┐                  │
+│    ┌────┴───┐      └────┬────┘         │Use HS256│                  │
+│    │ Alice  │           │              │(Decision│                  │
+│    │(Person)│      USES─┘              │superseded│                 │
+│    │lead,eng│           │              └─────────┘                  │
+│    └────┬───┘      ┌────▼────┐                                     │
+│         │          │  JWT    │                                      │
+│   WORKS_ON         │(Technol)│                                      │
+│         │          └─────────┘                                      │
+│    ┌────▼────┐                                                      │
+│    │Auth Svc │◀──BLOCKED_BY──┐                                     │
+│    │(Project)│               │                                     │
+│    │in_progr │          ┌────┴───┐                                  │
+│    └─────────┘          │ Carol  │                                  │
+│                         │(Person)│                                  │
+│                         │security│                                  │
+│                         └────────┘                                  │
+│                                                                      │
+│  Every entity links to Weaviate via Event nodes:                    │
+│  Alice ──MENTIONED_IN──▶ Event{weaviate_id: uuid-abc-123}          │
+│  This enables: graph traversal → find entity → get source text     │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+### Guided-Flexible Schema
+
+The LLM is guided toward **core entity types** but can create **any extension type**:
+
+```
+┌──────────────────────────────────────────────────────────────────┐
+│  CORE TYPES (LLM prefers these — well-defined properties):       │
+│                                                                    │
+│  Person:     name, role, team, email, platform                    │
+│  Decision:   summary, status, rationale, date                     │
+│  Project:    name, status, description                            │
+│  Technology: name, category                                       │
+│                                                                    │
+│  EXTENSION TYPES (LLM creates as needed):                        │
+│                                                                    │
+│  Team, Meeting, Artifact, Constraint, Budget, Deadline,          │
+│  Document, Sprint, Environment, Service, ...                     │
+│  (any type that captures the conversation's meaning)             │
+│                                                                    │
+│  ALL RELATIONSHIPS ARE FLEXIBLE:                                  │
+│                                                                    │
+│  DECIDED, WORKS_ON, MEMBER_OF, OWNS, BLOCKED_BY,                │
+│  SUPERSEDES, DEPENDS_ON, USES, APPROVED, POSTPONED,             │
+│  REVIEWED, ASSIGNED_TO, ...                                       │
+│  (LLM extracts whatever verb phrase fits)                        │
+│                                                                    │
+│  TEMPORAL PROPERTIES (on all relationships):                      │
+│  valid_from, valid_until, created_at, confidence                 │
+│                                                                    │
+│  EPISODIC LINK (connects graph to Weaviate):                     │
+│  Event node with weaviate_id → source fact in Weaviate           │
+└──────────────────────────────────────────────────────────────────┘
+```
+
+### Temporal Evolution (SUPERSEDES Chains)
+
+```
+Time ──────────────────────────────────────────────────────▶
+
+Feb 1                  Mar 5                  Mar 20
+┌─────────────┐       ┌─────────────┐       ┌─────────────┐
+│ Use sessions │──────▶│ Use HS256   │──────▶│ Use RS256   │
+│ (Decision)   │SUPER- │ (Decision)   │SUPER- │ (Decision)   │
+│              │SEDES  │              │SEDES  │              │
+│ valid_from:  │       │ valid_from:  │       │ valid_from:  │
+│   Feb 1      │       │   Mar 5      │       │   Mar 20     │
+│ valid_until: │       │ valid_until: │       │ valid_until: │
+│   Mar 5      │       │   Mar 20     │       │   null (curr)│
+└─────────────┘       └─────────────┘       └─────────────┘
+
+Query: "How did the auth approach evolve?"
+→ Traverse SUPERSEDES chain → returns full timeline with
+  source citations from Weaviate via episodic links
+```
+
+---
+
+## Part 6: The Smart Query Router
+
+### How Queries Are Understood and Routed
+
+```mermaid
+flowchart TB
+    Q["User Query"] --> QD["Query Decomposer<br/>(simple → pass-through<br/>complex → parallel sub-queries)"]
+
+    QD --> QU["Query Understanding<br/>(LLM Flash Lite ~$0.001)"]
+    QD --> |"external_queries"| EXT["External Search<br/>(Tavily API)"]
+
+    QU --> |"route=semantic<br/>conf > 0.7"| S1["SYSTEM-1<br/>Semantic Retrieval<br/>(Weaviate 3-tier)"]
+
+    QU --> |"route=graph<br/>conf > 0.7"| S2["SYSTEM-2<br/>Graph Retrieval<br/>(Neo4j + Weaviate)"]
+
+    QU --> |"route=both<br/>OR conf ≤ 0.7"| BOTH["PARALLEL<br/>System-1 AND System-2"]
+
+    BOTH --> S1
+    BOTH --> S2
+
+    S1 --> MERGE["Result Merger<br/>Dedup + rank + decay"]
+    S2 --> MERGE
+    EXT --> MERGE
+
+    MERGE --> RESP["Response Generator<br/>(Gemini Flash)<br/>Grounded answer + citations"]
+
+    style S1 fill:#e8f5e9,color:#333
+    style S2 fill:#e1f5fe,color:#333
+    style BOTH fill:#fff8e1,color:#333
+    style EXT fill:#fff3e0,color:#333
+    style QD fill:#fce4ec,color:#333
+```
+
+### Routing Decision Table
+
+| Query | Route | System | Cost | Latency |
+|-------|-------|--------|------|---------|
+| "Show me the overview" | Semantic (Tier 0) | System-1 | FREE | ~50ms |
+| "Tell me about authentication" | Semantic (Tier 1) | System-1 | FREE | ~50ms |
+| "What did Alice say about caching?" | Semantic (Tier 2) | System-1 | ~$0.001 | ~200ms |
+| "Find the architecture diagram" | Semantic (cross-modal) | System-1 | ~$0.001 | ~200ms |
+| "Who decided to use JWT?" | Graph | System-2 | ~$0.005 | ~500ms |
+| "What is Alice working on?" | Graph | System-2 | ~$0.005 | ~500ms |
+| "How did the auth approach evolve?" | Graph (temporal) | System-2 | ~$0.005 | ~500ms |
+| "What blocks the migration project?" | Graph | System-2 | ~$0.005 | ~500ms |
+| "Tell me about the JWT migration" | Both (parallel) | System-1 + 2 | ~$0.006 | ~500ms |
+| "How does our auth compare to best practices?" | Decomposed | Internal + External (Tavily) | ~$0.01 | ~1.5s |
+| "What auth method did we pick and is it OWASP-compliant?" | Decomposed | System-1 + Tavily parallel | ~$0.01 | ~1.5s |
+
+### Average Query Cost
+
+```
+80% semantic queries × $0.001  = $0.0008
+20% graph/both queries × $0.005 = $0.001
+─────────────────────────────────────────
+Average per query:               ~$0.002  (+ LLM synthesis ~$0.02 if needed)
+
+With wiki-first (50% of reads are wiki):
+Effective average:               ~$0.001/query
+```
+
+---
+
+## Part 7: System-1 — Semantic Retrieval (Detailed)
+
+**What it answers:** "What was said/discussed/written about X?"
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│  SYSTEM-1: SEMANTIC RETRIEVAL FLOW                                   │
+│                                                                      │
+│  Query: "What was discussed about authentication?"                  │
+│  Classified: route=semantic, depth=topic, topics=["authentication"]  │
+│                                                                      │
+│  STEP 1: TIER ROUTING                                               │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  depth=overview → search Tier 0 summaries       │               │
+│  │  depth=topic   → search Tier 1 clusters (*)     │               │
+│  │  depth=detail  → search Tier 2 atomics          │               │
+│  └─────────────────────┬───────────────────────────┘               │
+│                        │ (*) topic depth selected                   │
+│                        ▼                                            │
+│  STEP 2: TWO-STAGE TOPIC-FIRST RETRIEVAL                          │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  Stage 1 (coarse): Search Tier 1 clusters       │               │
+│  │    hybrid_search(tier=cluster,                   │               │
+│  │                  topic_filter=["authentication"],│               │
+│  │                  alpha=None)  ← adaptive         │               │
+│  │    → "authentication" cluster                    │               │
+│  │      summary: "Team discussed JWT..."            │               │
+│  │      member_ids: [uuid1, uuid2, ..., uuid23]    │               │
+│  │                                                   │               │
+│  │  Stage 2 (fine): Search Tier 2 WITHIN cluster   │               │
+│  │    hybrid_search(tier=atomic,                    │               │
+│  │                  id_filter=member_ids,           │               │
+│  │                  alpha=None)                      │               │
+│  │    → Searches 23 memories (not 10,000+)          │               │
+│  └─────────────────────┬───────────────────────────┘               │
+│                        ▼                                            │
+│  STEP 3: BIDIRECTIONAL EXPANSION (if results weak)                 │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  if max_score < 0.6 or avg_score < 0.4:        │               │
+│  │    expand UP → also search Tier 0 summaries     │               │
+│  │    expand DOWN → also search broader atomics    │               │
+│  │    merge_and_rerank(all results)                │               │
+│  └─────────────────────┬───────────────────────────┘               │
+│                        ▼                                            │
+│  STEP 4: POST-PROCESSING                                           │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  1. Temporal decay: score *= e^(-days/30)       │               │
+│  │  2. Quality boost: score *= (0.7 + 0.3*quality) │               │
+│  │  3. Semantic dedup: Jaccard > 0.85 → keep more  │               │
+│  │     specific one                                 │               │
+│  │  4. Return top N results with citations          │               │
+│  └─────────────────────────────────────────────────┘               │
+│                                                                      │
+│  Output: Ranked list of facts with Slack permalink citations        │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Part 8: System-2 — Graph Retrieval (Detailed)
+
+**What it answers:** "Who decided X? What blocks Y? How did Z evolve?"
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│  SYSTEM-2: GRAPH RETRIEVAL FLOW                                      │
+│                                                                      │
+│  Query: "Who decided to use JWT and what was the rationale?"        │
+│  Classified: route=graph, entities=["JWT"], temporal=any             │
+│                                                                      │
+│  STEP 1: ENTITY RESOLUTION                                          │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  Fuzzy match "JWT" against Neo4j nodes:          │               │
+│  │  MATCH (n) WHERE n.name =~ '(?i).*jwt.*'        │               │
+│  │  → Found: Technology{name: "JWT"}                │               │
+│  └─────────────────────┬───────────────────────────┘               │
+│                        ▼                                            │
+│  STEP 2: MULTI-HOP GRAPH TRAVERSAL                                 │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  MATCH (d:Decision)-[:USES]->(t:Technology)     │               │
+│  │  WHERE t.name = "JWT"                            │               │
+│  │  MATCH (p:Person)-[:DECIDED]->(d)                │               │
+│  │  OPTIONAL MATCH (d)-[:AFFECTS]->(proj:Project)  │               │
+│  │  RETURN p, d, proj                               │               │
+│  │                                                   │               │
+│  │  Result graph:                                    │               │
+│  │  Alice ──DECIDED──▶ "Use RS256 for JWT"          │               │
+│  │    Decision ──USES──▶ JWT                         │               │
+│  │    Decision ──AFFECTS──▶ Auth Service             │               │
+│  └─────────────────────┬───────────────────────────┘               │
+│                        ▼                                            │
+│  STEP 3: EPISODIC ENRICHMENT (Graph → Weaviate)                    │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  For each entity/relationship found:             │               │
+│  │  1. Follow MENTIONED_IN edge to Event node      │               │
+│  │  2. Get Event.weaviate_id                        │               │
+│  │  3. Fetch from Weaviate:                         │               │
+│  │     → Full memory text                           │               │
+│  │     → Slack permalink (citation)                 │               │
+│  │     → Original timestamp, author                 │               │
+│  │                                                   │               │
+│  │  This is what makes System-2 more than just      │               │
+│  │  a graph query — it returns GROUNDED answers     │               │
+│  │  with source citations, not just entity names.   │               │
+│  └─────────────────────┬───────────────────────────┘               │
+│                        ▼                                            │
+│  STEP 4: TEMPORAL CHAIN (if temporal query)                        │
+│  ┌─────────────────────────────────────────────────┐               │
+│  │  For "How did X evolve?" queries:                │               │
+│  │  MATCH path = (d:Decision)-[:SUPERSEDES*0..10]  │               │
+│  │    ->(older:Decision)                             │               │
+│  │  Returns timeline:                                │               │
+│  │    Mar 20: "Use RS256" (active) ← current        │               │
+│  │    Mar 5: "Use HS256" (superseded)               │               │
+│  │    Feb 1: "Use sessions" (superseded)            │               │
+│  │  Each with source citations from Weaviate        │               │
+│  └─────────────────────────────────────────────────┘               │
+│                                                                      │
+│  Output: Graph paths + source memories + citations                  │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Part 9: How Memories Are Updated (Consolidation + Evolution)
+
+### Scheduled Consolidation
+
+```mermaid
+flowchart TB
+    subgraph Trigger["⏰ TRIGGERS"]
+        T1["After sync<br/>(new content)"]
+        T2["Daily schedule<br/>(2 AM UTC)"]
+        T3["Weekly schedule<br/>(Sunday 3 AM)"]
+        T4["Manual<br/>(consolidate_channel)"]
+    end
+
+    subgraph ClusterUpdate["📦 CLUSTER CONSOLIDATION"]
+        C1["Get unclustered<br/>Tier 2 atomics<br/>(cluster_id = null)"]
+        C2["Group by<br/>topic_tags"]
+        C3{"Existing cluster<br/>for topic?"}
+        C4["UPDATE cluster<br/>summary + add<br/>new member_ids"]
+        C5["CREATE new cluster<br/>summary + member_ids"]
+        C6["WRITE cluster_id<br/>to each atomic<br/>(FIXED in v2!)"]
+
+        C1 --> C2 --> C3
+        C3 -->|"Yes"| C4 --> C6
+        C3 -->|"No"| C5 --> C6
+    end
+
+    subgraph SummaryUpdate["📄 SUMMARY UPDATE"]
+        S1["Read all Tier 1<br/>clusters for channel"]
+        S2["Read recent<br/>Tier 2 atomics"]
+        S3["LLM generates<br/>channel overview"]
+        S4["Upsert Tier 0<br/>summary"]
+
+        S1 --> S3
+        S2 --> S3
+        S3 --> S4
+    end
+
+    subgraph WikiRefresh["📖 WIKI REFRESH"]
+        W1["Read Tier 0<br/>(overview)"]
+        W2["Read Tier 1<br/>(topics)"]
+        W3["Query Neo4j<br/>(people, decisions)"]
+        W4["Generate wiki<br/>markdown"]
+        W5["Cache in MongoDB"]
+
+        W1 --> W4
+        W2 --> W4
+        W3 --> W4
+        W4 --> W5
+    end
+
+    Trigger --> ClusterUpdate --> SummaryUpdate --> WikiRefresh
+
+    style Trigger fill:#fff8e1,color:#333
+    style ClusterUpdate fill:#e8f5e9,color:#333
+    style SummaryUpdate fill:#e3f2fd,color:#333
+    style WikiRefresh fill:#fce4ec,color:#333
+```
+
+### Contradiction Detection + Temporal Supersession
+
+```
+NEW FACT ARRIVES:
+  "The team decided to use RS256 for JWT signing"
+
+CONTRADICTION CHECK:
+  1. Search existing facts with high similarity
+     → Found: "The team chose HS256 algorithm for JWT signing"
+     → Similarity: 0.82 (high but different key detail)
+
+  2. LLM comparison:
+     "Are these contradictory?"
+     → Yes: HS256 vs RS256 for the same purpose
+
+  3. Actions:
+     IN WEAVIATE:
+       Old fact: set invalid_at = now()
+       New fact: set valid_at = now()
+
+     IN NEO4J:
+       Old Decision(Use HS256): set valid_until = now()
+       New Decision(Use RS256): set valid_from = now()
+       CREATE (new)-[:SUPERSEDES]->(old)
+```
+
+---
+
+## Part 10: Wiki System (FREE Reads)
+
+### Wiki Content Structure
+
+```
+┌─────────────────────────────────────────────────────────────────────────┐
+│  📖 WIKI: #backend-engineering                                           │
+├─────────────────────────────────────────────────────────────────────────┤
+│                                                                          │
+│  📄 OVERVIEW (from Weaviate Tier 0 — FREE)                              │
+│  ├── "Our backend team focuses on authentication, database              │
+│  │    migration, and API design. Key recent activity: JWT               │
+│  │    adoption with RS256, Kubernetes migration in progress."           │
+│  └── Updated: 2026-03-24                                                │
+│                                                                          │
+│  📁 TOPICS (from Weaviate Tier 1 — FREE)                                │
+│  ├── 🔐 Authentication (23 memories)                                     │
+│  │   └── "OAuth2 + JWT with RS256, migrated from sessions"              │
+│  ├── 🗄️ Database (31 memories)                                          │
+│  │   └── "PostgreSQL + Redis, considering CockroachDB"                  │
+│  └── 🚀 Infrastructure (15 memories)                                     │
+│      └── "AWS EKS, Terraform, ArgoCD"                                   │
+│                                                                          │
+│  👥 PEOPLE (from Neo4j graph — ~$0.001)                                 │
+│  ├── Alice (Lead): auth, API — decided JWT migration                    │
+│  ├── Bob (SRE): infra, K8s — decided GKE adoption                      │
+│  └── Carol (Security): security review — blocking auth project          │
+│                                                                          │
+│  📋 DECISIONS (from Neo4j graph — ~$0.001)                              │
+│  ├── Mar 20: "Use RS256 for JWT" by Alice (active) ← supersedes HS256 │
+│  ├── Mar 15: "Adopt GKE" by Bob (active)                                │
+│  └── Mar 5: "Use HS256 for JWT" by Alice (superseded)                   │
+│                                                                          │
+│  📅 RECENT ACTIVITY (from Weaviate recent atomics — ~$0.001)           │
+│  ├── Today: Carol's security review blocking JWT migration              │
+│  ├── Yesterday: Bob started K8s namespace setup                         │
+│  └── 3 days ago: Alice proposed RS256 to replace HS256                  │
+│                                                                          │
+└─────────────────────────────────────────────────────────────────────────┘
+```
+
+**Cost:** Overview + Topics = FREE. People + Decisions + Recent = ~$0.003. Total wiki generation: ~$0.01 (LLM for synthesis). Wiki is cached in MongoDB — subsequent reads are FREE until next refresh.
+
+---
+
+## Part 11: Technology Stack
+
+```mermaid
+flowchart TB
+    subgraph UI["👤 INTERFACE LAYER"]
+        UI1["Web UI<br/>(React + Vite)"]
+        UI2["REST API<br/>(FastAPI)"]
+        UI3["MCP Server<br/>(FastMCP)"]
+    end
+
+    subgraph App["⚙️ APPLICATION LAYER"]
+        A1["Ingestion Service<br/>Adapters + Pipeline"]
+        A0["Query Decomposer<br/>Parallel sub-queries"]
+        A2["Query Router<br/>Understanding + Routing"]
+        A3["Semantic Retriever<br/>Weaviate 3-tier"]
+        A4["Graph Retriever<br/>Neo4j traversal"]
+        A7["External Search<br/>Tavily API"]
+        A5["Wiki Service<br/>Generation + Cache"]
+        A6["Consolidation Service<br/>Clusters + Summaries"]
+    end
+
+    subgraph AI["🧠 AI LAYER"]
+        ML1["Gemini Flash Lite<br/>Extraction, tagging<br/>$0.30/1M tokens"]
+        ML2["Gemini Flash<br/>Response generation<br/>$0.60/1M tokens"]
+        ML3["Jina v4<br/>Multimodal embeddings<br/>2048-dim unified"]
+    end
+
+    subgraph Data["💾 DATA LAYER"]
+        D1["Weaviate<br/>Semantic Memory<br/>3-tier hierarchy<br/>BM25 + HNSW vectors"]
+        D2["Neo4j<br/>Graph Memory<br/>Flexible entities<br/>Cypher traversal"]
+        D3["MongoDB<br/>State + Cache<br/>Sync, wiki, logs"]
+    end
+
+    subgraph Infra["🏗️ INFRASTRUCTURE"]
+        I1["Docker Compose<br/>(5 services)"]
+    end
+
+    UI --> App
+    App --> AI
+    App --> Data
+    Data --> Infra
+
+    style UI fill:#e3f2fd,color:#333
+    style App fill:#f3e5f5,color:#333
+    style AI fill:#fff3e0,color:#333
+    style Data fill:#e8f5e9,color:#333
+    style Infra fill:#eceff1,color:#333
+```
+
+### Technology Decision Matrix
+
+| Component | Choice | Why |
+|-----------|--------|-----|
+| **Semantic Memory** | Weaviate | Named vectors (multimodal), built-in BM25, hybrid search, production-ready |
+| **Graph Memory** | Neo4j | Native multi-hop traversal, Cypher query language, flexible schema, APOC extensions |
+| **State/Cache** | MongoDB | Flexible schema, async via Motor, wiki cache + sync state + quality logs |
+| **Embeddings** | Jina v4 | 2048-dim unified multimodal space (text + image + doc in same space) |
+| **LLM (cheap)** | Gemini Flash Lite | $0.30/1M tokens — extraction, tagging, query understanding |
+| **LLM (quality)** | Gemini Flash | $0.60/1M tokens — response generation, complex synthesis |
+| **External Search** | Tavily | AI-optimized web search, doc extraction, 1K free credits/mo |
+| **Backend** | FastAPI | Async-first, MCP support via FastMCP, Python ecosystem |
+| **Frontend** | React + Vite | Fast dev, component ecosystem, Tailwind CSS |
+
+---
+
+## Part 12: Deployment Architecture
+
+```yaml
+# docker-compose.yml — 5 services on single VM
+services:
+  beever-atlas:          # Python/FastAPI app (MCP + REST + pipeline)
+    ports: ["8000:8000"]
+    depends_on: [weaviate, neo4j, mongodb]
+
+  web:                   # React frontend
+    ports: ["3000:80"]
+
+  weaviate:              # Semantic memory (3-tier)
+    image: weaviate:1.28.0
+    ports: ["8080:8080", "50051:50051"]
+
+  neo4j:                 # Graph memory (flexible)
+    image: neo4j:5.26-community
+    ports: ["7474:7474", "7687:7687"]
+
+  mongodb:               # State + wiki cache
+    image: mongo:7.0
+    ports: ["27017:27017"]
+```
+
+---
+
+## Part 13: Module Structure
+
+```
+src/beever_atlas/
+├── main.py                          # FastMCP + FastAPI entry
+├── config.py                        # All settings
+│
+├── adapters/                        # Platform ingestion
+│   ├── base.py                      #   NormalizedMessage model
+│   ├── slack_adapter.py             #   Slack (slack-sdk)
+│   ├── teams_adapter.py             #   Teams (MS Graph API)
+│   └── discord_adapter.py           #   Discord (discord.py)
+│
+├── pipeline/                        # 6-stage ingestion
+│   ├── preprocessor.py              #   Stage 1: modality + threads
+│   ├── extractor.py                 #   Stage 2: facts + quality gate
+│   ├── entity_extractor.py          #   Stage 3: entities → Neo4j
+│   ├── classifier.py                #   Stage 4: tagging
+│   ├── embedder.py                  #   Stage 5: Jina v4
+│   └── persister.py                 #   Stage 6: write all stores
+│
+├── stores/                          # Data access
+│   ├── weaviate_store.py            #   Semantic memory (3-tier)
+│   ├── neo4j_store.py               #   Graph memory (flexible)
+│   └── mongo_store.py               #   State + wiki cache
+│
+├── retrieval/                       # Query system
+│   ├── query_decomposer.py         #   Complex Q → parallel sub-queries
+│   ├── query_router.py              #   LLM understanding + routing
+│   ├── semantic_retriever.py        #   System-1 (Weaviate)
+│   ├── graph_retriever.py           #   System-2 (Neo4j + enrichment)
+│   ├── external_search.py           #   Tavily web search (from v1)
+│   ├── result_merger.py             #   Merge + dedup + rank
+│   ├── temporal.py                  #   Ebbinghaus decay
+│   ├── consolidation.py             #   Cluster building (FIXED)
+│   └── response_generator.py        #   Grounded answer + citations
+│
+├── wiki/                            # Wiki generation
+│   ├── wiki_builder.py              #   Weaviate + Neo4j → markdown
+│   └── wiki_cache.py                #   MongoDB cache
+│
+└── server/                          # External interfaces
+    ├── tools.py                     #   MCP tools
+    ├── resources.py                 #   MCP resources (wiki://)
+    └── api_routes.py                #   REST API for frontend
+```
+
+---
+
+## Part 14: Competitive Feature Matrix (Updated for v2)
+
+| Feature | memU | Mem0 | MemOS | Zep/Graphiti | **Beever Atlas v2** |
+|---------|------|------|-------|--------------|---------------------|
+| **Wiki-First (FREE reads)** | No | No | No | No | **Yes** |
+| **Dual Memory (semantic+graph)** | No | Partial (Mem0g) | No | Partial | **Yes (Weaviate+Neo4j)** |
+| **Flexible Entity Types** | No | Fixed | No | Fixed | **Yes (guided-flexible)** |
+| **Cross-Modal Search** | Separate spaces | No | No | No | **Yes (unified Jina v4)** |
+| **Multi-Platform** | No | No | No | No | **Yes (Slack+Teams+Discord)** |
+| **Ebbinghaus Forgetting** | No | No | No | No | **Yes (applied to ranking)** |
+| **Bi-Temporal Model** | No | No | No | Yes | **Yes** |
+| **Temporal Supersession** | No | No | No | Partial | **Yes (SUPERSEDES chains)** |
+| **Quality-Gated Ingestion** | No | No | No | No | **Yes (reject < 0.5)** |
+| **Smart Query Routing** | No | No | No | No | **Yes (semantic/graph/both)** |
+| **Graph Relationships** | Category | Basic | Rich | Rich | **Rich + flexible** |
+
+---
+
+## Quick Reference: When to Use What
+
+| User Intent | System | Path | Cost | Latency |
+|-------------|--------|------|------|---------|
+| "Show me the overview" | System-1 | Wiki → Tier 0 | FREE | ~50ms |
+| "What topics do we have?" | System-1 | Wiki → Tier 1 list | FREE | ~50ms |
+| "Tell me about authentication" | System-1 | Wiki → Tier 1 detail | FREE | ~50ms |
+| "Find messages about Redis" | System-1 | Weaviate hybrid search | ~$0.001 | ~200ms |
+| "Find the architecture diagram" | System-1 | Weaviate cross-modal | ~$0.001 | ~200ms |
+| "Who decided to use JWT?" | System-2 | Neo4j traversal | ~$0.005 | ~500ms |
+| "What is Alice working on?" | System-2 | Neo4j traversal | ~$0.005 | ~500ms |
+| "How did auth evolve?" | System-2 | Neo4j temporal chain | ~$0.005 | ~500ms |
+| "What blocks the migration?" | System-2 | Neo4j traversal | ~$0.005 | ~500ms |
+| "Tell me about JWT migration" | Both | Parallel → merge | ~$0.006 | ~500ms |
+| "Why did we choose PostgreSQL?" | Both | Parallel → LLM synth | ~$0.025 | ~2s |
+| "How does our auth compare to OWASP?" | Decomposed | Internal + Tavily | ~$0.01 | ~1.5s |
+| "What auth did we pick and is it secure?" | Decomposed | System-1 + Tavily | ~$0.01 | ~1.5s |
+
+---
+
+## Glossary
+
+| Term | Definition |
+|------|------------|
+| **Semantic Memory** | Weaviate-based memory for facts, topics, and content search (BM25 + vector) |
+| **Graph Memory** | Neo4j-based memory for entity relationships and temporal evolution |
+| **Atomic Fact** | Single unit of knowledge in Weaviate with embeddings and metadata (Tier 2) |
+| **Topic Cluster** | Group of related atomic facts with summary (Weaviate Tier 1) |
+| **Channel Summary** | High-level overview of a channel (Weaviate Tier 0) |
+| **System-1** | Fast semantic retrieval path via Weaviate hybrid search |
+| **System-2** | Deep relational retrieval path via Neo4j graph traversal + Weaviate enrichment |
+| **Smart Router** | LLM-powered query understanding that routes to System-1, System-2, or both |
+| **Episodic Link** | Neo4j Event node that connects a graph entity to its source fact in Weaviate |
+| **SUPERSEDES** | Neo4j relationship indicating a new decision replaced an old one |
+| **Quality Gate** | Score-based filter that rejects vague/low-quality facts at extraction time |
+| **Temporal Decay** | Ebbinghaus curve: R(t) = e^(-t/S) — old facts rank lower unless reinforced |
+| **Bi-Temporal** | Tracking both event time (when it happened) and ingestion time (when recorded) |
+| **Guided-Flexible** | Schema with core types (Person, Decision...) + LLM-created extensions |
+| **Wiki-First** | Pattern where cached summaries serve 80% of reads for FREE |
+| **Cross-Modal Search** | Text query finding images/PDFs via unified embedding space |
+| **NormalizedMessage** | Platform-agnostic message model for multi-platform ingestion |
+| **Consolidation** | Background service that builds topic clusters and channel summaries |
+| **Query Decomposition** | Breaking complex questions into focused parallel sub-queries (internal + external) |
+| **External Search** | Tavily-powered web search for best practices, docs, and industry comparisons |
+
+---
+
+## Part 15: Resilience & Operations
+
+### Degradation Matrix
+
+| Component Down | Ingestion | Retrieval | Behavior |
+|----------------|-----------|-----------|----------|
+| **Neo4j** | Stage 3 skipped; facts in Weaviate only | `route=graph` → reclassify as semantic | Wiki People/Decisions: "temporarily unavailable" |
+| **Gemini** | Messages queued in dead letter queue | Regex classifier fallback; cached wiki only | Alert; retry on recovery |
+| **Jina** | Embeddings queued; text-only in Weaviate | Existing embeddings work; BM25-only for new | Backfill on recovery |
+| **Tavily** | No impact | Drop external sub-queries | "External search unavailable" note |
+| **Weaviate** | Ingestion paused (queue in MongoDB) | Cached wiki; graph-only for relational | Critical alert |
+| **MongoDB** | System paused | Read-only from Weaviate/Neo4j | Critical alert |
+
+### Entity Scoping Strategy
+
+Global entities (Person, Technology, Project, Team) are MERGED by name only — the same node spans all channels. Channel-scoped entities (Decision, Meeting, Artifact) are MERGED by name + channel.
+
+| Entity Type | Scope | MERGE Key | Cross-Channel? |
+|-------------|-------|-----------|----------------|
+| Person | Global | `{name}` | Yes — `channels: []` array tracks provenance |
+| Technology | Global | `{name}` | Yes |
+| Project | Global | `{name}` | Yes |
+| Team | Global | `{name}` | Yes |
+| Decision | Channel | `{name, channel}` | No — decisions are contextual |
+| Meeting | Channel | `{name, channel}` | No |
+| Artifact | Channel | `{name, channel}` | No |
+| Extension types | Channel (default) | `{name, channel}` | No |
+
+### Graph Traversal Guards
+
+- **Directed traversal** (`->` not `-`) halves search space
+- **APOC path expansion** with `uniqueness: NODE_GLOBAL` and `limit: 50`
+- **Transaction timeout**: 5 seconds hard limit — returns empty on timeout, retriever falls back to semantic
+- **SUPERSEDES chains**: capped at 5 hops with `WITH DISTINCT` to prevent combinatorial explosion
+
+### Required Neo4j Indexes
+
+```cypher
+CREATE INDEX person_name FOR (n:Person) ON (n.name);
+CREATE INDEX tech_name FOR (n:Technology) ON (n.name);
+CREATE INDEX decision_name FOR (n:Decision) ON (n.name);
+CREATE INDEX project_name FOR (n:Project) ON (n.name);
+CREATE FULLTEXT INDEX entity_fulltext FOR (n:Person|Decision|Project|Technology) ON EACH [n.name];
+CREATE INDEX event_wid FOR (e:Event) ON (e.weaviate_id);
+```
+
+### Write Safety (Outbox Pattern)
+
+```
+Message → MongoDB write_intent (atomic) → Fan out:
+  ├── Weaviate upsert (idempotent via deterministic UUID)
+  ├── Neo4j MERGE (idempotent via MERGE semantics)
+  └── MongoDB sync state update
+Background reconciler retries pending/failed writes every 15 minutes.
+```
+
+### LLM Fallback Chain
+
+| Call Site | Primary | Fallback | Last Resort |
+|-----------|---------|----------|-------------|
+| Query Router | Gemini Flash Lite | Claude Haiku | v1 regex classifier |
+| Fact Extraction | Gemini Flash Lite | Claude Haiku | Dead letter queue |
+| Entity Extraction | Gemini Flash Lite | Claude Haiku | Skip (Weaviate-only) |
+| Classification | Gemini Flash Lite | Rule-based tagger | Skip (no tags) |
+| Response Gen | Gemini Flash | Claude Sonnet | Return raw results |
+| Wiki Gen | Gemini Flash Lite | Claude Haiku | Serve stale cache |
+
+### Observability
+
+- **Health endpoint**: `/health` aggregates all 6 dependencies (healthy/degraded/unhealthy)
+- **Distributed tracing**: OpenTelemetry spans per pipeline stage and retrieval path
+- **Key metrics**: Ingestion rate, quality gate rejection ratio, per-store latency/error rates, LLM cost tracking, orphan counts
+- **Backups**: Daily 3 AM UTC for all 3 stores → S3, 30-day retention
+- **Consistency checks**: Weekly cross-store referential integrity validation
+
+### Access Control
+
+- Channel-level ACL inherited from source platform membership
+- Private channel results filtered before returning to user
+- API authentication via Bearer token middleware
+- Global entities visible to all; relationships from private channels filtered by `source_channel`
+
+### Temporal Decay Behavior
+
+Default `DECAY_RATE = 0.1` with exemptions:
+
+| Fact Age | Multiplier | Notes |
+|----------|-----------|-------|
+| 1 day | 0.997 | No noticeable decay |
+| 30 days | 0.905 | ~10% reduction |
+| 90 days | 0.741 | ~26% reduction |
+| 180 days | 0.549 | ~45% reduction |
+| 365 days | 0.295 | ~70% reduction |
+
+- **Exempt from decay**: facts with `importance: "high"` or `"critical"`
+- **Half decay rate**: facts tagged `decision`, `architecture`, `policy`, `deadline`
+- **Citation reinforcement**: cited facts decay slower — rate = `0.1 / (1 + 0.1 * citation_count)`
+
+### Consolidation Schedule
+
+| Trigger | When | Scope | Cost |
+|---------|------|-------|------|
+| **After sync** | Automatic on sync completion | Incremental — new unclustered facts only | ~$0.001/fact |
+| **Daily rebuild** | 2 AM UTC | Full — coherence check, split/merge, summaries | ~$0.05/channel |
+| **On-demand** | `POST /api/consolidate/{channel_id}` | Full reconsolidation + wiki rebuild | ~$0.05/channel |
+
+Cluster health: split at >100 members, merge at summary cosine >0.85, re-cluster at coherence <0.4.
+
+Wiki dirty flag: set by consolidation, entity extraction, and contradiction detection. Wiki regenerated on next read if dirty.
+
+### Contradiction Detection
+
+Background job runs every 15 minutes:
+1. Find recently ingested facts not yet checked
+2. For each: cosine similarity scan (70-95% range) + entity-scoped scan (same Decision topic)
+3. LLM comparison: CONTRADICTORY / PROGRESSIVE / INDEPENDENT
+4. If CONTRADICTORY with confidence > 0.8: set `invalid_at` on old fact, create SUPERSEDES edge in Neo4j
+5. Superseded facts automatically excluded from retrieval (`invalid_at IS NULL` filter)
+
+Cost: ~$0.001 per comparison. Typically 0-5 comparisons per new fact.
+
+### MCP Tool Surface
+
+| Tool | Description | Cost |
+|------|-------------|------|
+| `ask_questions` | Query via smart router (semantic/graph/both) | $0.001-$0.006 |
+| `search_memories` | Direct hybrid search (bypass router) | ~$0.001 |
+| `get_wiki` | Read cached wiki | FREE |
+| `get_topics` | List topic clusters | FREE |
+| `sync_channel` | Trigger channel ingestion | ~$0.0025/msg |
+| `get_sync_status` | Check sync progress | FREE |
+| `refresh_wiki` | Force wiki regeneration | ~$0.01 |
+
+Graph queries abstracted behind `ask_questions` — users don't interact with Neo4j directly.
+
+MCP Resources: `wiki://{channel_id}`, `wiki://{channel_id}/overview`, `wiki://{channel_id}/topics`
+
+---
+
+*This document is the comprehensive architecture reference for Beever Atlas v2. For design decisions and rationale, see `TECHNICAL_PROPOSAL.md`. For v1 weakness resolution details, see `WEAKNESS_RESOLUTION_MAP.md`.*
diff --git a/docs/v1-archive/PROJECT_ANALYSIS.md b/docs/v1-archive/PROJECT_ANALYSIS.md
new file mode 100644
index 00000000..25d2874e
--- /dev/null
+++ b/docs/v1-archive/PROJECT_ANALYSIS.md
@@ -0,0 +1,977 @@
+# Beever Atlas — Comprehensive Project Analysis
+
+> **Date**: March 20, 2026
+> **Version Analyzed**: 3.2.0 (pyproject.toml) / 3.3 (server.py header)
+> **Codebase**: 43 Python source files, ~17,700 LOC | 6 test files, ~743 LOC | React frontend (web/)
+
+---
+
+## Table of Contents
+
+1. [Project Overview](#1-project-overview)
+2. [Memory Retrieval Architecture Commentary](#2-memory-retrieval-architecture-commentary)
+3. [Critical Bugs](#3-critical-bugs)
+4. [Limitations](#4-limitations)
+5. [Weaknesses](#5-weaknesses)
+6. [Incomplete / In-Progress Work](#6-incomplete--in-progress-work)
+7. [Further Improvements](#7-further-improvements)
+8. [Priority Matrix](#8-priority-matrix)
+
+---
+
+## 1. Project Overview
+
+Beever Atlas is a Slack Context MCP Server that provides AI agents with channel context through hierarchical memory retrieval. It ingests Slack messages, images, PDFs, and videos, processes them through Gemini LLMs for fact extraction, stores them in Weaviate (vector + BM25), and serves them via the MCP protocol with grounded responses and Slack permalink citations.
+
+### Architecture at a Glance
+
+```
+Slack API ──► Fetch Phase ──► MongoDB (metadata)
+                                  │
+                              Process Phase
+                                  │
+                        Gemini (fact extraction)
+                                  │
+                           Jina v4 (embeddings)
+                                  │
+                         Weaviate (vector store)
+                                  │
+              ┌───────────────────┼───────────────────┐
+              ▼                   ▼                   ▼
+         Tier 0              Tier 1              Tier 2
+    Channel Summary     Topic Clusters      Atomic Memories
+              │                   │                   │
+              └───────────────────┼───────────────────┘
+                                  │
+                     Hierarchical Retrieval
+                                  │
+                    ┌─────────────┼─────────────┐
+                    ▼             ▼             ▼
+               Wiki System   ask_questions   Search Tools
+              (FREE reads)   (PAID - LLM)   (hybrid/vector)
+                    │             │             │
+                    └─────────────┼─────────────┘
+                                  ▼
+                         MCP Protocol / REST API
+                                  │
+                            AI Agent Clients
+```
+
+### Tech Stack
+
+| Component | Technology |
+|-----------|-----------|
+| MCP Framework | FastMCP (Python, async) |
+| Vector Store | Weaviate (hybrid BM25 + vector, named vectors) |
+| LLM (cheap) | Gemini Flash Lite (extraction, tagging, classification) |
+| LLM (quality) | Gemini Flash (response generation) |
+| Embeddings | Jina v4 (2048-dim, multimodal unified space) |
+| Database | MongoDB (sync state, via Motor async driver) |
+| Web Search | Tavily |
+| Frontend | React + Vite + shadcn/ui ("memory-browser") |
+| Deployment | Docker Compose (MCP server, Weaviate, frontend) |
+
+---
+
+## 2. Memory Retrieval Architecture Commentary
+
+### 2.1 Wiki-First Design — Strengths and Gaps
+
+The wiki-first architecture (`wiki://slack/{channel_id}`) is one of the strongest design decisions in the project. It creates a **two-tier cost model**:
+
+1. **FREE tier**: Pre-generated wiki documents cached in MongoDB, served as MCP resources with zero LLM cost per read
+2. **PAID tier**: On-demand `ask_questions` with full hierarchical retrieval + Gemini response generation
+
+**What works well:**
+- The wiki is generated from the same hierarchical memory system, so it stays consistent with the underlying data
+- Topic-level wiki sections (`wiki://slack/{channel}/topics/{topic}`) provide targeted reads without paying for a full LLM call
+- The `WikiDocument` model (`models/wiki.py`) has proper structure with sections for overview, topics, decisions, and recent activity
+- Wiki generation uses `gemini-2.5-flash-lite` (the cheapest model), keeping regeneration costs low
+- The `wiki_cache_ttl_hours` setting (default 24h) prevents unnecessary regeneration
+
+**Gaps and concerns:**
+
+| Issue | Detail | Impact |
+|-------|--------|--------|
+| **Full regeneration only** | `refresh_wiki` rebuilds the entire wiki document from scratch. There is no incremental update — even a single new message triggers a full regeneration with LLM calls for every section. The `WikiUpdatePlan` and `WikiChangeAnalysis` models exist in `models/wiki.py` but are **never used** in `services/wiki.py`. | Cost waste on large channels |
+| **No staleness indicator** | Wiki resources are served without any metadata about when they were last updated. A client reading `wiki://slack/{channel}` has no way to know if the wiki is 1 hour old or 1 week old without calling `list_wikis` separately. | Client trust issues |
+| **Wiki generation failures are silent** | If Gemini fails during wiki generation (quota, network, content filter), the error is caught and logged but the wiki is not marked as failed — stale content continues to be served without any indication. | Misleading data |
+| **No diff/changelog** | There is no mechanism to show what changed between wiki versions. For teams tracking decisions, knowing "what's new since I last read" is critical. | Reduced utility for recurring readers |
+| **Topic limit may drop content** | `wiki_max_topics: int = 20` caps the number of topics in the wiki. Channels with rich discussions across 30+ topics will silently lose coverage. | Information loss |
+| **Recent activity is fixed at 7 days** | The `wiki_recent_days: int = 7` is not configurable per-channel and doesn't account for channel activity patterns. A low-traffic channel might have no activity in 7 days; a high-traffic one might need only 2 days. | Poor adaptation |
+
+**Recommendations:**
+- Implement the incremental update path using the existing `WikiUpdatePlan` model — detect which sections changed and only regenerate those
+- Add `last_updated_at` and `staleness_indicator` to wiki resource metadata
+- Consider a diff/changelog section that highlights changes since the last generation
+- Make `wiki_recent_days` adaptive based on channel activity (e.g., target N recent memories rather than N days)
+
+### 2.2 Three-Tier Hierarchical Memory — Design Analysis
+
+The 3-tier design maps naturally to how humans think about team knowledge:
+
+```
+Tier 0 (Channel Summary)    → "What is this channel about?"
+Tier 1 (Topic Clusters)     → "What's happening with authentication?"
+Tier 2 (Atomic Memories)    → "Who said we should use JWT, and when?"
+```
+
+**What works well:**
+- **Query classification** routes queries to the right starting tier automatically, avoiding expensive full-depth searches for simple overview questions
+- **Automatic tier expansion** (e.g., if Tier 1 returns < 3 results, expand to Tier 2) provides graceful degradation
+- **Temporal decay** (`temporal.py`) applies time-based scoring without re-embedding, keeping recent information prominent
+- **Hybrid search** (BM25 + vector) with configurable alpha gives good retrieval for both keyword-precise and semantic queries
+- **Cross-modal search** via Jina v4's unified embedding space enables text-to-image and text-to-document retrieval
+- **Citation validation** (`grounding.py:165-184`) removes hallucinated citation IDs from LLM responses
+
+**Architectural weaknesses:**
+
+#### 2.2.1 Consolidation Is Fundamentally Broken
+
+This is the most critical issue in the retrieval architecture. The `_link_memories_to_cluster` method at `consolidation.py:214-231` is explicitly a no-op:
+
+```python
+async def _link_memories_to_cluster(self, memories, cluster_id):
+    # In a future version, we could update memories in Weaviate
+    # For now, we track membership in the cluster's member_ids
+    logger.debug(f"Linked {len(memories)} memories to cluster {cluster_id}")
+```
+
+**The consequence**: Since atomic memories never get their `cluster_id` property set, the `_get_unclustered_memories` method (line 112-136) filters by `not m.get("cluster_id")`, which means **the exact same memories will be re-clustered on every consolidation run**. This creates an ever-growing number of duplicate Tier 1 clusters.
+
+The deterministic UUID generation in `weaviate_client.py:189-223` will prevent exact duplicates, but because the cluster summary is LLM-generated (non-deterministic), even slight text variation produces a new cluster. After N consolidation runs, the system accumulates N near-duplicate clusters per topic, degrading Tier 1 retrieval quality.
+
+**Fix required**: Implement the Weaviate `collection.data.update()` call to set `cluster_id` on atomic memories, or maintain a cluster membership index in MongoDB.
+
+#### 2.2.2 Query Classifier Is Too Brittle
+
+The `QueryClassifier` (`hierarchical_retrieval.py:49-120`) uses hardcoded regex patterns:
+
+```python
+TOPIC_PATTERNS = [
+    (r"about\s+(?:the\s+)?(\w+)", 1),  # Only captures single word!
+    (r"regarding\s+(\w+)", 1),
+    ...
+]
+```
+
+**Problems:**
+- **Single-word topic extraction**: "What's happening with API design?" captures only "API", missing the compound topic "API design"
+- **Priority ordering creates misclassification**: DETAIL patterns are checked first, so "who said something about authentication" matches `r"who\s+said"` and is classified as DETAIL instead of TOPIC_SPECIFIC, skipping topic extraction entirely
+- **No learning or adaptation**: The classifier doesn't use the `model_query_classification` setting that exists in config — there's a Gemini model configured for classification but never called
+- **Default fallback is fragile**: Unrecognized queries default to TOPIC_SPECIFIC with `fallback_to_detail: True`, but the `fallback_to_detail` flag is never read by the retrieval logic — it's set but ignored
+
+**Recommendation**: Replace the regex classifier with a lightweight LLM call (using the already-configured `model_query_classification = gemini-2.5-flash-lite`), or at minimum:
+- Support multi-word topic extraction via `(\w[\w\s-]+\w)` patterns
+- Reorder pattern priority: OVERVIEW → TOPIC → DETAIL (broad to narrow)
+- Actually use the `fallback_to_detail` flag in `HierarchicalRetrievalService.retrieve()`
+
+#### 2.2.3 Tier Expansion Logic Has Blind Spots
+
+The tier expansion thresholds are hardcoded magic numbers:
+
+```python
+# In retrieve() method
+if depth == "summary":
+    if len(memories) < 2:  # Why 2?
+        # expand to clusters
+
+elif depth == "cluster":
+    if len(memories) < 3:  # Why 3?
+        # expand to atomic
+```
+
+- The thresholds (2 and 3) are arbitrary with no documentation on why these values were chosen
+- There is **no expansion from Tier 2 upward** — if an atomic search returns poor results, the system doesn't try broader tiers
+- Quality of results is not considered — 3 low-relevance cluster hits won't trigger expansion, even though they might not answer the question
+- The expansion adds results via `.extend()` without re-scoring, so expanded results appear at the end regardless of relevance
+
+**Recommendation**:
+- Make thresholds configurable
+- Consider relevance scores, not just count — expand if max score < threshold
+- Re-sort combined results by score after expansion
+- Add upward expansion: if Tier 2 returns results but none are high-confidence, check Tier 1 for a cluster summary
+
+#### 2.2.4 Temporal Decay Is Computed But Not Applied to Retrieval
+
+The `TemporalResolutionService` has a well-designed `apply_temporal_decay()` method that adjusts scores by recency. However, examining the call sites:
+
+- `query.py:269` — calls `enrich_memories_with_temporal()` (adds labels only, no score adjustment)
+- `grounding.py:82` — calls `enrich_memories_with_temporal()` (same — labels only)
+- `hierarchical_retrieval.py` — **never calls temporal service at all** for score adjustment
+
+The `apply_temporal_decay()` method that actually adjusts scores and re-sorts is **never called anywhere in the codebase**. Temporal decay exists as infrastructure but has zero effect on retrieval ranking. Recent and old memories are ranked purely by Weaviate's hybrid search score.
+
+**Impact**: A decision from 6 months ago has the same retrieval weight as one from yesterday, even though the user likely cares more about recent information. The LLM generation prompt mentions temporal preference ("prefer more recent"), but the retrieval layer doesn't enforce it, so the LLM may never see the recent memory if an old one scores higher.
+
+**Fix**: Call `self.temporal_service.apply_temporal_decay(memories)` in `HierarchicalRetrievalService.retrieve()` before returning results.
+
+#### 2.2.5 Deduplication Is Fragile
+
+The deduplication in `hierarchical_retrieval.py:206-213`:
+
+```python
+seen_ids = set()
+for m in memories:
+    mem_id = m.get("id") or m.get("memory", "")[:50]  # First 50 chars as fallback
+    if mem_id not in seen_ids:
+        seen_ids.add(mem_id)
+        unique_memories.append(m)
+```
+
+Using the first 50 characters of memory text as a dedup key is unreliable — two memories about the same topic with the same opening ("The team decided to...") will be incorrectly deduplicated. Conversely, two identical facts with different openings will both pass through.
+
+**Recommendation**: Use Weaviate UUIDs exclusively for dedup, and add semantic similarity dedup for cross-tier expansion (e.g., if a cluster summary and its member atomic memories both appear, prefer the more specific one).
+
+#### 2.2.6 No Negative Feedback Loop
+
+The retrieval system has no mechanism to learn from bad retrievals:
+
+- No relevance feedback (user can't mark results as unhelpful)
+- No query logs to analyze retrieval quality
+- No A/B testing between retrieval strategies
+- The grounding service validates citation IDs but doesn't track citation usage rates
+
+For a production system, this means retrieval quality can only be improved by manual tuning of `hybrid_alpha`, thresholds, and patterns — there's no data-driven optimization path.
+
+### 2.3 Retrieval Redesign Proposal — Topic-First with Flexible Tiers
+
+> **Status**: Rough idea / design exploration. Not validated yet.
+
+#### The Core Problem: Tiers Don't Actually Reduce Search Cost
+
+The current 3-tier design gives the illusion of "starting broad and drilling down," but all three tiers query the **same Weaviate collection** via the same `hybrid_search` function. The "tier" is just a metadata property filter — Weaviate still scans the same index regardless. This means:
+
+```
+OVERVIEW query  → hybrid_search(tier_filter="tier0_summary", limit=5)   → scans full index
+DETAIL query    → hybrid_search(tier_filter="tier2_atomic", limit=20)   → scans full index
+```
+
+The retrieval cost is essentially identical. The difference is only in **which pre-computed content** is returned (a stale summary vs. raw facts), not in how efficiently the search runs.
+
+#### Proposed Direction: Topic-First, Then Drill Down
+
+Instead of classifying queries into a tier and searching within that tier, use a **two-stage coarse-to-fine retrieval**:
+
+```
+Stage 1: Find relevant topic cluster(s)
+  → hybrid_search(tier="tier1_cluster", query=question, limit=5)
+  → Identifies which topic areas are relevant
+  → Returns cluster member_ids (pointers to atomic memories)
+
+Stage 2: Search WITHIN matched cluster members only
+  → hybrid_search(ids=member_ids, query=question, limit=10)
+  → Search space narrowed from thousands of memories → tens
+  → Much higher precision
+```
+
+**Why this is better:**
+
+| Aspect | Current (tier-only) | Topic-first |
+|--------|-------------------|-------------|
+| Search space for detail queries | All atomic memories in channel | Only memories within matched topic cluster |
+| Precision | Diluted — irrelevant topics compete | Focused — pre-filtered by topic relevance |
+| Scalability | Degrades linearly as channel grows | Stays bounded — cluster sizes are capped |
+| Context quality for LLM | Mixed-topic results | Topically coherent context window |
+
+**Example:**
+```
+User: "What did the team decide about JWT authentication?"
+
+Current path:
+  → Classified as DETAIL → searches ALL tier2_atomic memories
+  → Gets 20 results from thousands, hoping "JWT" memories rank high
+  → May return noise from unrelated discussions mentioning "team" or "decided"
+
+Topic-first path:
+  1. Search clusters → finds "authentication" cluster (15 member memories)
+  2. Search within those 15 → "JWT" keywords + semantic match
+  → Higher precision, less noise, faster
+```
+
+#### Proposed Direction: Bottom-Up Expansion (Low → High)
+
+The current system only expands **top-down** (summary → clusters → atomic). But bottom-up makes sense too:
+
+```
+User: "What's the overall project status?"
+
+Current: Classified as OVERVIEW → Tier 0 → returns a stale summary
+  (Tier 0 is only updated during consolidation, which may not have run recently)
+
+Bottom-up approach:
+  → Start at Tier 2 (always has the freshest data)
+  → Get recent atomic memories (last 7 days, limit=30)
+  → Group by topic_tags dynamically
+  → Synthesize a fresh overview on-the-fly
+  → More accurate than a potentially stale Tier 0 summary
+```
+
+A **bidirectional** retrieval system would choose direction based on query type:
+
+```
+OVERVIEW / "catch me up" questions:
+  → Bottom-up: Tier 2 (fresh atomics) → group by topic → synthesize
+  → Fresher than pre-computed Tier 0
+
+DETAIL / "who said X" questions:
+  → Top-down (topic-first): Tier 1 (find clusters) → Tier 2 (search within)
+  → Higher precision than searching all atomics
+
+TOPIC / "what about auth" questions:
+  → Direct: Tier 1 cluster + its Tier 2 members
+  → Best of both worlds
+```
+
+#### Proposed Direction: Score-Based Expansion Instead of Magic Thresholds
+
+Replace the current arbitrary count thresholds (2 and 3) with relevance-score-based decisions:
+
+```python
+# Current (arbitrary count thresholds)
+if len(memories) < 3:
+    expand_to_next_tier()
+
+# Proposed (score-based)
+max_score = max(m.get("score", 0) for m in memories) if memories else 0
+avg_score = sum(m.get("score", 0) for m in memories) / len(memories) if memories else 0
+
+should_expand = (
+    len(memories) == 0              # no results at all
+    or max_score < 0.5              # best result is low confidence
+    or avg_score < 0.3              # overall quality is poor
+)
+
+if should_expand:
+    expanded = search_adjacent_tier(direction="down" or "up")
+    all_memories = memories + expanded
+    # Re-rank combined results by score
+    all_memories.sort(key=lambda m: m.get("score", 0), reverse=True)
+```
+
+This way expansion happens when results are **low quality**, not just when there are **few results**. Three irrelevant results shouldn't prevent expansion into a tier that might have the answer.
+
+#### Proposed Direction: Flexible / Dynamic Tier Structure
+
+The current system hardcodes exactly 3 tiers. A more flexible approach would treat tiers as a **dynamic hierarchy** rather than a fixed structure:
+
+```
+Current (fixed 3 tiers):
+  Tier 0: Channel Summary (1 per channel)
+  Tier 1: Topic Clusters (N per channel, flat)
+  Tier 2: Atomic Memories (many per channel)
+
+Future possibility (dynamic, nested):
+  Channel
+  ├── Area: "Backend Engineering"
+  │   ├── Topic: "Authentication"
+  │   │   ├── Sub-topic: "JWT Implementation"
+  │   │   │   ├── Fact: "Team chose RS256 over HS256"
+  │   │   │   └── Fact: "Token TTL set to 1 hour"
+  │   │   └── Sub-topic: "OAuth Integration"
+  │   │       └── Fact: "Using Auth0 as provider"
+  │   └── Topic: "Database"
+  │       └── ...
+  └── Area: "Product Design"
+      └── ...
+```
+
+In this model:
+- Tiers are not numbered (0, 1, 2) but represent **semantic depth levels** that emerge from the data
+- A channel with 50 messages might only have 2 levels; one with 50,000 might have 4-5
+- Consolidation dynamically decides when to create intermediate groupings based on cluster sizes
+- Retrieval navigates the tree rather than scanning a flat tier
+
+This is a significant architectural change and needs careful design. Key open questions:
+- How to determine when a topic is large enough to warrant sub-topics?
+- How to handle memories that belong to multiple topic branches?
+- How to keep the tree balanced and prevent degenerate structures?
+- Does Weaviate's filtering support efficient tree navigation, or does this need a separate index (e.g., a graph database or MongoDB tree)?
+
+**This is a rough direction, not a concrete proposal.** The immediate priorities are:
+1. Fix the consolidation no-op (prerequisite for any cluster-based retrieval)
+2. Implement score-based expansion (low effort, high impact)
+3. Add topic-first retrieval as an alternative path (medium effort)
+4. Evaluate dynamic tier structure as a longer-term evolution
+
+---
+
+### 2.4 Multi-Query Parallel Search — Exists But Disconnected
+
+The project has a well-built multi-query parallel search pipeline in the `agents/` layer:
+
+**What exists:**
+- `query_planner.py` — LLM-based query decomposition that splits complex questions into focused sub-queries (internal + external), using `gemini-flash-lite` for cost efficiency
+- `parallel_search.py` — True parallel execution via `asyncio.gather()` with deduplication by memory ID and score-based re-ranking
+- `coordinator_agent.py` — Wires decomposition → parallel search → grounded response together
+- `chat_routes.py` — The web frontend streaming chat uses this full pipeline with thinking tokens
+
+**Example of what it can do:**
+```
+User: "Tell about NBA and FIFA, are they talking about it?"
+
+query_planner decomposes to:
+{
+    "internal_queries": [
+        {"query": "NBA basketball discussion", "focus": "nba_topic"},
+        {"query": "FIFA soccer football", "focus": "fifa_topic"},
+        {"query": "NBA FIFA comparison", "focus": "cross_topic"}
+    ],
+    "external_queries": []
+}
+
+search_internal_parallel() runs all 3 via asyncio.gather()
+→ Deduplicates by memory ID
+→ Sorts by score
+→ Returns merged results
+```
+
+**The critical gap: The primary MCP tool doesn't use it.**
+
+| Tool | Decomposition | Parallel Search | Used By |
+|------|--------------|-----------------|---------|
+| `ask_questions()` | None — single query | No | MCP clients (primary tool) |
+| `ask_with_context()` | Yes — via coordinator | Yes — asyncio.gather | MCP clients (secondary) |
+| `ask_parallel()` | None | No — sequential loop | MCP clients (misleadingly named) |
+| Streaming chat API | Yes — with thinking | Yes — asyncio.gather | Web frontend only |
+
+This means:
+- An MCP client calling the recommended `ask_questions("Tell about NBA and FIFA")` gets a **single undifferentiated search** that may miss one topic entirely
+- The best retrieval path (`ask_with_context`) exists but is positioned as a secondary tool for "external knowledge comparison," not as the default
+- The web frontend gets better retrieval quality than MCP clients do
+
+**Recommendation:**
+- Make `ask_questions` use the decomposition pipeline by default (or at least when the query classifier detects multiple topics)
+- Rename/merge `ask_with_context` and `ask_questions` so the best retrieval path is the default, not an opt-in
+- Fix `ask_parallel` to actually use `asyncio.gather()` or remove it (it's misleading)
+- Consider making the coordinator path the single entry point for all queries, with the `include_external` flag controlling whether web search is included
+
+---
+
+### 2.5 Grounding and Citation System — Commentary
+
+The grounding system (`grounding.py`) is well-designed for its purpose:
+
+**Strengths:**
+- Citation validation removes hallucinated citation IDs via regex (`_validate_citations`)
+- Temporal context is injected into the generation prompt so the LLM can reason about recency
+- Streaming support with thinking/content separation (`generate_stream`)
+- Configurable model override per request
+
+**Weaknesses:**
+- `grounding.py:103` — `generate_content()` is a **synchronous blocking call** inside an async function (same issue as Weaviate client)
+- `grounding.py:269` — `generate_content_stream()` uses a synchronous `for chunk in ...` loop, which blocks the event loop during streaming
+- Citation builder fetches Slack permalinks one at a time (no batching) — for responses with 10 citations, this means 10 sequential Slack API calls
+- The `max_citations_per_response` (default 10) limits memories before generation, not citations in the output — if the first 10 memories are all from the same topic, the response loses diversity
+- No caching of generated responses — identical questions within minutes trigger full LLM regeneration
+
+### 2.6 Cross-Modal Search — Commentary
+
+The cross-modal search via Jina v4's unified embedding space is a strong differentiator:
+
+**Strengths:**
+- Single embedding model for text, images, and documents reduces operational complexity
+- Named vectors in Weaviate (`text_vector`, `image_vector`, `doc_vector`) enable targeted cross-modal queries
+- Distance threshold filtering (< 0.5) prevents irrelevant cross-modal results from polluting text search
+
+**Weaknesses:**
+- The distance threshold (0.5) is hardcoded in `query.py:204` — different embedding spaces may need different thresholds
+- Image embeddings are limited to `image_max_tokens: 7500` due to endpoint constraints, which may lose detail for complex diagrams
+- No re-ranking step after cross-modal search — text and image results are simply appended, not interleaved by relevance
+- PDF page embeddings are per-page, not per-document — a 50-page PDF generates 50 separate embeddings, which fragments retrieval context
+
+---
+
+## 3. Critical Bugs
+
+### 3.0a Internal Error Details Leaked to API Clients
+
+**File**: `api/routes.py` (22 occurrences), `api/chat_routes.py:562`
+
+Every exception handler passes `detail=str(e)` to `HTTPException(status_code=500)`:
+
+```python
+except Exception as e:
+    raise HTTPException(status_code=500, detail=str(e))  # Leaks internals!
+```
+
+This exposes internal stack traces, database connection strings, file paths, and potentially secrets to API clients. Found at lines: 252, 332, 395, 544, 574, 605, 692, 708, 731, 938, 1021, 1092, 1162, 1246, 1331, 1409, 1448, 1476, 1514, 1557.
+
+**Impact**: Information disclosure vulnerability — attackers can learn internal architecture, dependency versions, and potentially credentials from error responses.
+
+**Fix**: Return generic error messages to clients and log full details server-side:
+```python
+except Exception as e:
+    logger.error(f"Failed: {e}", exc_info=True)
+    raise HTTPException(status_code=500, detail="Internal server error")
+```
+
+### 3.0b Tavily API Key Sent in HTTP Request Body
+
+**File**: `services/external_search.py:145, 263`
+
+The Tavily API key is placed directly in the JSON request body (`"api_key": self.api_key`). If request logging is enabled (common in production), the API key will appear in access logs. The error handler at line 195 includes `str(e)` which could also leak the key.
+
+**Impact**: API key exposure via logs or error messages.
+
+**Fix**: Use the `Authorization` header instead, or ensure request body logging is suppressed for these calls.
+
+### 3.0c `SyncRequest.max_messages` Has No Upper Bound
+
+**File**: `api/schemas.py:121`
+
+`max_messages: int | None = None` has no `Field(le=...)` constraint. A client can pass `max_messages=999999999`, triggering an extremely expensive sync that hits the Slack API thousands of times, incurs massive Gemini API costs, and could cause resource exhaustion. Compare with `SearchRequest.limit` on line 91 which correctly uses `Field(default=20, ge=1, le=100)`.
+
+**Impact**: Denial of service / cost explosion via unbounded sync requests.
+
+**Fix**: Add validation: `max_messages: int = Field(default=500, ge=1, le=5000)`.
+
+### 3.0d `search_scope` Parameter Accepted But Silently Ignored
+
+**File**: `server.py:118-152`
+
+The `ask_questions` tool accepts a `search_scope` parameter (line 138) documented as `"auto"`, `"wiki"`, or `"memories"`, but it is **never passed** to `query_service.ask_question()` on line 150-152. Callers who set `search_scope="wiki"` expecting a different behavior get the same result as the default.
+
+**Impact**: Misleading API contract; callers cannot control retrieval strategy as documented.
+
+**Fix**: Either implement the routing logic for `search_scope` or remove the parameter.
+
+### 3.0e Existing Tests Are Broken
+
+**File**: `tests/test_tools.py:19-20`
+
+`test_tools_are_registered` checks for `"ask_channel"` but the actual tool in `server.py` is `"ask_questions"`. This test will always fail. The expected tools list is outdated and missing many v3.0+ tools (`search_by_topic`, `search_decisions`, `search_recent`, `ask_with_context`, `ask_parallel`, `refresh_wiki`, `list_wikis`, etc.).
+
+**Impact**: The test suite gives false confidence — it doesn't actually validate the current codebase.
+
+**Fix**: Update the expected tools list to match `server.py`.
+
+### 3.1 Consolidation Creates Infinite Duplicate Clusters
+
+**File**: `services/consolidation.py:214-231`
+
+`_link_memories_to_cluster()` is a no-op — atomic memories never get their `cluster_id` set. Every consolidation run re-selects the same memories via `_get_unclustered_memories()` (line 112-136, which filters by `not m.get("cluster_id")`), creating ever-growing duplicate Tier 1 clusters. LLM non-determinism means each run generates slightly different summary text, bypassing the deterministic UUID dedup in Weaviate.
+
+**Impact**: Tier 1 quality degrades over time; duplicate clusters waste storage and confuse retrieval.
+
+**Fix**: Implement `collection.data.update()` in Weaviate to set `cluster_id` on member memories, or track membership in MongoDB.
+
+### 3.2 Synchronous Weaviate Calls Block the Event Loop
+
+**File**: `services/weaviate_client.py:35-63`
+
+The Weaviate client uses the synchronous `weaviate.WeaviateClient`, but all wrapper functions (`hybrid_search`, `insert_multimodal_memory`, `vector_search`) are declared `async`. Every Weaviate call blocks the entire asyncio event loop.
+
+**Impact**: Under concurrent MCP requests, one search blocks all other requests. This is a scalability ceiling.
+
+**Fix**: Wrap all synchronous Weaviate calls in `asyncio.get_event_loop().run_in_executor(None, ...)`, or migrate to the Weaviate async client.
+
+### 3.3 `ask_parallel` Is Sequential, Not Parallel
+
+**File**: `server.py:654-672`
+
+Despite its name, `ask_parallel` uses a sequential `for tier_name in tiers:` loop. The code even acknowledges this with a comment: `# in practice, these could be parallelized`.
+
+**Impact**: Tool name is misleading; multi-tier queries are slower than necessary.
+
+**Fix**: Replace the `for` loop with `asyncio.gather()`.
+
+### 3.4 Naive vs Aware Datetime Comparison Bug
+
+**File**: `services/query.py:89-95`
+
+```python
+cutoff = datetime.utcnow() - timedelta(days=max_age_days)  # naive datetime
+text_memories = [
+    m for m in text_memories
+    if self._parse_date(m.get("extracted_at")) >= cutoff  # _parse_date returns aware datetime
+]
+```
+
+`_parse_date()` (line 411-425) converts "Z" suffixed dates to timezone-aware datetimes (`+00:00`), but `cutoff` is a naive datetime from `datetime.utcnow()`. Comparing aware and naive datetimes **raises TypeError on Python 3.12+**.
+
+There are **89 uses** of the deprecated `datetime.utcnow()` across **16 files**.
+
+**Impact**: Will crash on Python 3.12+ for any time-filtered query.
+
+**Fix**: Replace all `datetime.utcnow()` with `datetime.now(timezone.utc)` across the codebase.
+
+### 3.5 MongoDB Missing from Docker Compose
+
+**File**: `docker-compose.yml`
+
+The compose file defines services for `slack-context-mcp`, `memory-browser`, and `weaviate`, but **MongoDB is not included**. The server requires MongoDB (`mongodb_uri` defaults to `localhost:27017` in `config.py:47`).
+
+**Impact**: `docker-compose up -d` fails with MongoDB connection errors. The documented deployment path is broken.
+
+**Fix**: Add a MongoDB service to `docker-compose.yml`.
+
+### 3.6 `.env` File Leaked Into Docker Image
+
+**File**: `Dockerfile`
+
+```dockerfile
+COPY src/ src/
+```
+
+This copies `src/slack_context_mcp/.env` (containing API keys for Slack, Google, Jina, Tavily) into the Docker image. There is no `.dockerignore` to exclude it.
+
+**Impact**: Anyone with access to the Docker image can extract all API keys.
+
+**Fix**: Add a `.dockerignore` excluding `**/.env`, or restructure to `COPY` only the needed files.
+
+### 3.7 CORS Hardcoded — Breaks Docker Deployment
+
+**File**: `__main__.py:107-113`
+
+CORS origins are hardcoded to `http://localhost:5173` and `http://127.0.0.1:5173` only. The Docker memory-browser runs on port 3002 with a different origin.
+
+**Impact**: Web frontend cannot communicate with the MCP server in Docker deployments without code changes.
+
+**Fix**: Add `cors_origins: list[str]` to `Settings` in `config.py` and use it in `__main__.py`.
+
+### 3.8 No Graceful Shutdown
+
+**File**: `__main__.py:142-152`
+
+The shutdown event only stops the wiki scheduler. Weaviate client, MongoDB connection, aiohttp sessions (embedding client), and Slack client connections are **never closed**.
+
+**Impact**: Connection leaks during container restarts; potential data corruption if MongoDB writes are in-flight.
+
+**Fix**: Call `close_client()` (Weaviate), `close_database()` (MongoDB), and close aiohttp sessions in the shutdown handler.
+
+---
+
+## 4. Limitations
+
+### 4.1 Single-Workspace Only
+
+The system is designed for one Slack workspace at a time. `slack_bot_token` is a single value, and channel IDs are assumed globally unique. Multi-tenant or multi-workspace deployments require separate instances.
+
+### 4.2 No Authentication/Authorization
+
+- MCP server binds to `0.0.0.0` with no auth (`config.py:52`)
+- Weaviate has `AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true'` (`docker-compose.yml`)
+- REST API endpoints have no API keys, JWT, or RBAC
+- Anyone on the network can read all Slack content and trigger expensive LLM operations
+
+### 4.3 No Real-Time Sync
+
+Sync is pull-based via manual `sync_channel` calls. There is no Slack event subscription (Socket Mode / Events API). The `slack_app_token` and `slack_signing_secret` config fields exist but are unused — they suggest real-time sync was planned but never implemented.
+
+### 4.4 Embedding Provider Lock-In
+
+Hardcoded to Jina v4 embeddings (2048-dim). Switching providers requires re-embedding all stored memories since vector dimensions are baked into the Weaviate schema's named vectors.
+
+### 4.5 Gemini-Only LLM
+
+All LLM calls go through Google Gemini via `google.genai`. No abstraction layer exists for swapping to OpenAI, Anthropic, or local models. The `google-adk` dependency is tightly coupled throughout agents and services.
+
+### 4.6 Single-Channel Query Scope
+
+All query tools (`ask_questions`, `search_by_topic`, etc.) operate on a single channel. There is no cross-channel search — a user can't ask "what has been discussed about authentication across all channels?"
+
+### 4.7 No Memory Pruning / TTL
+
+Once memories are stored, they persist indefinitely. There is no mechanism to:
+- Expire old, low-importance memories
+- Archive or compact historical data
+- Set per-channel retention policies
+- Manage storage growth over time
+
+---
+
+## 5. Weaknesses
+
+### 5.1 Extremely Low Test Coverage
+
+**6 test files / 743 LOC** covering a **17,700 LOC codebase** (~4% test-to-source ratio):
+
+| Service File | Lines | Test Coverage |
+|-------------|-------|--------------|
+| `services/sync.py` | 1,834 | `test_sync.py` (129 lines) — basic mocks only |
+| `api/routes.py` | 1,557 | **None** |
+| `services/wiki.py` | 1,154 | **None** |
+| `agents/content_analyzer.py` | 1,067 | **None** |
+| `server.py` | 881 | `test_tools.py` (53 lines) — parameter registration only |
+| `services/weaviate_client.py` | 778 | **None** |
+| `agents/batch_analyzer.py` | 666 | **None** |
+| `api/chat_routes.py` | 562 | **None** |
+| `agents/coordinator_agent.py` | 555 | **None** |
+| `services/consolidation.py` | 450 | **None** |
+| `services/query.py` | 442 | `test_query.py` (78 lines) — basic mocks |
+| `services/hierarchical_retrieval.py` | ~430 | **None** |
+| `services/grounding.py` | ~420 | **None** |
+| `services/temporal.py` | ~280 | **None** |
+| `services/citation.py` | ~250 | **None** |
+
+Only **4 actual test functions** exist in `test_tools.py`, and they only verify parameter registration, not behavior. There are **zero integration tests** and **zero end-to-end tests**.
+
+### 5.2 Broad Exception Swallowing
+
+**16 files** use `except Exception` catches. Many silently catch and log without re-raising:
+
+- `sync.py` — errors during message/file processing are logged but items may be silently dropped
+- `wiki.py` — wiki generation failures are caught and logged; stale wiki continues to be served
+- `consolidation.py` — cluster creation failures are caught per-topic; remaining topics still run
+- `consistency.py` — consistency check errors are swallowed
+- `routes.py` — has 5+ bare `pass` statements in exception handlers, swallowing errors completely
+
+### 5.3 God Files
+
+Several files are excessively large and violate single-responsibility:
+
+| File | Lines | Responsibilities |
+|------|-------|-----------------|
+| `sync.py` | 1,834 | Fetching, processing, batching, retrying, file downloads, user resolution, temp file management |
+| `routes.py` | 1,557 | All REST API endpoints for memories, channels, sync, wiki, chat — all in one file |
+| `wiki.py` | 1,154 | Wiki generation, caching, section rendering, topic extraction, LLM prompting |
+| `content_analyzer.py` | 1,067 | Image analysis, PDF analysis, video analysis, text analysis, temp file handling |
+
+### 5.4 No Connection Health Checks or Reconnection
+
+- Weaviate client (`weaviate_client.py:35-63`) creates a connection once and caches it globally. If the connection drops, all subsequent calls fail with no recovery path.
+- MongoDB (`mongodb.py:15-28`) — same pattern, no reconnection.
+- Embedding client's aiohttp session (`embeddings.py:244-251`) — no timeout, no retry, no health check.
+
+### 5.5 No Rate Limiting on MCP/API Endpoints
+
+The Slack client handles Slack API rate limits, but the MCP server itself has **no rate limiting**. A consumer can flood `ask_questions` or `sync_channel` with unbounded requests, each triggering expensive Gemini API calls and potentially exhausting quotas.
+
+### 5.6 Embedding Client Has No Retry Logic
+
+The `JinaEmbeddingClient` (`embeddings.py:231-394`) makes single HTTP calls with no retry logic, no timeout configuration, and no rate limiting. Compare this to the Slack client (`slack_client.py:63-127`) which has well-implemented exponential backoff. The embedding client will fail hard on any transient network error.
+
+Additionally, `embed_texts` (line 258-297) sends all texts in a single HTTP request with no batching, which can fail for large lists exceeding the endpoint's payload limit.
+
+### 5.7 Settings Instantiated at Module Import Time
+
+At `config.py:117`, `settings = Settings()` is executed at module import time. Since `slack_bot_token`, `google_api_key`, and `jina_embedding_url` are required fields (no defaults), importing the config module from any test or script fails unless all environment variables are set. This makes unit testing difficult without module-level mocking.
+
+### 5.8 Temporary File Cleanup Risk
+
+`sync.py:1652`, `content_analyzer.py:92-94`, `batch_analyzer.py:257` use `tempfile.NamedTemporaryFile` with `delete=False`. Cleanup relies on manual `os.unlink` in `finally` blocks. If the process crashes between creation and cleanup, temp files accumulate.
+
+---
+
+## 6. Incomplete / In-Progress Work
+
+### 6.1 Dual Agent / Service Architecture
+
+The codebase has **two parallel query execution paths**:
+
+**Services path** (original):
+```
+QueryService → HierarchicalRetrievalService → GroundedResponseGenerator
+```
+
+**Agents path** (newer, ADK-based):
+```
+CoordinatorAgent → Orchestrator → RetrievalAgents → ParallelSearch
+```
+
+The `agents/` directory contains 10 files (coordinator, orchestrator, retrieval agents, query planner, batch analyzer, content analyzer, unified tagging, etc.) that partially duplicate the services layer. `coordinator_agent.py:458` has a `TODO: Collect from agent events`.
+
+It is unclear which path is canonical. The MCP tools in `server.py` use the services path, while `api/chat_routes.py` appears to use the agents path. The ADK migration plan exists in `docs/architecture/11-ADK_MIGRATION_PLAN.md` but is marked as "not started."
+
+### 6.2 Web Frontend (WIP)
+
+The `web/` directory has a React app ("memory-browser") with ~25 components:
+- Dashboard, wiki panel, chat interface, sync controls
+- Memory visualization, grouped view, detail dialogs
+- Channel search/selector, ask-question interface
+- AI elements for streaming chat
+
+However:
+- No frontend tests (no test runner in `package.json`)
+- No E2E tests configured
+- Docker builds it but production-readiness is unclear
+- `chat_routes.py` (562 lines) provides a streaming chat API separate from MCP
+
+### 6.3 Batch API Toggle
+
+`config.py:63` — `use_batch_api: bool = True` for Gemini Batch API (50% cost savings). However, this was disabled on March 17 due to quota issues. The batch vs. inline processing path adds complexity and the toggle creates two code paths that must both be maintained and tested.
+
+### 6.4 Video Processing
+
+`config.py:70` — `process_videos: bool = True` is configured. Video processing exists in `content_analyzer.py` but is the least mature multimodal path compared to images and PDFs. Frame extraction and analysis quality has not been validated.
+
+### 6.5 Consistency Service (No Automation)
+
+`services/consistency.py` (450 lines) implements MongoDB-to-Weaviate consistency checks (orphan detection, missing entries, stale state recovery). However, there is no automated scheduling — it must be triggered manually. This means inconsistencies can accumulate silently.
+
+### 6.6 No CI/CD Pipeline
+
+There is no `.github/` directory, no GitHub Actions, no GitLab CI, and no automated testing or deployment pipeline. All testing and deployment is manual.
+
+### 6.7 Documentation Out of Sync
+
+- `pyproject.toml` says version `3.2.0`; `server.py` header says `v3.3`
+- Architecture docs reference older versions and pre-ADK patterns
+- Getting started guide has outdated `.env` patterns
+
+---
+
+## 7. Further Improvements
+
+### 7.1 Security (Critical)
+
+| Improvement | Detail |
+|-------------|--------|
+| **Add authentication to MCP server and REST API** | API keys at minimum; OAuth2 or JWT for production. Consider MCP protocol-level auth. |
+| **Enable Weaviate authentication** | Disable anonymous access; use API keys or OIDC. |
+| **Secrets management** | Migrate from `.env` files to a vault (HashiCorp Vault, AWS Secrets Manager, GCP Secret Manager). |
+| **Input sanitization** | Channel names and queries flow directly to Weaviate and Gemini without validation. Add input length limits, character filtering, and prompt injection detection. |
+| **Docker security hardening** | Run as non-root user, add `.dockerignore`, use multi-stage builds, set `HEALTHCHECK` instruction. |
+| **Network isolation** | Weaviate and MongoDB should not be exposed on public ports in production. |
+
+### 7.2 Testing (High Priority)
+
+| Improvement | Detail |
+|-------------|--------|
+| **Unit tests for every service** | Especially sync, wiki, consolidation, hierarchical_retrieval, query, grounding. Target >70% coverage. |
+| **Integration tests with test containers** | Use `testcontainers-python` for Weaviate and MongoDB integration tests. |
+| **API/MCP endpoint tests** | Test all MCP tools and REST API endpoints with mocked services. |
+| **Frontend tests** | Add Vitest + React Testing Library for component tests; Playwright for E2E. |
+| **Load/stress tests** | Validate concurrent request handling given the event loop blocking issues. |
+| **Retrieval quality tests** | Create eval datasets to measure retrieval precision/recall across the three tiers. |
+
+### 7.3 Operational Readiness
+
+| Improvement | Detail |
+|-------------|--------|
+| **CI/CD pipeline** | GitHub Actions with lint, test, build, and Docker image push. |
+| **Health checks** | Add readiness/liveness probes that verify Weaviate, MongoDB, and Gemini connectivity. |
+| **Metrics/monitoring** | Instrument with OpenTelemetry (already a dependency but unused). Track: query latency, retrieval quality, LLM costs, sync progress, cache hit rates. |
+| **Structured logging** | Switch from basic `logging` to JSON structured logs for production aggregation. |
+| **Alerting** | Alert on sync failures, LLM quota exhaustion, Weaviate connection drops, consolidation errors. |
+| **Graceful shutdown** | Close all connections (Weaviate, MongoDB, aiohttp, Slack) during shutdown. |
+
+### 7.4 Architecture
+
+| Improvement | Detail |
+|-------------|--------|
+| **Unify agent vs. service paths** | Pick one query execution architecture and remove the other. The ADK agent path is newer but incomplete. |
+| **Break up god files** | Split `sync.py` into fetch/process/batch modules. Split `routes.py` by domain (memories, channels, wiki, chat). |
+| **Async Weaviate** | Wrap synchronous calls in `run_in_executor()` as immediate fix; migrate to async client long-term. |
+| **Connection pooling** | Implement proper connection lifecycle with health checks and reconnection for all external services. |
+| **Retry with backoff** | Add exponential backoff for Gemini, Jina, and Tavily API calls (matching the Slack client's existing pattern). |
+| **MongoDB transactions** | Use transactions for multi-document updates during sync state changes. |
+| **LLM abstraction layer** | Abstract LLM calls behind an interface to enable provider swapping (Gemini, OpenAI, Anthropic, local). |
+
+### 7.5 Features
+
+| Improvement | Detail |
+|-------------|--------|
+| **Real-time sync** | Implement Slack Socket Mode / Events API for live message ingestion instead of manual pull. |
+| **Multi-workspace support** | Support multiple Slack workspaces in a single deployment. |
+| **Cross-channel search** | Enable queries across all synced channels. |
+| **Incremental wiki updates** | Detect changed sections and only regenerate those (using the existing `WikiUpdatePlan` model). |
+| **Memory pruning / TTL** | Implement per-channel retention policies and automatic pruning of old, low-importance memories. |
+| **User access control** | Respect Slack channel permissions — private channel memories should only be accessible to authorized users. |
+| **Relevance feedback** | Allow users to mark retrieval results as helpful/unhelpful to improve future retrieval quality. |
+| **Query result caching** | Cache frequent question/answer pairs with TTL to avoid redundant LLM calls. |
+| **Webhook notifications** | Notify when sync completes, wiki updates, or errors occur. |
+
+### 7.6 Cost Optimization
+
+| Improvement | Detail |
+|-------------|--------|
+| **Embedding caching** | Cache embeddings for repeated or similar queries. |
+| **Query result caching** | Cache frequent question answers with TTL. |
+| **Message deduplication** | Detect near-duplicate messages before LLM processing. |
+| **Adaptive model selection** | Use flash-lite for simple queries and flash/pro only for complex ones (currently all response generation uses flash). |
+| **Batch consolidation scheduling** | Run consolidation during off-peak hours to spread API costs. |
+| **Token budget tracking** | Track and report LLM token usage per channel and per operation type. |
+
+---
+
+## 8. Priority Matrix
+
+### P0 — Fix Now (Blocking / Data Integrity / Security)
+
+| # | Issue | Type | Effort | Impact |
+|---|-------|------|--------|--------|
+| 1 | No authentication on MCP server and REST API (24 unprotected endpoints incl. DELETE) | Security | Medium | Unauthorized access to all Slack data + destructive ops |
+| 2 | Internal error details leaked via `str(e)` in 22 endpoints | Security | Low | Information disclosure (paths, connection strings, secrets) |
+| 3 | Tavily API key sent in HTTP request body (logged in access logs) | Security | Low | API key exposure |
+| 4 | `.env` secrets leaked into Docker image via `COPY src/` | Security | Low | API key exposure |
+| 5 | Weaviate anonymous access enabled | Security | Low | Unauthorized vector store access |
+| 6 | Consolidation no-op creating infinite duplicate clusters | Bug | Low | Data integrity degradation |
+| 7 | Synchronous Weaviate calls blocking event loop | Bug | Medium | Scalability ceiling |
+| 8 | MongoDB missing from docker-compose | Bug | Low | Deployment broken |
+| 9 | `SyncRequest.max_messages` has no upper bound | Security | Low | DoS / cost explosion |
+| 10 | Existing tests reference stale tool names — test suite is broken | Bug | Low | False confidence in quality |
+
+### P1 — Fix Soon (Correctness / Reliability)
+
+| # | Issue | Type | Effort | Impact |
+|---|-------|------|--------|--------|
+| 11 | `datetime.utcnow()` breaks on Python 3.12+ (89 occurrences) | Bug | Medium | Future runtime crashes |
+| 12 | CORS hardcoded, breaks Docker deployment | Bug | Low | Frontend broken in Docker |
+| 13 | `ask_parallel` is sequential | Bug | Low | Performance / naming mislead |
+| 14 | `search_scope` parameter accepted but silently ignored | Bug | Low | Misleading API contract |
+| 15 | No graceful shutdown / connection cleanup | Bug | Low | Connection leaks |
+| 16 | Temporal decay computed but never applied to retrieval ranking | Design gap | Low | Retrieval quality |
+| 17 | No retry logic in embedding client | Reliability | Low | Transient failure crashes |
+| 18 | Test coverage at ~4%, zero integration tests | Quality | High | Regression risk |
+| 19 | No CI/CD pipeline | Ops | Medium | No automated quality gates |
+
+### P2 — Plan For Next Quarter
+
+| # | Issue | Type | Effort | Impact |
+|---|-------|------|--------|--------|
+| 20 | Unify agent/service dual architecture | Architecture | High | Maintainability |
+| 21 | Break up god files (sync.py, routes.py) | Architecture | Medium | Maintainability |
+| 22 | Query classifier too brittle (regex-only, ignores configured LLM) | Design gap | Medium | Retrieval quality |
+| 23 | Retrieval redesign: topic-first two-stage search (requires consolidation fix) | Architecture | Medium | Retrieval precision |
+| 24 | Retrieval redesign: bottom-up expansion (low→high) for overview queries | Architecture | Medium | Freshness of results |
+| 25 | Retrieval redesign: score-based expansion instead of magic thresholds | Architecture | Low | Retrieval quality |
+| 26 | Incremental wiki updates (WikiUpdatePlan model exists but unused) | Feature | Medium | Cost savings |
+| 27 | Real-time sync via Socket Mode | Feature | High | User experience |
+| 28 | Cross-channel search | Feature | Medium | User experience |
+| 29 | Memory pruning / TTL | Feature | Medium | Storage management |
+| 30 | Settings import-time instantiation (breaks testability) | DX | Low | Testability |
+| 31 | Deprecated FastAPI `on_event` usage | Tech debt | Low | Future compatibility |
+
+### P3 — Backlog
+
+| # | Issue | Type | Effort | Impact |
+|---|-------|------|--------|--------|
+| 32 | Version mismatch (3.2 vs 3.3) | DX | Low | Confusion |
+| 33 | Documentation out of sync with v3.3 | DX | Medium | Developer onboarding |
+| 34 | LLM abstraction layer (Gemini-only lock-in) | Architecture | High | Provider flexibility |
+| 35 | Relevance feedback loop | Feature | High | Retrieval quality |
+| 36 | Multi-workspace support | Feature | High | Enterprise readiness |
+| 37 | Docker multi-stage build + non-root user | Ops | Low | Image size + security |
+| 38 | Structured logging (JSON) | Ops | Medium | Observability |
+| 39 | Token budget tracking per channel/operation | Cost | Medium | Cost visibility |
+| 40 | Dynamic/flexible tier structure (evolve beyond fixed 3 tiers) | Architecture | High | Long-term scalability |
+
+---
+
+---
+
+## 9. Positive Observations
+
+Despite the issues identified, the project has several well-engineered aspects:
+
+1. **Well-structured configuration management**: The `Settings` class in `config.py` uses pydantic-settings properly, with sensible defaults and clear categorization. Per-task model selection (flash-lite for cheap tasks, flash for quality) is a thoughtful cost optimization.
+
+2. **Deterministic UUID generation for deduplication**: The `_generate_memory_uuid` function in `weaviate_client.py` uses SHA-256 hashing of full content for deterministic deduplication, with a clear comment explaining why the previous approach (first 100 chars) was insufficient.
+
+3. **Adaptive hybrid search**: The `get_adaptive_alpha` function intelligently adjusts BM25-vs-vector balance based on query characteristics (short keyword queries favor BM25; longer semantic queries favor vector). This improves retrieval quality without user tuning.
+
+4. **Graceful degradation throughout**: Many services fail gracefully — returning empty results when Weaviate collections don't exist, falling back to quick extraction when LLM tagging fails, and recovering stuck syncs on startup (`__main__.py` resets stale processing states).
+
+5. **Comprehensive docstrings**: Nearly every public function has thorough docstrings with Args/Returns sections, making the codebase navigable and self-documenting.
+
+6. **Two-phase sync architecture**: The fetch-then-process design (`sync.py`) with MongoDB as an intermediate store ensures no redundant Slack API calls on retry, failed items are automatically retried from local storage, and there's a clear audit trail of fetched vs. processed items.
+
+7. **Citation validation**: The grounding service (`grounding.py:165-184`) actively removes hallucinated citation IDs from LLM responses, preventing the common RAG problem of fabricated references.
+
+8. **Cost-optimized wiki architecture**: The wiki-first approach with free cached reads and paid LLM queries only for specific questions is a well-designed cost optimization that could save significant LLM costs for read-heavy workloads.
+
+---
+
+*Analysis performed by reviewing all 43 source files, 6 test files, Docker/deployment configuration, and documentation. Findings cross-referenced across architecture, code quality, and operational readiness dimensions using 3 parallel analysis agents (architecture, code quality, docs/deployment).*
diff --git a/docs/v1-archive/RETRIEVAL_IMPROVEMENT_IDEAS.md b/docs/v1-archive/RETRIEVAL_IMPROVEMENT_IDEAS.md
new file mode 100644
index 00000000..e07a1ae9
--- /dev/null
+++ b/docs/v1-archive/RETRIEVAL_IMPROVEMENT_IDEAS.md
@@ -0,0 +1,711 @@
+# Beever Atlas — Retrieval System Improvement Ideas
+
+> **Date**: March 20, 2026
+> **Status**: Design exploration — validated weaknesses with proposed solutions
+> **Scope**: Hierarchical memory retrieval, query classification, consolidation, and search quality
+>
+> **How to use this doc**: Check boxes `[x]` to mark items as agreed/done. Add discussion notes inline under each item.
+
+---
+
+## Table of Contents
+
+1. [Validated Weaknesses](#1-validated-weaknesses)
+2. [Proposed Solutions](#2-proposed-solutions)
+3. [Implementation Roadmap](#3-implementation-roadmap)
+
+---
+
+## Quick Decision Checklist
+
+Use this during team discussions to quickly mark decisions:
+
+**Weaknesses — Do we agree these are real?**
+- [ ] 1.1 Top-down only retrieval
+- [ ] 1.2 Meaningless expansion thresholds
+- [ ] 1.3 Detail queries bypass hierarchy (HIGH)
+- [ ] 1.4 Temporal decay never applied (HIGH)
+- [ ] 1.5 No feedback loop
+- [ ] 1.6 Single workspace / Slack only
+- [ ] 1.7 No real-time sync
+- [ ] 1.8 No memory expiration
+- [ ] 1.9 ADK migration incomplete
+- [ ] 1.10 Brittle regex query classifier
+- [ ] 1.11 Cluster linking is a no-op (HIGH — blocker)
+- [ ] 1.12 No cross-channel search
+- [ ] 1.13 Memory quality 5.25/10 (HIGH)
+- [ ] 1.14 No adaptive alpha in hierarchical retrieval
+- [ ] 1.15 No semantic dedup across tiers
+
+**Solutions — What do we want to build?**
+- [ ] A: Two-stage topic-first retrieval
+- [ ] B: Bidirectional tier expansion
+- [ ] C: Score-based expansion thresholds
+- [ ] D: Apply temporal decay (1-line fix)
+- [ ] E: LLM-augmented query classification
+- [ ] F: Memory quality pipeline
+- [ ] G: Adaptive alpha per query (1-line fix)
+- [ ] H: Cross-tier semantic dedup
+
+**Phases — What are we committing to?**
+- [ ] Phase 1: Quick wins (D, G, C) — 1-2 days
+- [ ] Phase 2: Consolidation fix — 2-3 days
+- [ ] Phase 3: Retrieval redesign (A, B, E, H) — 1-2 weeks
+- [ ] Phase 4: Quality & ecosystem (F, feedback, cross-channel) — 2-4 weeks
+
+**Discussion Notes:**
+> _Add team discussion notes, decisions, and open questions here._
+>
+> _Date:_ _______________
+>
+> _Participants:_ _______________
+>
+> _Key decisions:_
+>
+>
+>
+> _Open questions:_
+>
+>
+>
+> _Next steps:_
+>
+>
+
+---
+
+## 1. Validated Weaknesses
+
+Each weakness below has been verified against the codebase with specific file/line references.
+
+### 1.1 Top-Down Only Retrieval (No Bottom-Up) (Need to study)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | File: `services/hierarchical_retrieval.py:170-203`**
+
+The `HierarchicalRetrievalService.retrieve()` only expands **downward**:
+
+```
+Tier 0 (sparse?) → expand to Tier 1
+Tier 1 (sparse?) → expand to Tier 2
+Tier 2           → stop (no upward path)
+```
+
+There is no upward path. If a detail query at Tier 2 returns weak results, the system cannot navigate up to a parent cluster for broader context. This is a one-way escalation that can only get more granular, never more abstract.
+
+**Why this matters:** When a user asks "What's the overall project status?", the system goes to Tier 0. If the Tier 0 summary is stale (only updated during consolidation), the user gets outdated information. A bottom-up approach would start from Tier 2 (always the freshest data), group by topic dynamically, and synthesize a live overview.
+
+**Evidence:** The `retrieve()` method has three branches — all three can only call downward:
+- `depth == "summary"` → may expand to `_retrieve_clusters`
+- `depth == "cluster"` → may expand to `_retrieve_atomic`
+- `depth == "atomic"` → terminal, no expansion
+
+### 1.2 Hardcoded Expansion Thresholds Are Meaningless (Need to study)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | File: `services/hierarchical_retrieval.py:176, 191`**
+
+The expansion logic uses arbitrary count thresholds:
+
+```python
+# Summary → Cluster expansion
+if len(memories) < 2:    # Why 2?
+    cluster_memories = await self._retrieve_clusters(...)
+
+# Cluster → Atomic expansion
+if len(memories) < 3:    # Why 3?
+    atomic_memories = await self._retrieve_atomic(...)
+```
+
+These are raw result **counts**, not relevance **scores**. A search could return 5 results that are all irrelevant (low similarity scores), and the system considers that "enough" and skips expansion. Conversely, a search that returns 1 highly relevant result would trigger expansion unnecessarily, diluting the context.
+
+**What should be measured instead:**
+- **Best result confidence**: Is the top result actually relevant? (`score > 0.7`?)
+- **Average quality**: Is the result set overall useful?
+- **Coverage**: Does the result set actually address the query's intent?
+
+### 1.3 Detail Queries Don't Benefit from Hierarchical Structure (Need to study)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: High | File: `services/hierarchical_retrieval.py:199-203`**
+
+When a query is classified as `DETAIL`, it goes straight to a flat Tier 2 search across **all** atomic memories for the channel:
+
+```python
+else:  # atomic
+    memories = await self._retrieve_atomic(
+        channel_id, query, topic_filter, action_filter, max_results
+    )
+```
+
+This is identical to a flat vector search — the 3-tier hierarchy provides **zero benefit** for detail queries. For a channel with 10,000 atomic memories, the search scans all of them with no topic scoping.
+
+**The better approach:** First identify relevant topic clusters (fast, small search space), then search atomic memories *within those clusters* using `member_ids` as a filter. This narrows the search space from thousands to tens, improving both precision and speed.
+
+### 1.4 Temporal Decay Exists But Is Never Applied to Retrieval Ranking (Need to study)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: High | Files: `services/temporal.py:153-181`, `services/query.py:269`, `services/grounding.py:82`**
+
+The `TemporalResolutionService` has a well-designed `apply_temporal_decay()` method that computes:
+
+```python
+decay_factor = math.exp(-self.decay_rate * (days_ago / 30))
+score = original_score * decay_factor
+```
+
+**However, this method is never called anywhere in production code.** The only temporal method actually used is `enrich_memories_with_temporal()`, which adds text labels ("2 days ago", "Last week") but **does not affect ranking**.
+
+Call sites that use the label-only method (no score adjustment):
+- `query.py:269` — `self.temporal.enrich_memories_with_temporal(memories)`
+- `grounding.py:82` — `self.temporal_service.enrich_memories_with_temporal(limited_memories)`
+- `hierarchical_retrieval.py` — never calls temporal service at all
+
+**Impact:** A decision from 6 months ago has **identical retrieval weight** as one from yesterday. The temporal context only appears as text labels in the LLM generation prompt, relying entirely on the LLM to reason about recency — which is unreliable and inconsistent.
+
+### 1.5 No Feedback Loop for Retrieval Quality (No need first)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | File: `docs/architecture/09-MEMORY_EVAL_PLAN.md`**
+
+There is no mechanism for:
+- User thumbs up/down on answers
+- Tracking which retrieved memories were actually cited in final responses
+- Learning from bad retrievals to improve future ranking
+- Active learning for relevance tuning
+
+The eval plan document (`09-MEMORY_EVAL_PLAN.md`) proposes metrics like Precision@K, Recall@K, and MRR, but these are **documentation only** — no evaluation pipeline runs in production. The eval framework code is in the doc as examples, not as shipped code.
+
+### 1.6 Single Workspace, Slack Only (Can see ChatSDK -> See ChatSDK documentation)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | File: `docs/reference/01-standalone-mcp-server-plan.md:1533-1534`**
+
+The system is hardcoded to a single Slack workspace. The roadmap explicitly lists multi-workspace support as a v1.2 feature (not implemented). No support for Microsoft Teams, Discord, or other communication platforms.
+
+### 1.7 No Real-Time Sync (Need to discuss)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | Files: `config.py:17-18`, `docs/reference/01-standalone-mcp-server-plan.md:1540`**
+
+Sync is pull-based (triggered manually via `sync_channel()`). The `slack_app_token` and `slack_signing_secret` config fields exist but are unused — Socket Mode was planned but never wired into the sync pipeline. The roadmap lists "Slack Events API integration (real-time)" as a v2.0 feature.
+
+### 1.8 No Memory Expiration / Storage Growth Management (Not a big problem, but focus on the query and retrieval and matching score first)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | File: `services/consolidation.py`**
+
+Architecture docs describe a `monthly` consolidation type that would `archive_old_memories`, but the actual implementation only supports `daily`, `weekly`, and `full` — **no archival or expiration logic exists**.
+
+Manual deletion is possible via API (`DELETE /memories`, `DELETE /channels/{channel_id}/memories`), but there is no automated TTL, importance-based pruning, or storage growth management. Over time, channels accumulate unbounded memories with no decay.
+
+### 1.9 ADK Migration Incomplete
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Low | File: `docs/architecture/11-ADK_MIGRATION_PLAN.md`**
+
+The migration plan from direct `google.genai` SDK calls to Google ADK agent architecture is documented but only partially executed. The `agents/` directory has scaffolding (coordinator, orchestrator, retrieval agents), but the MCP tools in `server.py` still use the services path for primary operations.
+
+### 1.10 Query Classification Uses Brittle Regex, Not LLM (need to do)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | File: `services/hierarchical_retrieval.py:49-120`**
+
+The `QueryClassifier` uses hardcoded regex patterns:
+
+```python
+OVERVIEW_PATTERNS = [r"what.*happening", r"summarize", r"overview", ...]
+TOPIC_PATTERNS = [(r"about\s+(?:the\s+)?(\w+)", 1), ...]  # Single word only!
+DETAIL_PATTERNS = [r"who\s+said", r"when\s+did", r"yesterday", ...]
+```
+
+Problems:
+- **Single-word topic capture**: `(\w+)` only captures one word — "API design" becomes just "API"
+- **Priority misclassification**: DETAIL patterns are checked first, so "who said something about authentication" matches `r"who\s+said"` and is classified as DETAIL, skipping topic extraction entirely
+- **No LLM fallback**: `model_query_classification = gemini-2.5-flash-lite` is configured in `config.py:27` but **never used** by the classifier
+- **No multi-topic detection**: "Tell me about NBA and FIFA" would be classified as TOPIC_SPECIFIC with topic="NBA" only, completely missing "FIFA"
+
+### 1.11 Cluster Linking Is a No-Op (Blocks Topic-First Retrieval)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: High | File: `services/consolidation.py:214-231`**
+
+When atomic memories are consolidated into clusters, `_link_memories_to_cluster()` should update atomic memories with their `cluster_id`. But it's implemented as a no-op:
+
+```python
+async def _link_memories_to_cluster(self, memories, cluster_id):
+    # In a future version, we could update memories in Weaviate
+    logger.debug(f"Linked {len(memories)} memories to cluster {cluster_id}")
+```
+
+**This is the single biggest blocker for retrieval improvement.** Without cluster linkage:
+- Atomic memories don't know which cluster they belong to
+- The "topic → atomic" filtered retrieval path is impossible
+- Every consolidation run re-selects the same memories (they never get marked as clustered)
+- Duplicate clusters accumulate indefinitely
+
+### 1.12 No Cross-Channel Search (Can purpose, lower priority)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Medium | Files: `server.py` (all tools), `api/chat_routes.py`**
+
+Every query tool requires a `channel_id` parameter. There is no way to search across multiple channels simultaneously. If a decision about authentication was discussed in `#backend` but the user asks in `#frontend`, it won't be found.
+
+### 1.13 Memory Quality Is Low (5.25/10 Average) (Graph Related Memory maybe can fix, can see the open source project, DeepWiki MCP can help - Claude Code operate related repo to study)
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: High | File: `docs/architecture/09-MEMORY_EVAL_PLAN.md:35`**
+
+The eval plan documents significant quality problems from a 319-memory audit:
+
+| Metric | Value | Target |
+|--------|-------|--------|
+| Average quality score | 5.25/10 | >7.0 |
+| Facts per message | 2.44 | 1.5-2.0 |
+| High quality (>6) | 2.2% | >50% |
+| Vague/generic | 17% | <5% |
+
+Problematic memory examples:
+- "The user does not use 'uv'." (no context — what is this about?)
+- "The output was adjusted accordingly." (what output? how?)
+- "The process runs through all steps." (what process?)
+
+These low-quality memories pollute the retrieval index, causing the system to return vague facts instead of actionable information. **Garbage in, garbage out** — even a perfect retrieval system would struggle with this quality level.
+
+### 1.14 No Per-Query-Type Hybrid Alpha Tuning
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Low | Files: `services/weaviate_client.py:318-354`, `services/hierarchical_retrieval.py:238,265,295`**
+
+A `get_adaptive_alpha()` function exists in `weaviate_client.py` that adjusts alpha based on query length:
+- 1-2 short words → `alpha=0.2` (favor BM25)
+- 1-2 long words → `alpha=0.35`
+- 3-5 words → `alpha=0.5`
+- 6+ words → `alpha=0.7` (favor vector)
+
+**However, the hierarchical retrieval service bypasses this.** It uses:
+- Tier 0 summaries: hardcoded `alpha=0.3`
+- Tier 1 clusters: `settings.hybrid_alpha` (default 0.6)
+- Tier 2 atomic: `settings.hybrid_alpha` (default 0.6)
+
+The adaptive alpha is only used in the raw `hybrid_search` function when no alpha is explicitly passed. Since the hierarchical retrieval always passes an explicit alpha, the adaptive logic never runs for hierarchical queries.
+
+### 1.15 No Semantic Deduplication Across Tiers
+
+- [ ] **Agreed** | - [ ] **Prioritized** | - [ ] **Fixed**
+
+**Severity: Low | File: `services/hierarchical_retrieval.py:206-213`**
+
+When retrieval expands from one tier to another (e.g., cluster → atomic), deduplication only checks by memory ID:
+
+```python
+mem_id = m.get("id") or m.get("memory", "")[:50]
+if mem_id not in seen_ids:
+    seen_ids.add(mem_id)
+    unique_memories.append(m)
+```
+
+But the same information can exist in both a Tier 1 cluster summary and its constituent Tier 2 atomic memories. For example:
+- **Tier 1 cluster**: "The team decided to use JWT with RS256 for authentication"
+- **Tier 2 atomic**: "Team chose RS256 algorithm for JWT signing"
+
+These are semantically identical but have different IDs, so both get included in the LLM context, wasting token budget and potentially confusing the response generator.
+
+---
+
+## 2. Proposed Solutions
+
+### Solution A: Two-Stage Topic-First Retrieval
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.1, #1.3, #1.11**
+
+Replace the flat "pick a tier and search" approach with a two-stage coarse-to-fine retrieval:
+
+```
+Stage 1: Topic Identification (coarse)
+  Input:  "What did the team decide about JWT authentication?"
+  Action: hybrid_search(tier="tier1_cluster", query=question, limit=5)
+  Output: Matched clusters with their member_ids
+          → "authentication" cluster (members: [uuid1, uuid2, ..., uuid15])
+          → "security" cluster (members: [uuid20, uuid21, ..., uuid28])
+
+Stage 2: Focused Retrieval (fine)
+  Input:  member_ids from matched clusters
+  Action: hybrid_search(ids=member_ids, query=question, limit=10)
+  Output: Precise results from a narrowed search space (43 memories instead of 10,000)
+```
+
+**Why this is better than current approach:**
+
+| Aspect | Current (flat Tier 2) | Topic-First |
+|--------|----------------------|-------------|
+| Search space | All atomic memories | Only cluster members |
+| Precision | Low — irrelevant topics compete | High — pre-filtered by topic |
+| Scalability | Degrades with channel size | Bounded by cluster size |
+| Context coherence | Mixed topics in results | Topically focused |
+
+**Prerequisites:**
+1. Fix `_link_memories_to_cluster()` to actually set `cluster_id` on atomic memories (Weakness #1.11)
+2. Add a Weaviate filter for `cluster_id IN [...]` or use `member_ids` list lookup
+
+**Estimated effort:** Medium (after consolidation fix)
+
+### Solution B: Bidirectional Tier Expansion
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.1, #1.3**
+
+Allow retrieval to expand both downward (current) and upward:
+
+```python
+async def retrieve(self, channel_id, query, depth="auto", max_results=20):
+    memories = await self._search_at_depth(depth, ...)
+
+    # Current: only downward expansion
+    if self._should_expand(memories, direction="down"):
+        expanded = await self._search_deeper(...)
+        memories.extend(expanded)
+
+    # NEW: upward expansion when detail results are weak
+    if self._should_expand(memories, direction="up"):
+        broader = await self._search_broader(...)
+        memories = self._merge_and_rerank(memories, broader)
+```
+
+**Upward expansion scenarios:**
+- Detail query returns 0 results → expand to cluster for topic summary
+- Detail query returns low-confidence results → check if a cluster summary answers the question directly
+- Overview query with stale Tier 0 → synthesize from fresh Tier 2 atomics (bottom-up)
+
+**Estimated effort:** Low-Medium
+
+### Solution C: Score-Based Expansion Thresholds
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.2**
+
+Replace magic count thresholds with relevance-score-based decisions:
+
+```python
+def _should_expand(self, memories: list[dict], direction: str) -> bool:
+    """Decide whether to expand to adjacent tier based on result quality."""
+    if not memories:
+        return True  # No results — always expand
+
+    scores = [m.get("score", 0) for m in memories]
+    max_score = max(scores)
+    avg_score = sum(scores) / len(scores)
+
+    # Expand if best result is low-confidence
+    if max_score < self.expansion_score_threshold:  # configurable, e.g., 0.6
+        return True
+
+    # Expand if overall quality is poor
+    if avg_score < self.expansion_avg_threshold:  # configurable, e.g., 0.4
+        return True
+
+    # Don't expand if we have good results
+    return False
+```
+
+After expansion, **re-rank the combined results** by score instead of just appending:
+
+```python
+all_memories = original + expanded
+all_memories.sort(key=lambda m: m.get("score", 0), reverse=True)
+return all_memories[:max_results]
+```
+
+**Estimated effort:** Low
+
+### Solution D: Apply Temporal Decay to Retrieval Ranking
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.4**
+
+Wire the existing `apply_temporal_decay()` method into the retrieval pipeline:
+
+```python
+# In HierarchicalRetrievalService.retrieve(), before returning:
+if settings.temporal_decay_rate > 0:
+    self.temporal_service.apply_temporal_decay(result.memories)
+```
+
+This single line change would:
+- Boost recent memories (decay_factor ≈ 1.0 for today)
+- Penalize old memories (decay_factor ≈ 0.9 for 1 month, ≈ 0.74 for 3 months)
+- Re-sort results so recency-weighted relevance determines order
+
+The decay rate is already configurable via `temporal_decay_rate` (default 0.1 = 10% per 30 days).
+
+**Estimated effort:** Very low (1 line + import)
+
+### Solution E: LLM-Augmented Query Classification
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.10**
+
+Replace the regex classifier with a hybrid approach:
+
+```python
+class QueryClassifier:
+    async def classify(self, query: str) -> tuple[QueryType, dict]:
+        # Fast path: clear-cut queries via regex (zero cost)
+        regex_result = self._regex_classify(query)
+        if regex_result.confidence > 0.9:
+            return regex_result
+
+        # Slow path: ambiguous queries via LLM (flash-lite, ~0.001$/call)
+        return await self._llm_classify(query)
+
+    async def _llm_classify(self, query: str) -> tuple[QueryType, dict]:
+        """Use the already-configured model_query_classification."""
+        prompt = f"""Classify this query and extract topics:
+        Query: {query}
+
+        Output JSON:
+        {{"type": "overview|topic|detail", "topics": ["topic1", "topic2"], "temporal": "recent|any"}}
+        """
+        response = self.gemini.models.generate_content(
+            model=settings.model_query_classification,  # gemini-flash-lite
+            contents=prompt,
+        )
+        return self._parse_classification(response.text)
+```
+
+**Key improvements over pure regex:**
+- Multi-word topic extraction: "API design" instead of just "API"
+- Multi-topic detection: "NBA and FIFA" → `["NBA", "FIFA"]`
+- Nuanced classification: "Can you tell me everything about the database migration timeline?" → correctly identified as needing both cluster and atomic data
+- Temporal intent detection: "What happened yesterday" → `temporal="recent"`
+
+**Cost:** ~$0.001 per query using `gemini-2.5-flash-lite` (the model is already configured but unused)
+
+**Estimated effort:** Medium
+
+### Solution F: Memory Quality Pipeline
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.13**
+
+The eval plan documents quality at 5.25/10. Improvements:
+
+**F1: Extraction-time quality filter**
+```python
+def is_quality_memory(fact: str) -> bool:
+    """Reject low-quality extractions before storing."""
+    # Too short to be useful
+    if len(fact) < 40:
+        return False
+
+    # Vague/context-dependent patterns
+    vague = ["the user", "the process", "this was", "it was",
+             "the output", "the same", "as mentioned", "was adjusted"]
+    if any(p in fact.lower() for p in vague):
+        return False
+
+    # Must contain at least one specific noun/entity
+    # (simple heuristic: has a capitalized word that isn't sentence-start)
+    words = fact.split()
+    has_entity = any(w[0].isupper() for w in words[1:] if len(w) > 1)
+    has_number = any(c.isdigit() for c in fact)
+    if not has_entity and not has_number:
+        # Likely too generic
+        return False
+
+    return True
+```
+
+**F2: Reduce facts-per-message target**
+Adjust the extraction prompt to request 1-2 high-quality facts instead of extracting everything:
+```
+Extract only the MOST IMPORTANT 1-2 facts from this message.
+Each fact MUST be self-contained — understandable without reading the original message.
+Do NOT extract obvious, trivial, or context-dependent statements.
+```
+
+**F3: Post-extraction quality scoring**
+Score memories at insertion time and store the score. Use it as a retrieval-time boost:
+```python
+# In hybrid_search, adjust score by quality
+for mem in results:
+    quality = mem.get("quality_score", 0.5)
+    mem["score"] = mem["score"] * (0.7 + 0.3 * quality)  # Quality-weighted
+```
+
+**Estimated effort:** Medium
+
+### Solution G: Adaptive Hybrid Alpha Per Query Type
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.14**
+
+Wire `get_adaptive_alpha()` into the hierarchical retrieval service:
+
+```python
+# In _retrieve_clusters and _retrieve_atomic:
+async def _retrieve_clusters(self, channel_id, query, topic_filter, limit):
+    return await hybrid_search(
+        channel_id=channel_id,
+        query=query,
+        tier_filter=MemoryTier.TOPIC_CLUSTER.value,
+        topic_filter=topic_filter,
+        limit=limit,
+        alpha=None,  # Let hybrid_search use get_adaptive_alpha()
+    )
+```
+
+By passing `alpha=None` instead of `settings.hybrid_alpha`, the existing `get_adaptive_alpha()` function will be used automatically. This is a one-line change per retrieval method.
+
+**Estimated effort:** Very low
+
+### Solution H: Cross-Tier Semantic Deduplication
+
+- [ ] **Approved** | - [ ] **In Progress** | - [ ] **Done**
+
+**Addresses: #1.15**
+
+After combining results from multiple tiers, perform semantic dedup:
+
+```python
+def _semantic_dedup(self, memories: list[dict], threshold: float = 0.85) -> list[dict]:
+    """Remove semantically similar memories across tiers, preferring more specific ones."""
+    unique = []
+    for mem in memories:
+        is_dup = False
+        for existing in unique:
+            # Simple text similarity via word overlap (no embedding needed)
+            sim = self._jaccard_similarity(
+                mem.get("memory", "").lower().split(),
+                existing.get("memory", "").lower().split()
+            )
+            if sim > threshold:
+                # Keep the more specific one (longer text, or lower tier)
+                if len(mem.get("memory", "")) > len(existing.get("memory", "")):
+                    unique.remove(existing)
+                    unique.append(mem)
+                is_dup = True
+                break
+        if not is_dup:
+            unique.append(mem)
+    return unique
+```
+
+**Estimated effort:** Low
+
+---
+
+## 3. Implementation Roadmap
+
+### Phase 1: Quick Wins (1-2 days)
+
+- [ ] **Started** | - [ ] **Completed**
+
+These require minimal code changes and have immediate impact:
+
+| # | Solution | Effort | Impact | Addresses |
+|---|----------|--------|--------|-----------|
+| 1 | **D: Apply temporal decay** — add 1 line to `retrieve()` | Very low | High | #1.4 |
+| 2 | **G: Adaptive alpha** — pass `alpha=None` in 3 methods | Very low | Medium | #1.14 |
+| 3 | **C: Score-based expansion** — replace count checks with score checks | Low | High | #1.2 |
+
+### Phase 2: Consolidation Fix (2-3 days)
+
+- [ ] **Started** | - [ ] **Completed**
+
+This is the prerequisite for all cluster-based improvements:
+
+| # | Solution | Effort | Impact | Addresses |
+|---|----------|--------|--------|-----------|
+| 4 | **Fix `_link_memories_to_cluster()`** — implement Weaviate property update | Low | Critical | #1.11 |
+| 5 | **Clean up existing duplicate clusters** — one-time migration | Low | High | #1.11 |
+| 6 | **Add `cluster_id` filter to `hybrid_search()`** | Low | Medium | Prereq for A |
+
+### Phase 3: Retrieval Redesign (1-2 weeks)
+
+- [ ] **Started** | - [ ] **Completed**
+
+With consolidation fixed, implement the core retrieval improvements:
+
+| # | Solution | Effort | Impact | Addresses |
+|---|----------|--------|--------|-----------|
+| 7 | **A: Topic-first retrieval** — two-stage coarse-to-fine search | Medium | High | #1.1, #1.3, #1.11 |
+| 8 | **B: Bidirectional expansion** — add upward expansion path | Medium | Medium | #1.1, #1.3 |
+| 9 | **E: LLM query classification** — hybrid regex + flash-lite | Medium | High | #1.10 |
+| 10 | **H: Semantic dedup** — cross-tier similarity check | Low | Low | #1.15 |
+
+### Phase 4: Quality & Ecosystem (2-4 weeks)
+
+- [ ] **Started** | - [ ] **Completed**
+
+Longer-term improvements for overall system quality:
+
+| # | Solution | Effort | Impact | Addresses |
+|---|----------|--------|--------|-----------|
+| 11 | **F: Memory quality pipeline** — extraction filter + scoring | Medium | High | #1.13 |
+| 12 | **Feedback loop** — track citation usage + user ratings | High | High | #1.5 |
+| 13 | **Cross-channel search** — multi-channel query routing | Medium | Medium | #1.12 |
+| 14 | **Real-time sync** — Socket Mode integration | High | Medium | #1.7 |
+| 15 | **Memory TTL/pruning** — automated expiration | Medium | Medium | #1.8 |
+
+### Dependency Graph
+
+```
+Phase 1 (Quick Wins) ──────────────────────────────────────────────┐
+  D: Temporal decay ─────────────────────── standalone             │
+  G: Adaptive alpha ─────────────────────── standalone             │
+  C: Score-based expansion ──────────────── standalone             │
+                                                                   │
+Phase 2 (Consolidation Fix) ──────────────────────────────────┐    │
+  Fix _link_memories_to_cluster ──┬── prerequisite for ───┐   │    │
+  Clean up duplicate clusters ────┘                       │   │    │
+  Add cluster_id filter ──────────────────────────────────│───┘    │
+                                                          │        │
+Phase 3 (Retrieval Redesign) ─────────────────────────────│────────┘
+  A: Topic-first retrieval ◄──────────────────────────────┘
+  B: Bidirectional expansion ──── standalone (enhanced by A)
+  E: LLM query classification ── standalone (enhanced by A)
+  H: Semantic dedup ──────────── standalone
+
+Phase 4 (Quality & Ecosystem) ── all standalone
+  F: Memory quality pipeline
+  Feedback loop
+  Cross-channel search
+  Real-time sync
+  Memory TTL/pruning
+```
+
+### Expected Cumulative Impact
+
+| After Phase | Retrieval Precision | Key Capability Gained |
+|-------------|--------------------|-----------------------|
+| Phase 1 | +15-20% | Recent results rank higher; better alpha tuning; smarter expansion |
+| Phase 2 | +5% (indirect) | Unlocks cluster-based retrieval; stops duplicate cluster growth |
+| Phase 3 | +30-40% | Topic-scoped search; multi-topic queries; bidirectional expansion |
+| Phase 4 | +10-15% | Cleaner index; cross-channel; real-time data |
+
+---
+
+*This document captures improvement ideas based on validated codebase analysis. Precision estimates are directional, not measured. Actual impact should be validated using the evaluation framework proposed in `docs/architecture/09-MEMORY_EVAL_PLAN.md`.*
diff --git a/docs/v1-archive/TECHNICAL_PROPOSAL_MONOLITH.md b/docs/v1-archive/TECHNICAL_PROPOSAL_MONOLITH.md
new file mode 100644
index 00000000..9ee7bf1a
--- /dev/null
+++ b/docs/v1-archive/TECHNICAL_PROPOSAL_MONOLITH.md
@@ -0,0 +1,2105 @@
+# Beever Atlas v2: Technical Architecture Proposal
+
+> **Date**: 2026-03-24 (v3 — final revision)
+> **Status**: Proposal — under review
+> **Scope**: Full architecture redesign from demo to production-ready system
+> **Deliverable**: Architecture validation document
+
+---
+
+## 1. Executive Summary
+
+Beever Atlas v1 demonstrated that a wiki-first, hierarchical memory system for Slack channels is viable. However, the demo-stage implementation has 15 validated weaknesses: cluster linking is a no-op, the query classifier uses brittle regex, memory quality is 5.25/10, temporal decay is never applied, and there is no support for relational queries.
+
+**Beever Atlas v2** redesigns the system around two complementary memory systems:
+
+- **Semantic Memory (Weaviate)** — Hierarchical 3-tier memory (improved from v1) handling factual, topic-based, and overview queries via hybrid BM25+vector search. Handles ~80% of queries. Cheap, fast.
+- **Graph Memory (Neo4j)** — Flexible knowledge graph capturing entity relationships and temporal evolution from conversations. Handles relational queries that semantic search can't answer. ~20% of queries.
+- **Smart Router** — LLM-powered query understanding that routes to Semantic, Graph, or both in parallel based on query type and cost optimization.
+
+**Design Principle**: Each memory system does what it's best at. They don't duplicate each other's work. Weaviate owns facts and topics. Neo4j owns entities and relationships. The router decides which to use.
+
+```
+┌─────────────────────────────────────────────────────────────────────────┐
+│                        BEEVER ATLAS v2 OVERVIEW                        │
+│                                                                         │
+│                         ┌──────────────┐                                │
+│                         │  Smart Query │                                │
+│              ┌──────────│    Router    │──────────┐                     │
+│              │          └──────────────┘          │                     │
+│              ▼                                    ▼                     │
+│  ┌─────────────────────────┐     ┌─────────────────────────┐          │
+│  │   SEMANTIC MEMORY       │     │    GRAPH MEMORY         │          │
+│  │   (Weaviate)            │     │    (Neo4j)              │          │
+│  │                         │     │                         │          │
+│  │  Tier 0: Summary        │     │  Flexible entities:     │          │
+│  │  Tier 1: Topic Clusters │     │  Person, Decision,      │          │
+│  │  Tier 2: Atomic Facts   │     │  Project, Technology,   │          │
+│  │                         │     │  Team, Meeting, ...     │          │
+│  │  Hybrid BM25+Vector     │     │  Flexible relationships │          │
+│  │  Cross-modal (img/pdf)  │     │  Temporal tracking      │          │
+│  │  Wiki-first (free reads)│     │  Multi-hop traversal    │          │
+│  │                         │     │                         │          │
+│  │  "What was discussed?"  │     │  "Who decided what?"    │          │
+│  │  "Find docs about X"   │     │  "How did X evolve?"    │          │
+│  │  "Show me the overview" │     │  "What blocks project?" │          │
+│  │                         │     │                         │          │
+│  │  ~80% of queries        │     │  ~20% of queries        │          │
+│  │  < 200ms, low cost      │     │  200ms-1s, medium cost  │          │
+│  └────────────┬────────────┘     └────────────┬────────────┘          │
+│               │                                │                       │
+│               └────────────┬───────────────────┘                       │
+│                            ▼                                           │
+│                   ┌──────────────┐                                     │
+│                   │   Response   │                                     │
+│                   │  Generator   │──▶  Grounded answer + citations     │
+│                   └──────────────┘                                     │
+│                                                                         │
+│  ┌──────────┐    ┌──────────────┐    ┌──────────────┐                  │
+│  │  Slack   │    │  Ingestion   │    │   MongoDB    │                  │
+│  │  Teams   │───▶│  Pipeline    │    │  (state +    │                  │
+│  │  Discord │    │              │───▶│   wiki cache)│                  │
+│  └──────────┘    └──────┬───────┘    └──────────────┘                  │
+│                         │                                               │
+│                    Writes to BOTH                                       │
+│                  Weaviate AND Neo4j                                     │
+└─────────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## 2. Current System Weaknesses (Lessons Learned)
+
+Validated against the v1 codebase. Each weakness has a specific fix in v2.
+
+### Critical
+| # | Weakness | File Reference | v2 Fix |
+|---|----------|---------------|--------|
+| 1.11 | Cluster linking is a no-op | `consolidation.py:214-231` | Actually write `cluster_id` to atomic memories in Weaviate |
+| 1.3 | Detail queries bypass hierarchy | `hierarchical_retrieval.py:199-203` | Two-stage topic-first retrieval (Solution A) |
+| 1.13 | Memory quality 5.25/10 | `09-MEMORY_EVAL_PLAN.md:35` | Quality gate: reject vague facts, max 2 per message |
+| 1.10 | Brittle regex classifier | `hierarchical_retrieval.py:49-120` | LLM-powered query understanding (flash-lite) |
+
+### High
+| # | Weakness | File Reference | v2 Fix |
+|---|----------|---------------|--------|
+| 1.4 | Temporal decay never applied | `temporal.py:153-181` | Wire `apply_temporal_decay()` into retrieval ranking |
+| 1.1 | Top-down only retrieval | `hierarchical_retrieval.py:170-203` | Bidirectional expansion (up + down) |
+| 1.2 | Meaningless expansion thresholds | `hierarchical_retrieval.py:176,191` | Score-based expansion (`max_score < 0.6`) |
+| 1.6 | Slack only | entire codebase | Python adapter layer with NormalizedMessage |
+
+### Medium
+| # | Weakness | v2 Fix |
+|---|----------|--------|
+| 1.5 | No feedback loop | Citation tracking + retrieval quality metrics |
+| 1.7 | No real-time sync | Optional Chat SDK webhook bridge (Phase 2) |
+| 1.12 | No cross-channel search | Graph memory naturally spans channels |
+| 1.14 | No adaptive alpha | Wire `get_adaptive_alpha()` (pass `alpha=None`) |
+| 1.15 | No semantic dedup | Jaccard similarity dedup across tiers |
+
+---
+
+## 3. Dual-Memory Architecture
+
+### 3.1 Design Principle: Separation of Concerns
+
+Each memory system handles what it's naturally best at. **They do not duplicate each other.**
+
+| | Semantic Memory (Weaviate) | Graph Memory (Neo4j) |
+|---|---|---|
+| **What it stores** | Facts, summaries, topic clusters, multimodal content | Entities, relationships, temporal evolution |
+| **How it's structured** | 3-tier hierarchy (summary → topics → facts) | Flexible knowledge graph (nodes + edges) |
+| **How it's queried** | BM25 + vector hybrid search | Cypher graph traversal |
+| **What questions it answers** | "What was discussed about X?", "Show overview", "Find docs" | "Who decided X?", "What blocks Y?", "How did Z evolve?" |
+| **Query share** | ~80% (most questions are factual/topical) | ~20% (relational/temporal) |
+| **Cost** | Low (embedding search only) | Medium (graph traversal + Weaviate enrichment) |
+| **Latency** | < 200ms | 200ms-1s |
+
+**Why not just one?**
+- Weaviate can't do multi-hop traversal: "Person → works on → Project → has decision → blocked by → Constraint" requires a graph
+- Neo4j can't do fuzzy semantic search across 10K facts with BM25+vector hybrid ranking
+- Using both gives us the best of GraphRAG (from reference papers): vector search for finding relevant content + graph traversal for navigating relationships
+
+### 3.2 Semantic Memory: Weaviate (3-Tier, Improved)
+
+The v1 hierarchical design was sound — the implementation was broken. v2 keeps the 3-tier architecture but fixes every weakness.
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│              SEMANTIC MEMORY: WEAVIATE (3-Tier)                      │
+│                                                                      │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 0: Channel Summary                                      │  │
+│  │  • Channel-level overview ("what's happening?")               │  │
+│  │  • Updated by consolidation service                           │  │
+│  │  • Used for wiki overview section                             │  │
+│  │  • Query: "Catch me up", "Overview", "Status update"          │  │
+│  │  • Access: FREE (cached, no LLM needed)                       │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                              │                                       │
+│                    consolidates from                                  │
+│                              ▼                                       │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 1: Topic Clusters                                       │  │
+│  │  • Grouped memories by topic (authentication, deployment...)  │  │
+│  │  • Each cluster has: summary, member_ids, topic_tags          │  │
+│  │  • member_ids ACTUALLY LINKED to Tier 2 atomics (v1 fix!)    │  │
+│  │  • Used for topic-level questions and wiki topic sections     │  │
+│  │  • Query: "Tell me about auth", "What about deployment?"     │  │
+│  │  • Access: FREE (cached, no LLM needed)                       │  │
+│  │                                                                │  │
+│  │  v2 FIXES:                                                     │  │
+│  │  ✓ _link_memories_to_cluster() actually writes cluster_id    │  │
+│  │  ✓ MERGE-based dedup prevents duplicate clusters              │  │
+│  │  ✓ Two-stage topic-first retrieval (coarse → fine)           │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                              │                                       │
+│                    consolidates from                                  │
+│                              ▼                                       │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 2: Atomic Facts                                         │  │
+│  │  • Individual facts with full metadata and citations          │  │
+│  │  • Named vectors: text (2048-dim), image, doc (Jina v4)      │  │
+│  │  • Cross-modal search (text query → find images/PDFs)         │  │
+│  │  • Quality-scored at extraction (v2: reject < 0.5)            │  │
+│  │  • Linked to Neo4j via graph_entity_ids                       │  │
+│  │  • Query: "What exactly did Alice say?", "Find the diagram"  │  │
+│  │  • Access: PAID (uses embedding for search)                   │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                                                                      │
+│  Wiki-First Cost Optimization (preserved from v1):                  │
+│  • Tier 0 + Tier 1 reads = FREE (pre-generated, cached)            │
+│  • Tier 2 search = CHEAP (embedding only, ~$0.001)                  │
+│  • LLM synthesis = PAID (only when needed, ~$0.02)                  │
+│  • Average query cost: ~$0.01 (5x cheaper than competitors)         │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+#### Weaviate Schema
+
+```python
+properties = [
+    # === Core ===
+    Property(name="memory", data_type=DataType.TEXT),
+    Property(name="channel_id", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="source", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="platform", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="timestamp", data_type=DataType.NUMBER),
+
+    # === Hierarchy (FIXED in v2) ===
+    Property(name="tier", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="cluster_id", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="member_ids", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="member_count", data_type=DataType.INT),
+
+    # === Graph Linkage (NEW) ===
+    Property(name="graph_entity_ids", data_type=DataType.TEXT_ARRAY,
+             description="Neo4j node IDs extracted from this memory"),
+
+    # === Quality (NEW) ===
+    Property(name="quality_score", data_type=DataType.NUMBER),
+
+    # === Temporal (NEW) ===
+    Property(name="valid_at", data_type=DataType.DATE),
+    Property(name="invalid_at", data_type=DataType.DATE),
+
+    # === Tagging ===
+    Property(name="topic_tags", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="entity_tags", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="action_tags", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="importance", data_type=DataType.TEXT, skip_vectorization=True),
+
+    # === Citations ===
+    Property(name="message_ts", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="thread_ts", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="user_name", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="slack_user_id", data_type=DataType.TEXT, skip_vectorization=True),
+
+    # === Files ===
+    Property(name="file_id", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="filename", data_type=DataType.TEXT, skip_vectorization=True),
+]
+# Named vectors: text_vector, image_vector, doc_vector (2048-dim Jina v4)
+```
+
+#### Retrieval Improvements (All 15 Weaknesses Fixed)
+
+```python
+class ImprovedSemanticRetriever:
+    """Weaviate retrieval with all v1 weaknesses fixed."""
+
+    async def retrieve(self, query: str, channel_id: str,
+                       query_understanding: QueryUnderstanding) -> list[dict]:
+
+        depth = query_understanding.semantic_depth  # "overview", "topic", "detail", "auto"
+
+        if depth == "overview":
+            # Tier 0 → optional expand to Tier 1
+            memories = await self._retrieve_summary(channel_id, query)
+            if self._should_expand(memories, "down"):
+                memories += await self._retrieve_clusters(channel_id, query)
+
+        elif depth == "topic":
+            # FIX 1.3 + 1.11: Two-stage topic-first retrieval
+            # Stage 1 (coarse): Find relevant topic clusters
+            clusters = await self._retrieve_clusters(
+                channel_id, query,
+                topic_filter=query_understanding.topics,
+                alpha=None,  # FIX 1.14: Adaptive alpha
+            )
+            # Stage 2 (fine): Search atomics WITHIN matched clusters
+            if clusters:
+                member_ids = self._collect_member_ids(clusters)
+                atomics = await self._retrieve_atomics_scoped(
+                    channel_id, query, member_ids,
+                    alpha=None,  # FIX 1.14
+                )
+                memories = clusters + atomics
+            else:
+                # No matching clusters → fall back to global atomic search
+                memories = await self._retrieve_atomics(channel_id, query)
+
+            # FIX 1.1: Bidirectional — expand UP if results are weak
+            if self._should_expand(memories, "up"):
+                summaries = await self._retrieve_summary(channel_id, query)
+                memories = self._merge_and_rerank(memories, summaries)
+
+        else:  # detail
+            # Direct atomic search, with optional upward expansion
+            memories = await self._retrieve_atomics(
+                channel_id, query, alpha=None,  # FIX 1.14
+            )
+            # FIX 1.1: Can expand UP to clusters for broader context
+            if self._should_expand(memories, "up"):
+                clusters = await self._retrieve_clusters(channel_id, query)
+                memories = self._merge_and_rerank(memories, clusters)
+
+        # FIX 1.4: Apply temporal decay to ranking
+        self._apply_temporal_decay(memories)
+
+        # FIX 1.13: Quality-weighted ranking boost
+        self._apply_quality_boost(memories)
+
+        # FIX 1.15: Semantic dedup across tiers
+        memories = self._semantic_dedup(memories)
+
+        return memories[:max_results]
+
+    def _should_expand(self, memories: list, direction: str) -> bool:
+        """FIX 1.2: Score-based expansion, not count-based."""
+        if not memories:
+            return True
+        scores = [m.get("score", 0) for m in memories]
+        return max(scores) < 0.6 or (sum(scores) / len(scores)) < 0.4
+
+    def _apply_temporal_decay(self, memories: list) -> None:
+        """FIX 1.4: Actually apply the existing temporal decay function."""
+        for m in memories:
+            days_ago = self._days_since(m.get("timestamp"))
+            m["score"] = self.temporal_decay.apply(m["score"], days_ago, m)
+        memories.sort(key=lambda m: m.get("score", 0), reverse=True)
+
+    def _apply_quality_boost(self, memories: list) -> None:
+        """FIX 1.13: Quality-weighted ranking — good memories score higher."""
+        for m in memories:
+            quality = m.get("quality_score", 0.5)
+            m["score"] = m["score"] * (0.7 + 0.3 * quality)
+        memories.sort(key=lambda m: m.get("score", 0), reverse=True)
+
+    def _semantic_dedup(self, memories: list, threshold=0.85) -> list:
+        """FIX 1.15: Remove near-duplicates across tiers."""
+        unique = []
+        for mem in memories:
+            is_dup = any(
+                self._jaccard_similarity(mem["memory"], e["memory"]) > threshold
+                for e in unique
+            )
+            if not is_dup:
+                unique.append(mem)
+        return unique
+```
+
+### 4.3 Temporal Decay Configuration
+
+```python
+class TemporalDecay:
+    """Ebbinghaus-based temporal decay with exemptions and reinforcement."""
+    DEFAULT_DECAY_RATE = 0.1
+
+    # Facts with these action_tags decay at half rate
+    SLOW_DECAY_TAGS = {"decision", "architecture", "policy", "deadline"}
+
+    # Facts with these importance levels are exempt from decay
+    EXEMPT_IMPORTANCE = {"high", "critical"}
+
+    def apply(self, score: float, days_ago: float, fact: dict) -> float:
+        """Apply temporal decay to a retrieval score."""
+        if fact.get("importance") in self.EXEMPT_IMPORTANCE:
+            return score  # No decay for high-importance facts
+
+        rate = self.DEFAULT_DECAY_RATE
+        # Half decay for architectural decisions
+        if any(tag in self.SLOW_DECAY_TAGS for tag in fact.get("action_tags", [])):
+            rate *= 0.5
+
+        # Citation reinforcement: cited facts decay slower
+        citation_count = fact.get("citation_count", 0)
+        if citation_count > 0:
+            rate = rate / (1 + 0.1 * citation_count)
+
+        decay = math.exp(-rate * (days_ago / 30))
+        return score * decay
+```
+
+**Decay behavior at `DECAY_RATE = 0.1`:**
+
+| Fact Age | Score Multiplier | Effect |
+|----------|-----------------|--------|
+| 1 day | 0.997 | Essentially no decay |
+| 7 days | 0.977 | Minimal (~2% reduction) |
+| 30 days | 0.905 | Mild (~10% reduction) |
+| 90 days | 0.741 | Moderate (~26% reduction) |
+| 180 days | 0.549 | Significant (~45% reduction) |
+| 365 days | 0.295 | Strong (~70% reduction) |
+
+**Exemptions:**
+- Facts tagged `importance: "high"` or `"critical"` → no decay
+- Facts tagged `action_tags: ["decision", "architecture", "policy"]` → half decay rate (0.05)
+- Facts cited 5+ times → effective rate drops to ~0.067
+
+**Configuration:**
+```python
+# In config.py Settings
+decay_rate: float = 0.1
+decay_slow_tags: list[str] = ["decision", "architecture", "policy", "deadline"]
+decay_exempt_importance: list[str] = ["high", "critical"]
+decay_reinforcement_factor: float = 0.1
+```
+
+#### Citation Tracking (FIX 1.5)
+
+The response generator logs which memories were actually cited, enabling retrieval quality measurement:
+
+```python
+class ResponseGenerator:
+    async def generate(self, query: str, memories: list, ...) -> Response:
+        response = await self._llm_generate(query, memories)
+        cited_ids = self._extract_cited_memory_ids(response)
+
+        # Log to MongoDB for quality analysis
+        await self.mongo.quality_logs.insert_one({
+            "query": query,
+            "route": "semantic" | "graph" | "both",
+            "retrieved_count": len(memories),
+            "retrieved_ids": [m["id"] for m in memories],
+            "cited_ids": cited_ids,
+            "precision": len(cited_ids) / max(len(memories), 1),
+            "timestamp": datetime.utcnow(),
+        })
+        return response
+```
+
+This enables Precision@K tracking, identifying underperforming queries, and future active learning.
+
+#### Consolidation Service (FIXED)
+
+```python
+class ConsolidationService:
+    """Fixed consolidation that ACTUALLY links clusters to atomics."""
+
+    async def _link_memories_to_cluster(self, memories, cluster_id):
+        """v1: no-op. v2: ACTUALLY writes cluster_id to each atomic memory."""
+        collection = self.weaviate.collections.get(COLLECTION_NAME)
+        for memory in memories:
+            if memory.get("id"):
+                collection.data.update(
+                    uuid=memory["id"],
+                    properties={"cluster_id": cluster_id}
+                )
+        logger.info(f"Linked {len(memories)} memories to cluster {cluster_id}")
+
+    async def _consolidate_to_clusters(self, channel_id):
+        """Fixed: uses content hash to detect existing clusters and prevent duplicates."""
+        unclustered = await self._get_unclustered_memories(channel_id)
+        topic_groups = self._group_by_topic(unclustered)
+
+        for topic, memories in topic_groups.items():
+            if len(memories) < self.cluster_threshold:
+                continue
+
+            # Check if cluster for this topic already exists
+            existing = await self._find_existing_cluster(channel_id, topic)
+            if existing:
+                # Update existing cluster summary + add new members
+                await self._update_cluster(existing, memories)
+            else:
+                # Create new cluster
+                cluster_id = await self._create_topic_cluster(channel_id, topic, memories)
+
+            # THIS ACTUALLY WORKS NOW
+            await self._link_memories_to_cluster(memories, cluster_id or existing["id"])
+
+    async def _find_existing_cluster(self, channel_id, topic):
+        """Prevent duplicate clusters by checking for existing topic cluster."""
+        results = await hybrid_search(
+            channel_id=channel_id, query=topic,
+            tier_filter="tier1_cluster", topic_filter=[topic],
+            limit=1, alpha=0.0,  # Pure keyword match on topic
+        )
+        return results[0] if results else None
+```
+
+### 3.3 Graph Memory: Neo4j (Flexible)
+
+The graph memory captures **relationship meaning** from conversations — things that semantic search fundamentally cannot handle.
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                GRAPH MEMORY: Neo4j (Flexible)                        │
+│                                                                      │
+│  PURPOSE: Capture WHO did WHAT, WHEN, and HOW things RELATE         │
+│                                                                      │
+│  ┌────────────────────────────────────────────────────────────────┐ │
+│  │  GUIDED-FLEXIBLE ENTITY SCHEMA                                 │ │
+│  │                                                                │ │
+│  │  All nodes share a base:                                       │ │
+│  │  ┌──────────────────────────────────┐                         │ │
+│  │  │  name:        str    (required)  │                         │ │
+│  │  │  entity_type: str    (required)  │                         │ │
+│  │  │  description: str    (optional)  │                         │ │
+│  │  │  channel:     str               │                         │ │
+│  │  │  platform:    str               │                         │ │
+│  │  │  properties:  dict   (flexible) │                         │ │
+│  │  │  created_at:  datetime          │                         │ │
+│  │  │  updated_at:  datetime          │                         │ │
+│  │  └──────────────────────────────────┘                         │ │
+│  │                                                                │ │
+│  │  Core types (LLM prefers these):                              │ │
+│  │  Person, Decision, Project, Technology                        │ │
+│  │                                                                │ │
+│  │  Extension types (LLM creates as needed):                     │ │
+│  │  Team, Meeting, Artifact, Constraint, Budget, Deadline, ...   │ │
+│  │                                                                │ │
+│  │  Event node (episodic anchor):                                │ │
+│  │  ┌──────────────────────────────────┐                         │ │
+│  │  │  weaviate_id: str  → links to   │                         │ │
+│  │  │               Weaviate atomic    │                         │ │
+│  │  │  timestamp:   datetime           │                         │ │
+│  │  │  channel:     str               │                         │ │
+│  │  └──────────────────────────────────┘                         │ │
+│  └────────────────────────────────────────────────────────────────┘ │
+│                                                                      │
+│  ┌────────────────────────────────────────────────────────────────┐ │
+│  │  FLEXIBLE RELATIONSHIPS                                        │ │
+│  │                                                                │ │
+│  │  NOT a fixed list — LLM extracts whatever relationship        │ │
+│  │  best captures the meaning:                                    │ │
+│  │                                                                │ │
+│  │  Common patterns:                                              │ │
+│  │  Person  ──DECIDED──▶       Decision                          │ │
+│  │  Person  ──WORKS_ON──▶      Project                           │ │
+│  │  Person  ──MEMBER_OF──▶     Team                              │ │
+│  │  Decision──AFFECTS──▶       Project                           │ │
+│  │  Decision──SUPERSEDES──▶    Decision  (temporal evolution)    │ │
+│  │  Decision──BLOCKED_BY──▶    Constraint                        │ │
+│  │  Decision──USES──▶          Technology                        │ │
+│  │  Project ──DEPENDS_ON──▶    Project                           │ │
+│  │  Meeting ──PRODUCED──▶      Decision                          │ │
+│  │  Any     ──MENTIONED_IN──▶  Event     (episodic link)        │ │
+│  │  Any     ──ALIAS_OF──▶     Any       (entity dedup)          │ │
+│  │                                                                │ │
+│  │  Bidirectional edges (auto-created during ingestion):         │ │
+│  │  DECIDED ↔ DECIDED_BY, BLOCKED_BY ↔ BLOCKS,                  │ │
+│  │  WORKS_ON ↔ HAS_MEMBER, OWNS ↔ OWNED_BY                      │ │
+│  │                                                                │ │
+│  │  LLM can create ANY relationship type. The graph adapts       │ │
+│  │  to whatever patterns exist in the organization's             │ │
+│  │  conversations.                                                │ │
+│  │                                                                │ │
+│  │  Temporal properties on ALL relationships:                    │ │
+│  │  • valid_from:  datetime                                      │ │
+│  │  • valid_until: datetime (null = currently valid)             │ │
+│  │  • created_at:  datetime (bi-temporal tracking)               │ │
+│  │  • confidence:  float                                         │ │
+│  └────────────────────────────────────────────────────────────────┘ │
+│                                                                      │
+│  EPISODIC LINKING (graph ↔ Weaviate):                               │
+│  • Every graph entity connects to Event nodes                       │
+│  • Event.weaviate_id → points to atomic fact in Weaviate           │
+│  • Enables: graph traversal → find entities → follow episodic      │
+│    edges → retrieve original fact text + Slack citations            │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+#### Neo4j Implementation
+
+```python
+class Neo4jStore:
+    """Flexible graph memory — any entity type, any relationship type.
+
+    Entity scoping: Global entities (Person, Technology, Project, Team) are
+    MERGED by name only — the same entity spans all channels. Channel-scoped
+    entities (Decision, Meeting, Artifact) are MERGED by name + channel.
+    """
+
+    QUERY_TIMEOUT_MS = 5000  # Hard limit on all graph queries
+
+    # Cross-channel scoping rules
+    ENTITY_SCOPING = {
+        "Person":     "global",    # Alice is Alice everywhere
+        "Technology": "global",    # React is React everywhere
+        "Project":    "global",    # Project names span channels
+        "Team":       "global",    # Teams span channels
+        "Decision":   "channel",   # Decisions are channel-contextual
+        "Meeting":    "channel",   # Meetings are channel-contextual
+        "Artifact":   "channel",   # Docs are channel-contextual
+        # Extension types default to "channel"
+    }
+
+    async def upsert_entity(self, entity: dict) -> str:
+        """Create/update entity with scope-aware MERGE."""
+        entity_type = entity["type"]
+        scope = self.ENTITY_SCOPING.get(entity_type, "channel")
+
+        if scope == "global":
+            # Global: MERGE on name only, track channels as array
+            cypher = f"""
+                MERGE (n:{entity_type} {{name: $name}})
+                ON CREATE SET n += $props, n.created_at = datetime(),
+                              n.channels = [$channel],
+                              n.quality_score = $quality_score
+                ON MATCH SET n += $props, n.updated_at = datetime(),
+                             n.channels = CASE
+                               WHEN NOT $channel IN n.channels
+                               THEN n.channels + $channel
+                               ELSE n.channels END,
+                             n.quality_score = CASE
+                               WHEN $quality_score > n.quality_score
+                               THEN $quality_score ELSE n.quality_score END
+                RETURN id(n) as node_id
+            """
+        else:
+            # Channel-scoped: MERGE on name + channel
+            cypher = f"""
+                MERGE (n:{entity_type} {{name: $name, channel: $channel}})
+                ON CREATE SET n += $props, n.created_at = datetime(),
+                              n.quality_score = $quality_score
+                ON MATCH SET n += $props, n.updated_at = datetime(),
+                             n.quality_score = CASE
+                               WHEN $quality_score > n.quality_score
+                               THEN $quality_score ELSE n.quality_score END
+                RETURN id(n) as node_id
+            """
+        return await self.execute(cypher,
+            name=entity["name"], channel=entity.get("channel"),
+            quality_score=entity.get("quality_score", 0.5),
+            props={k: v for k, v in entity.get("properties", {}).items()
+                   if v is not None},
+        )
+
+    async def upsert_relationship(self, rel: dict) -> None:
+        """Create relationship with scope-aware matching + provenance."""
+        rel_type = rel["type"]
+        source_scope = self.ENTITY_SCOPING.get(rel.get("source_type"), "channel")
+        target_scope = self.ENTITY_SCOPING.get(rel.get("target_type"), "channel")
+
+        source_match = "{name: $source}" if source_scope == "global" \
+                       else "{name: $source, channel: $channel}"
+        target_match = "{name: $target}" if target_scope == "global" \
+                       else "{name: $target, channel: $channel}"
+
+        cypher = f"""
+            MATCH (s {source_match})
+            MATCH (t {target_match})
+            MERGE (s)-[r:{rel_type}]->(t)
+            SET r.context = $context,
+                r.source_channel = $channel,
+                r.valid_from = coalesce($valid_from, datetime()),
+                r.created_at = datetime(),
+                r.confidence = $confidence,
+                r.evidence = $evidence,
+                r.source_message_id = $source_message_id,
+                r.source_fact_id = $source_fact_id,
+                r.extracted_at = datetime()
+        """
+        await self.execute(cypher, **rel)
+
+    async def create_episodic_link(self, entity_name: str, weaviate_id: str,
+                                    channel: str, timestamp: float) -> None:
+        """Link a graph entity to its source fact in Weaviate."""
+        # Try global match first, then channel-scoped
+        await self.execute("""
+            MATCH (n)
+            WHERE n.name = $name
+              AND (n.channel = $channel OR $channel IN n.channels)
+            MERGE (e:Event {weaviate_id: $wid})
+            ON CREATE SET e.channel = $channel, e.timestamp = $ts
+            MERGE (n)-[:MENTIONED_IN]->(e)
+        """, name=entity_name, channel=channel, wid=weaviate_id, ts=timestamp)
+
+    async def traverse(self, start_entities: list[str], channel: str = None,
+                       max_hops: int = 2) -> list[dict]:
+        """Bounded, directed traversal with APOC path expansion."""
+        return await self.execute_with_timeout("""
+            MATCH (start)
+            WHERE start.name IN $entities
+              AND ($channel IS NULL
+                   OR start.channel = $channel
+                   OR $channel IN start.channels)
+            CALL apoc.path.expandConfig(start, {
+                minLevel: 1,
+                maxLevel: $max_hops,
+                uniqueness: 'NODE_GLOBAL',
+                limit: 50,
+                relationshipFilter: '>'
+            }) YIELD path
+            WHERE all(r IN relationships(path) WHERE
+                r.valid_until IS NULL OR r.valid_until > datetime())
+            RETURN path
+        """, entities=start_entities, channel=channel, max_hops=max_hops)
+
+    async def temporal_chain(self, entity_name: str, channel: str = None) -> list[dict]:
+        """Bounded SUPERSEDES chain (max 5 hops, distinct per level)."""
+        return await self.execute_with_timeout("""
+            MATCH (d:Decision)
+            WHERE d.name CONTAINS $name
+              AND ($channel IS NULL OR d.channel = $channel
+                   OR $channel IN d.channels)
+            MATCH path = (d)-[:SUPERSEDES*0..5]->(older:Decision)
+            WITH DISTINCT older, path
+            RETURN path ORDER BY older.valid_from DESC
+            LIMIT 20
+        """, name=entity_name, channel=channel)
+
+    async def comprehensive_traverse(self, start_entities: list[str],
+                                      channel: str = None,
+                                      max_hops: int = 3,
+                                      max_nodes: int = 200) -> dict:
+        """Collect-all traversal: gather ALL relationships within N hops,
+        then let the LLM analyze relevance. Inspired by Forensic Eyes'
+        Phase 16 pattern — avoids brittleness from pre-filtering edge types.
+
+        Use for complex graph queries where relationship types are diverse
+        and pre-filtering risks missing cross-cutting context.
+
+        Returns structured subgraph JSON for LLM analysis.
+        """
+        return await self.execute_with_timeout("""
+            MATCH (start)
+            WHERE start.name IN $entities
+              AND ($channel IS NULL
+                   OR start.channel = $channel
+                   OR $channel IN start.channels)
+            CALL apoc.path.expandConfig(start, {
+                minLevel: 1,
+                maxLevel: $max_hops,
+                uniqueness: 'NODE_GLOBAL',
+                limit: $max_nodes
+            }) YIELD path
+            WITH path, relationships(path) AS rels, nodes(path) AS ns
+            WHERE all(r IN rels WHERE
+                r.valid_until IS NULL OR r.valid_until > datetime())
+            UNWIND rels AS r
+            WITH DISTINCT r, startNode(r) AS src, endNode(r) AS tgt,
+                 type(r) AS rel_type
+            RETURN src.name AS source, src.entity_type AS source_type,
+                   tgt.name AS target, tgt.entity_type AS target_type,
+                   rel_type, r.context AS context,
+                   r.confidence AS confidence,
+                   r.evidence AS evidence,
+                   r.source_message_id AS source_message_id
+            ORDER BY r.confidence DESC
+        """, entities=start_entities, channel=channel,
+             max_hops=max_hops, max_nodes=max_nodes)
+
+    async def get_episodic_weaviate_ids(self, node_ids: list[int]) -> list[str]:
+        """Get Weaviate IDs for enriching graph results with full text."""
+        return await self.execute("""
+            MATCH (n)-[:MENTIONED_IN]->(e:Event)
+            WHERE id(n) IN $ids
+            RETURN e.weaviate_id
+        """, ids=node_ids)
+
+    async def execute_with_timeout(self, cypher: str, **params) -> list[dict]:
+        """Execute with transaction timeout — returns [] on timeout."""
+        try:
+            async with self.driver.session() as session:
+                result = await session.run(cypher, **params,
+                                            timeout=self.QUERY_TIMEOUT_MS)
+                return await result.data()
+        except TransientError:
+            logger.warning(f"Graph traversal timed out: {cypher[:80]}...")
+            return []  # Retriever falls back to semantic-only
+```
+
+### 3.4 How the Two Memories Connect
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                  MEMORY INTERCONNECTION                               │
+│                                                                      │
+│  INGESTION (writes to BOTH):                                        │
+│                                                                      │
+│  Message: "Alice decided to use RS256 for JWT — blocked by          │
+│            Carol's security review"                                  │
+│       │                                                              │
+│       ├──▶ WEAVIATE: Atomic fact stored with embedding              │
+│       │    memory: "Alice decided to use RS256 for JWT,             │
+│       │             blocked by Carol's security review"             │
+│       │    id: uuid-abc-123                                          │
+│       │    graph_entity_ids: [neo4j-1, neo4j-2, neo4j-3]           │
+│       │                                                              │
+│       └──▶ NEO4J: Entities + relationships extracted                │
+│            Person(Alice) ──DECIDED──▶ Decision(Use RS256)           │
+│            Decision(Use RS256) ──USES──▶ Technology(JWT)            │
+│            Decision(Use RS256) ──BLOCKED_BY──▶ Person(Carol)        │
+│            All entities ──MENTIONED_IN──▶ Event(weaviate_id:        │
+│                                                uuid-abc-123)        │
+│                                                                      │
+│  QUERY (reads from ONE or BOTH):                                    │
+│                                                                      │
+│  "What was discussed about JWT?"                                    │
+│    → Router: SEMANTIC → Weaviate hybrid search → fast, cheap        │
+│                                                                      │
+│  "Who decided to use RS256?"                                        │
+│    → Router: GRAPH → Neo4j traversal:                               │
+│      Decision(RS256) ←DECIDED── Person(Alice)                       │
+│      → Follow episodic edge → Weaviate(uuid-abc-123) for full text │
+│                                                                      │
+│  "Tell me about the JWT migration"                                  │
+│    → Router: BOTH (ambiguous) → run in parallel:                    │
+│      Weaviate: semantic facts about JWT                             │
+│      Neo4j: entities related to JWT (people, decisions, blockers)   │
+│      → Merge, dedup, rank → comprehensive answer                   │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## 4. Smart Query Router
+
+### 4.0 Query Decomposition (Preserved from v1)
+
+Complex questions are decomposed into focused parallel sub-queries before routing. This was a key v1 feature that the v2 router must preserve and enhance.
+
+```python
+class QueryDecomposer:
+    """Decompose complex questions into parallel sub-queries.
+
+    Example:
+    "What auth method did we decide on and how does it compare to best practices?"
+    → internal_queries:
+        - {"query": "authentication decision JWT", "focus": "decision"}
+        - {"query": "OAuth implementation alice", "focus": "implementation"}
+    → external_queries:
+        - {"query": "JWT vs OAuth best practices 2025", "focus": "comparison"}
+    """
+
+    async def decompose(self, question: str) -> QueryPlan:
+        """Break down a question into internal + external sub-queries."""
+        # Fast path: simple questions → single internal query, no decomposition
+        if self._is_simple(question):
+            return QueryPlan(
+                internal_queries=[{"query": question, "focus": "direct"}],
+                external_queries=[],
+            )
+
+        # Complex questions → LLM decomposition (flash-lite)
+        plan = await self._llm_decompose(question)
+        return plan  # 2-4 internal + 0-2 external queries
+
+DECOMPOSITION_PROMPT = """
+You are a query decomposition specialist. Break down this question into
+focused sub-queries that can be executed in parallel.
+
+OUTPUT JSON:
+{
+    "internal_queries": [
+        {"query": "specific search terms", "focus": "what this targets"}
+    ],
+    "external_queries": [
+        {"query": "web search terms", "focus": "what to learn from web"}
+    ]
+}
+
+RULES:
+1. Generate 2-4 focused internal queries for different aspects
+2. Generate 0-2 external queries ONLY if best practices / documentation
+   comparison is needed
+3. Internal queries should be keyword-focused (not full sentences)
+4. If the question is simple, a single internal query suffices
+"""
+```
+
+The decomposed sub-queries are then each routed independently through the Query Understanding step below, enabling parallel execution across both memory systems AND external search.
+
+### 4.1 LLM-Powered Query Understanding
+
+Replaces the brittle regex classifier (weakness 1.10) with an LLM call (~$0.001/query using flash-lite).
+
+```python
+QUERY_UNDERSTANDING_PROMPT = """
+Classify this query for a team communication knowledge base.
+
+Query: {query}
+Channel: {channel_name}
+
+Determine:
+1. route: One of:
+   - "semantic": Looking for facts, discussions, topics, documents
+     Examples: "What was discussed about auth?", "Find deployment docs", "Overview"
+   - "graph": Looking for entity relationships, people, decisions, temporal changes
+     Examples: "Who decided X?", "What is Alice working on?", "What blocks project Y?"
+   - "both": Could benefit from both fact retrieval AND relationship context
+     Examples: "Tell me about the JWT migration", "What happened with the auth project?"
+2. semantic_depth: "overview" | "topic" | "detail" (for Weaviate tier routing)
+3. entities: Named entities mentioned (people, projects, technologies)
+4. topics: Topic areas referenced
+5. temporal_scope: "recent" | "any" | "historical"
+6. confidence: 0.0-1.0
+
+Output JSON.
+"""
+```
+
+### 4.2 Routing Strategy: Cost-Optimized
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                       SMART QUERY ROUTER                             │
+│                                                                      │
+│  User Query                                                          │
+│      │                                                               │
+│      ▼                                                               │
+│  ┌──────────────────────────────────────┐                           │
+│  │  QUERY UNDERSTANDING (LLM flash-lite)│  ~$0.001/query            │
+│  │                                      │                           │
+│  │  route: semantic | graph | both      │                           │
+│  │  semantic_depth: overview|topic|detail│                           │
+│  │  entities: ["Alice", "JWT"]          │                           │
+│  │  topics: ["authentication"]          │                           │
+│  │  confidence: 0.0-1.0                 │                           │
+│  └──────┬──────────┬──────────┬─────────┘                           │
+│         │          │          │                                      │
+│    route=semantic  │     route=both                                  │
+│    conf > 0.7      │     OR conf ≤ 0.7                              │
+│         │     route=graph    │                                      │
+│         │     conf > 0.7     │                                      │
+│         ▼          ▼         ▼                                      │
+│  ┌──────────┐ ┌────────┐ ┌────────────────┐                       │
+│  │ SEMANTIC │ │ GRAPH  │ │ BOTH PARALLEL  │                       │
+│  │ ONLY     │ │ ONLY   │ │                │                       │
+│  │          │ │        │ │ Semantic  Graph│                       │
+│  │ Weaviate │ │ Neo4j  │ │ search + trav. │                       │
+│  │ 3-tier   │ │ + Weav.│ │ in parallel   │                       │
+│  │ retrieval│ │ enrich │ │                │                       │
+│  │          │ │        │ │ Merge results  │                       │
+│  │ $0.001   │ │ $0.005 │ │ $0.006        │                       │
+│  │ < 200ms  │ │ ~500ms │ │ ~500ms        │                       │
+│  └────┬─────┘ └───┬────┘ └───────┬────────┘                       │
+│       │           │              │                                  │
+│       │    ┌──────┘              │                                  │
+│       │    │ Fallback: if graph  │                                  │
+│       │    │ results insufficient│                                  │
+│       │    │ → also run semantic │                                  │
+│       │    │                     │                                  │
+│       └────┴─────────┬───────────┘                                  │
+│                      ▼                                               │
+│  ┌──────────────────────────────────────┐                           │
+│  │  RESULT MERGER + RESPONSE GENERATOR  │                           │
+│  │                                      │                           │
+│  │  1. Deduplicate by weaviate_id      │                           │
+│  │  2. Boost cross-validated results   │                           │
+│  │  3. Apply temporal decay            │                           │
+│  │  4. Quality-score weighted ranking  │                           │
+│  │  5. Generate grounded response      │                           │
+│  │     with citations (Gemini Flash)   │                           │
+│  └──────────────────────────────────────┘                           │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+#### Routing Decision Table
+
+| Query Pattern | Route | Why | Cost | Latency |
+|---|---|---|---|---|
+| "What was discussed about auth?" | Semantic | Factual lookup → Weaviate excels | $0.001 | < 200ms |
+| "Show me the overview" | Semantic (Tier 0) | Cached summary → FREE | $0 | < 50ms |
+| "Tell me about deployment" | Semantic (Tier 1) | Topic cluster → FREE | $0 | < 50ms |
+| "Find the architecture diagram" | Semantic (cross-modal) | Image search → Weaviate only | $0.001 | < 200ms |
+| "Who decided to use JWT?" | Graph | Person→Decision traversal | $0.005 | ~500ms |
+| "What is Alice working on?" | Graph | Person→Project traversal | $0.005 | ~500ms |
+| "How did the auth approach evolve?" | Graph (temporal) | Decision→SUPERSEDES chain | $0.005 | ~500ms |
+| "What blocks the migration?" | Graph | Project→BLOCKED_BY traversal | $0.005 | ~500ms |
+| "Tell me about the JWT migration" | Both (parallel) | Needs facts AND relationships | $0.006 | ~500ms |
+| "What happened with auth last week?" | Both (parallel) | Temporal + factual | $0.006 | ~500ms |
+
+### 4.3 External Search (Tavily — Preserved from v1)
+
+The v1 external search via Tavily is preserved in v2. It handles factual queries that require web knowledge (best practices, documentation, industry comparisons) — things NOT in the team's Slack history.
+
+```python
+class ExternalSearchService:
+    """Web search via Tavily API for grounding with external knowledge.
+
+    Why Tavily:
+    - Cost-effective: 1,000 free credits/month vs $35/1K (Google)
+    - Multiple tools: search, extract, crawl
+    - No model restrictions: works with any LLM
+    - Designed for AI/RAG: optimized for LLM consumption
+    """
+
+    async def search(self, query: str, search_depth: str = "basic",
+                     max_results: int = 5, include_answer: bool = True,
+                     include_domains: list[str] | None = None,
+                     exclude_domains: list[str] | None = None,
+                     ) -> ExternalSearchResponse:
+        """Search the web. Returns results + optional AI-generated answer."""
+        ...
+
+    async def search_documentation(self, query: str,
+                                    technology: str | None = None,
+                                    max_results: int = 5,
+                                    ) -> ExternalSearchResponse:
+        """Optimized for finding API docs, tutorials, official docs."""
+        ...
+
+    async def extract_content(self, urls: list[str]) -> dict[str, str]:
+        """Extract clean content from specific URLs."""
+        ...
+```
+
+**Integration with Query Decomposition:**
+
+When the `QueryDecomposer` produces `external_queries`, they are executed via Tavily in parallel with internal queries:
+
+```
+Complex Query → QueryDecomposer
+  ├─ internal_queries → [routed to Semantic/Graph in parallel]
+  └─ external_queries → [executed via Tavily in parallel]
+      → Results merged into response context
+```
+
+**Routing decision:** The router classifies `external` queries via the decomposer, not via the query understanding LLM. Only queries that need web knowledge (comparisons, docs, best practices) generate external sub-queries.
+
+| Config | Default |
+|--------|---------|
+| `TAVILY_API_KEY` | Required for external search |
+| `ENABLE_EXTERNAL_SEARCH` | `true` |
+| `TAVILY_SEARCH_DEPTH` | `"basic"` (1 credit) or `"advanced"` (2 credits) |
+| `TAVILY_MAX_RESULTS` | `5` |
+
+#### Graph Retrieval with Weaviate Enrichment
+
+When the router selects Graph, Neo4j finds the relationships, then follows **episodic edges** back to Weaviate for the actual source text and citations:
+
+```python
+class GraphRetriever:
+    """System-2: Neo4j traversal + Weaviate enrichment."""
+
+    async def retrieve(self, query: str, channel_id: str,
+                       understanding: QueryUnderstanding) -> list[dict]:
+
+        # Step 1: Resolve entities from query to Neo4j nodes
+        matched = await self.neo4j.fuzzy_match_entities(
+            understanding.entities, channel_id
+        )
+        if not matched:
+            return []  # No entities found → fallback to semantic
+
+        # Step 2: Graph traversal (1-2 hops)
+        if understanding.temporal_scope == "historical":
+            paths = await self.neo4j.temporal_chain(matched[0], channel_id)
+        else:
+            paths = await self.neo4j.traverse(
+                [m.name for m in matched], channel_id, max_hops=2
+            )
+
+        # Step 3: Follow episodic edges → get Weaviate memory IDs
+        node_ids = self._extract_node_ids(paths)
+        weaviate_ids = await self.neo4j.get_episodic_weaviate_ids(node_ids)
+
+        # Step 4: Fetch full memories from Weaviate (text + citations)
+        memories = await self.weaviate.fetch_by_ids(weaviate_ids)
+
+        # Step 5: Combine graph structure + memory content
+        return self._merge_graph_and_memories(paths, memories)
+```
+
+---
+
+## 5. Ingestion Pipeline
+
+### 5.1 Multi-Platform Adapters
+
+**Chat SDK Evaluation**: The [Vercel Chat SDK](https://chat-sdk.dev/) is TypeScript-only and designed for bot webhooks — it **cannot fetch message history**. We use Python adapters for batch ingestion, with optional Chat SDK for real-time (Phase 2).
+
+```python
+@dataclass
+class NormalizedMessage:
+    """Unified message model across all platforms."""
+    content: str
+    author: AuthorInfo
+    platform: Platform           # slack | teams | discord
+    channel_id: str
+    channel_name: str
+    message_id: str
+    timestamp: datetime
+    thread_id: str | None = None
+    attachments: list[Attachment] = field(default_factory=list)
+    reactions: list[str] = field(default_factory=list)
+    reply_count: int = 0
+    raw_metadata: dict = field(default_factory=dict)
+
+class BaseAdapter(ABC):
+    @abstractmethod
+    async def fetch_history(self, channel_id, since=None, limit=500) -> list[NormalizedMessage]: ...
+
+class SlackAdapter(BaseAdapter):    # slack-sdk (Python)
+class TeamsAdapter(BaseAdapter):    # Microsoft Graph API
+class DiscordAdapter(BaseAdapter):  # discord.py
+```
+
+### 5.2 Pipeline: Writes to Both Memory Systems
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                      INGESTION PIPELINE                              │
+│                                                                      │
+│  NormalizedMessage (from any adapter)                                │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 1: PREPROCESS                                                │
+│  • Modality detection, attachment parsing, thread assembly           │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 2: EXTRACT + QUALITY GATE                                    │
+│  • LLM fact extraction (Gemini Flash Lite)                          │
+│  • Quality scoring → REJECT < 0.5, max 2 facts/message             │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 3: ENTITY EXTRACTION + QUALITY GATE (for Graph Memory)        │
+│  • LLM extracts entities (flexible types) + relationships           │
+│  • EntityQualityGate: reject confidence < 0.6, filter hypotheticals │
+│  • Alias resolution via EntityRegistry (fuzzy dedup)                │
+│  • Temporal validity assignment                                      │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 4: CLASSIFY + TAG                                            │
+│  • Topic, entity, action tagging + importance scoring               │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 5: EMBED (Jina v4, 2048-dim, multimodal)                    │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 6: CROSS-BATCH VALIDATION                                     │
+│  • Resolve entities across message batches to canonical forms        │
+│  • Validate relationship consistency (e.g., conflicting roles)       │
+│  • Merge alias variants discovered across chunks                     │
+│  • Create bidirectional edges for key relationship types             │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 7: NOVELTY CHECK + PERSIST (Outbox Pattern)                   │
+│  │                                                                   │
+│  ├──▶ MONGODB: Write intent document (atomic transaction)            │
+│  │    {fact, entities, embeddings, status: {weaviate: pending, ...}} │
+│  │                                                                   │
+│  ├──▶ WEAVIATE: Upsert atomic fact (idempotent via deterministic UUID)│
+│  │    Mark intent.status.weaviate = "done"                           │
+│  │                                                                   │
+│  ├──▶ NEO4J: MERGE entities + relationships (idempotent via MERGE)   │
+│  │    Mark intent.status.neo4j = "done" (skip if Neo4j unavailable)  │
+│  │                                                                   │
+│  └──▶ MONGODB: Update sync state, mark intent complete               │
+│       Background reconciler retries "pending"/"failed" every 15min   │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+### 5.3 Entity Extraction Prompt (Guided-Flexible)
+
+```python
+ENTITY_EXTRACTION_PROMPT = """
+Extract entities and relationships from this message.
+
+CORE ENTITY TYPES (prefer these when applicable):
+- Person: individual (fields: name, role, team)
+- Decision: concrete choice (fields: summary, status, rationale, date)
+- Project: initiative (fields: name, status, description)
+- Technology: tool/framework (fields: name, category)
+
+EXTENSION TYPES (use when content doesn't fit core types):
+- Create any type: Team, Meeting, Artifact, Constraint, Deadline, Budget, ...
+
+RELATIONSHIPS:
+- Use descriptive verb phrases: DECIDED, WORKS_ON, BLOCKED_BY, OWNS, ...
+- NOT limited to a fixed set — use whatever captures the meaning
+- Include temporal context when available
+
+EXISTING ENTITIES (reuse names to avoid duplicates):
+{existing_entities}
+
+OUTPUT JSON:
+{
+  "entities": [{"type": "...", "name": "...", "properties": {...},
+                "aliases": ["alternative name 1", "@slack_handle", ...]}],
+  "relationships": [{"source": "...", "type": "...", "target": "...",
+                      "context": "...", "temporal": "current|supersedes:<old>",
+                      "evidence": "exact quote or paraphrase from message",
+                      "confidence": 0.0-1.0}],
+  "confidence": 0.0-1.0
+}
+
+ALIAS RULES:
+- Map all name variants to a canonical form: "Alice", "@alice", "alice.chen" → "Alice Chen"
+- Include Slack handles, nicknames, abbreviated names as aliases
+- For projects: "Atlas", "beever-atlas", "the atlas project" → canonical name
+"""
+```
+
+### 5.4 Quality Gate
+
+```python
+class MemoryQualityGate:
+    MIN_LENGTH = 40
+    MAX_FACTS_PER_MESSAGE = 2
+    MIN_QUALITY_SCORE = 0.5
+    VAGUE_PATTERNS = ["the user", "the process", "this was", "it was",
+                      "the output", "as mentioned", "was adjusted"]
+
+    def score_fact(self, fact: str) -> float:
+        score = 1.0
+        if len(fact) < self.MIN_LENGTH: score -= 0.3
+        for p in self.VAGUE_PATTERNS:
+            if p in fact.lower(): score -= 0.2
+        if any(w[0].isupper() for w in fact.split()[1:] if len(w) > 1): score += 0.1
+        if fact.startswith(("It ", "This ", "That ")): score -= 0.15
+        return max(0.0, min(1.0, score))
+```
+
+### 5.5 Entity Quality Gate
+
+```python
+class EntityQualityGate:
+    """Quality gate for entity extraction — prevents graph pollution.
+
+    Inspired by Forensic Eyes' per-category confidence thresholds:
+    higher bars for high-stakes relationships, lower for casual mentions.
+    """
+    MIN_ENTITY_CONFIDENCE = 0.6
+
+    # Per-relationship-type confidence thresholds
+    # Higher bar for relationships with greater semantic commitment
+    RELATIONSHIP_CONFIDENCE = {
+        "DECIDED":      0.7,   # Decisions must be clearly stated
+        "OWNS":         0.6,   # Ownership/responsibility requires clarity
+        "LEADS":        0.6,   # Leadership roles require clarity
+        "BLOCKED_BY":   0.6,   # Blockers must be explicit
+        "SUPERSEDES":   0.7,   # Temporal evolution must be unambiguous
+        "WORKS_ON":     0.4,   # Work associations are common and casual
+        "MENTIONS":     0.3,   # Low bar — just needs to be real
+        "MEMBER_OF":    0.4,   # Team membership is usually clear
+        "USES":         0.4,   # Technology usage is common
+        "DEPENDS_ON":   0.5,   # Dependencies should be stated
+        "_DEFAULT":     0.5,   # Fallback for LLM-created relationship types
+    }
+
+    HYPOTHETICAL_PATTERNS = [
+        "maybe", "might", "could", "should we", "what if",
+        "let's just", "hypothetically", "joking", "kidding",
+    ]
+
+    def filter_entities(self, extraction_result: dict,
+                         source_message: str) -> dict:
+        """Reject low-confidence entities and hypothetical references."""
+        if extraction_result.get("confidence", 0) < self.MIN_ENTITY_CONFIDENCE:
+            return {"entities": [], "relationships": []}
+
+        # Raise threshold for hypothetical/sarcastic messages
+        msg_lower = source_message.lower()
+        threshold = 0.8 if any(p in msg_lower for p in self.HYPOTHETICAL_PATTERNS) \
+                       else self.MIN_ENTITY_CONFIDENCE
+
+        valid_entities = [
+            e for e in extraction_result.get("entities", [])
+            if self._score_entity(e) >= threshold
+        ]
+
+        # Only keep relationships where both endpoints survived filtering
+        valid_names = {e["name"] for e in valid_entities}
+        valid_rels = [
+            r for r in extraction_result.get("relationships", [])
+            if r["source"] in valid_names and r["target"] in valid_names
+               and r.get("confidence", 0.5) >= self.RELATIONSHIP_CONFIDENCE.get(
+                   r.get("type", ""), self.RELATIONSHIP_CONFIDENCE["_DEFAULT"])
+        ]
+
+        return {"entities": valid_entities, "relationships": valid_rels}
+
+    def _score_entity(self, entity: dict) -> float:
+        score = entity.get("confidence", 0.5)
+        if entity.get("properties", {}).get("role"): score += 0.1
+        if entity.get("properties", {}).get("date"): score += 0.1
+        if entity["name"].lower() in ("it", "this", "that", "someone"): score -= 0.5
+        return max(0.0, min(1.0, score))
+```
+
+### 5.6 Contradiction Detection
+
+Contradictory facts are detected and resolved via SUPERSEDES chains. This runs as a **background job every 15 minutes** (not blocking ingestion).
+
+```python
+class ContradictionDetector:
+    """Detect and resolve contradictory facts via LLM comparison."""
+
+    SIMILARITY_RANGE = (0.70, 0.95)  # Cosine similarity range for candidates
+    CONFIDENCE_THRESHOLD = 0.8       # Auto-supersede above this
+
+    async def detect_batch(self):
+        """Process recently ingested facts for contradictions."""
+        recent = await self.weaviate.get_facts_since(
+            minutes_ago=15, has_contradiction_check=False)
+
+        for fact in recent:
+            await self._check_contradictions(fact)
+            await self.weaviate.mark_contradiction_checked(fact.id)
+
+    async def _check_contradictions(self, new_fact: dict):
+        # METHOD 1: Cosine similarity scan (catches rephrased contradictions)
+        similar = await self.weaviate.search_similar(
+            new_fact["memory"],
+            channel_id=new_fact["channel_id"],
+            min_similarity=self.SIMILARITY_RANGE[0],
+            max_similarity=self.SIMILARITY_RANGE[1],
+            exclude_id=new_fact["id"],
+            limit=5,
+        )
+
+        # METHOD 2: Entity-scoped scan (catches same-topic contradictions
+        # regardless of text similarity — e.g., "Alice is auth lead" vs "Bob is auth lead")
+        if new_fact.get("graph_entity_ids"):
+            entity_related = await self.neo4j.get_facts_for_entities(
+                new_fact["graph_entity_ids"],
+                exclude_weaviate_id=new_fact["id"])
+            similar.extend(entity_related)
+
+        # LLM comparison for each candidate pair
+        for candidate in similar:
+            result = await self._llm_compare(new_fact, candidate)
+            if result["classification"] == "CONTRADICTORY" \
+               and result["confidence"] > self.CONFIDENCE_THRESHOLD:
+                await self._supersede(older=candidate, newer=new_fact,
+                                       reason=result["reason"])
+
+    async def _supersede(self, older, newer, reason):
+        # Mark old fact as invalidated in Weaviate
+        await self.weaviate.update(older["id"], {
+            "invalid_at": datetime.utcnow().isoformat(),
+            "superseded_by": newer["id"],
+            "supersession_reason": reason,
+        })
+
+        # Create SUPERSEDES edge in Neo4j if both have graph entities
+        if newer.get("graph_entity_ids") and older.get("graph_entity_ids"):
+            await self.neo4j.create_supersedes_edge(
+                newer_entity_ids=newer["graph_entity_ids"],
+                older_entity_ids=older["graph_entity_ids"],
+                reason=reason)
+```
+
+**Contradiction comparison prompt:**
+
+```python
+CONTRADICTION_PROMPT = """Compare these two facts from the same channel:
+
+EXISTING (created {old_timestamp}):
+"{old_memory}"
+
+NEW (created {new_timestamp}):
+"{new_memory}"
+
+Classify the relationship:
+- CONTRADICTORY: The new fact replaces or invalidates the old fact
+- PROGRESSIVE: The new fact builds on or extends the old fact (not a contradiction)
+- INDEPENDENT: Different topics, no relationship
+
+Examples:
+- "We use JWT with HS256" → "We switched to RS256 for JWT" = CONTRADICTORY
+- "We use PostgreSQL for users" → "We use MongoDB for analytics" = INDEPENDENT
+- "Alice is exploring Kubernetes" → "Alice deployed to Kubernetes" = PROGRESSIVE
+- "Alice is auth lead" → "Bob is the new auth lead" = CONTRADICTORY
+- "Sprint deadline is March 15" → "Sprint deadline extended to March 22" = CONTRADICTORY
+
+Respond in JSON: {"classification": "...", "confidence": 0.0-1.0, "reason": "..."}"""
+```
+
+**Cost:** ~$0.001 per comparison (Gemini Flash Lite). Typically 0-5 comparisons per new fact. Negligible at scale.
+
+**Retrieval integration:** The `ImprovedSemanticRetriever` filters by `invalid_at IS NULL` — superseded facts are automatically excluded from results without any retrieval code changes.
+
+---
+
+### 5.7 Consolidation Schedule & Triggers
+
+Consolidation builds Tier 0 (channel summaries) and Tier 1 (topic clusters) from Tier 2 (atomic facts). Without consolidation, the wiki has nothing to serve and the "80% free reads" promise doesn't work.
+
+**Three trigger types:**
+
+```python
+class ConsolidationService:
+    """Manages cluster building, summary updates, and wiki refresh."""
+
+    # TRIGGER 1: After sync (incremental — new facts only)
+    async def on_sync_complete(self, channel_id: str):
+        """Runs automatically when a channel sync finishes."""
+        unclustered = await self.weaviate.get_unclustered_facts(channel_id)
+        if not unclustered:
+            return
+
+        touched = await self._assign_to_clusters(channel_id, unclustered)
+        await self._update_cluster_summaries(channel_id, touched)
+        await self._update_channel_summary(channel_id)
+        await self.mongo.mark_wiki_dirty(channel_id)
+
+    # TRIGGER 2: Scheduled full rebuild (daily 2 AM UTC)
+    @scheduled(cron="0 2 * * *")
+    async def daily_full_consolidation(self):
+        """Re-evaluates all clusters: coherence, split/merge, summaries."""
+        for channel_id in await self.get_active_channels():
+            await self._full_reconsolidate(channel_id)
+            await self._rebuild_wiki(channel_id)
+
+    # TRIGGER 3: On-demand via API
+    async def manual_trigger(self, channel_id: str):
+        """Manual refresh for admin use or after bulk operations."""
+        await self._full_reconsolidate(channel_id)
+        await self._rebuild_wiki(channel_id)
+
+    async def _assign_to_clusters(self, channel_id, new_facts) -> set:
+        """Incremental: assign new facts to existing or new clusters."""
+        existing = await self.weaviate.get_tier1_clusters(channel_id)
+        touched = set()
+
+        for fact in new_facts:
+            best_match, best_score = None, 0.0
+            for cluster in existing:
+                score = await self._topic_similarity(fact, cluster)
+                if score > best_score:
+                    best_match, best_score = cluster, score
+
+            if best_score > 0.6:
+                await self.weaviate.link_fact_to_cluster(fact.id, best_match.id)
+                touched.add(best_match.id)
+            else:
+                # New cluster seed — promoted when 3+ members accumulate
+                new_id = await self.weaviate.create_cluster_seed(channel_id, fact)
+                touched.add(new_id)
+
+        return touched
+```
+
+**Cluster health rules** (applied during daily full reconsolidation):
+
+| Condition | Action |
+|-----------|--------|
+| Cluster > 100 members | Split via k-means on embeddings into 2-3 sub-clusters |
+| Two clusters have summary cosine > 0.85 | Merge into single cluster |
+| Cluster coherence score < 0.4 | Re-cluster members from scratch |
+| Cluster has 0 members | Delete cluster |
+
+**Wiki dirty flag** — ensures wiki reflects latest changes:
+
+```python
+# In wiki_cache.py
+async def get_wiki(self, channel_id: str) -> str:
+    cached = await self.cache.find_one({"channel_id": channel_id})
+    dirty = await self.dirty_flags.find_one({"channel_id": channel_id})
+
+    if cached and (not dirty or not dirty.get("dirty")):
+        return cached["content"]  # FREE read — no LLM cost
+
+    # Regenerate: consolidation or entity changes made wiki stale
+    wiki = await self.builder.build(channel_id)
+    await self.cache.update_one(
+        {"channel_id": channel_id},
+        {"$set": {"content": wiki, "generated_at": datetime.utcnow()}},
+        upsert=True)
+    await self.dirty_flags.update_one(
+        {"channel_id": channel_id}, {"$set": {"dirty": False}})
+    return wiki
+```
+
+**What triggers `mark_wiki_dirty`:**
+- After sync → consolidation assigns new facts to clusters
+- Entity extraction writes new Person/Decision/Technology to Neo4j
+- Contradiction detector supersedes a fact
+- Manual reconsolidation trigger
+
+---
+
+## 6. Wiki Generation
+
+The wiki combines both memory systems for a comprehensive view:
+
+```markdown
+# Channel Wiki: #backend-engineering
+
+## Overview
+{From Weaviate Tier 0 summary — FREE read}
+
+## Topics
+{From Weaviate Tier 1 clusters — FREE read}
+### Authentication (23 memories)
+  Team discussed JWT with RS256, migrated from sessions in Q3 2024...
+### Infrastructure (15 memories)
+  AWS EKS deployment, Terraform, ArgoCD...
+
+## People
+{From Neo4j: MATCH (p:Person)-[:MENTIONED_IN]->(e:Event {channel: $ch})}
+| Person | Role | Active In | Recent Decisions |
+|--------|------|-----------|-----------------|
+| Alice  | Lead | Auth, API | JWT migration   |
+
+## Decisions (Timeline)
+{From Neo4j: Decision nodes with SUPERSEDES chains}
+| Date | Decision | By | Status | Supersedes |
+|------|----------|----|--------|------------|
+| Mar 20 | Use RS256 | Alice | Active | Use HS256 |
+
+## Recent Activity (Last 7 Days)
+{From Weaviate: recent atomic memories}
+```
+
+**Cost breakdown:** Overview + Topics sections = FREE (Weaviate cache). People + Decisions = Neo4j query (~$0.001). Only the LLM synthesis costs money.
+
+---
+
+## 7. Research Paper Integration
+
+| Paper | Core Insight | How v2 Uses It |
+|-------|-------------|----------------|
+| **GraphRAG (Weaviate+Neo4j)** | Hybrid vector-graph search | Dual memory: Weaviate for semantic, Neo4j for relational |
+| **H-MEM** | 4-layer hierarchical memory | 3-tier Weaviate (summary→topic→atomic) with fixes |
+| **System-1/System-2 Routing** | Dual-process retrieval | Smart router: semantic (fast) / graph (deep) / both |
+| **Ebbinghaus Forgetting** | R = e^(-t/S) | Applied to retrieval ranking (actually wired in v2) |
+| **MemoryBank** | Nightly distillation | Scheduled consolidation: clusters + summaries + wiki |
+| **Dynamic Knowledge Graphs** | Episodic edges + fact replacement | Event nodes linking Neo4j↔Weaviate; SUPERSEDES edges |
+| **Zep** | Bi-temporal tracking | valid_from/valid_until/created_at on all relationships |
+| **Mem0/Mem0g** | LLM judge for consolidation | Entity extraction dedup: MERGE vs ADD vs SUPERSEDE |
+
+---
+
+## 8. Deployment
+
+```yaml
+# docker-compose.yml (v2)
+services:
+  beever-atlas:          # Python/FastAPI (MCP + REST)
+    build: .
+    ports: ["8000:8000"]
+    depends_on: [weaviate, neo4j, mongodb]
+
+  web:                   # React frontend
+    build: ./web
+    ports: ["3000:80"]
+
+  weaviate:              # Semantic memory
+    image: cr.weaviate.io/semitechnologies/weaviate:1.28.0
+    ports: ["8080:8080", "50051:50051"]
+    volumes: [weaviate_data:/var/lib/weaviate]
+
+  neo4j:                 # Graph memory
+    image: neo4j:5.26-community
+    ports: ["7474:7474", "7687:7687"]
+    environment:
+      NEO4J_AUTH: neo4j/beever_atlas_dev
+      NEO4J_PLUGINS: '["apoc"]'
+    volumes: [neo4j_data:/data]
+
+  mongodb:               # State + cache
+    image: mongo:7.0
+    ports: ["27017:27017"]
+    volumes: [mongo_data:/data/db]
+
+volumes:
+  weaviate_data:
+  neo4j_data:
+  mongo_data:
+```
+
+### 8.1 MCP Tool Specification
+
+**Design decision:** Graph queries are abstracted behind `ask_questions`. The smart router decides when to use Neo4j — users don't need to know about the dual-memory architecture.
+
+**7 tools:**
+
+```python
+@tool("ask_questions")
+async def ask_questions(
+    question: str,           # Natural language query
+    channel_id: str,         # Target channel
+    include_citations: bool = True,
+    max_results: int = 10,
+) -> AskResponse:
+    """Ask a question about channel knowledge. Routes automatically
+    to semantic search, graph traversal, or both based on query type.
+    Cost: $0.001-$0.006 depending on route."""
+
+@tool("search_memories")
+async def search_memories(
+    query: str,              # Search query
+    channel_id: str,
+    tier: str = "all",       # "all" | "summary" | "topic" | "atomic"
+    limit: int = 15,
+    include_images: bool = False,
+) -> SearchResponse:
+    """Direct hybrid search — bypasses router for power users.
+    Cost: ~$0.001"""
+
+@tool("get_wiki")
+async def get_wiki(
+    channel_id: str,
+    section: str = "all",    # "all"|"overview"|"topics"|"people"|"decisions"|"recent"
+) -> WikiResponse:
+    """Read cached wiki content. FREE for cached sections.
+    Returns stale data if wiki is dirty — use refresh_wiki to force update."""
+
+@tool("get_topics")
+async def get_topics(
+    channel_id: str,
+) -> TopicsResponse:
+    """List topic clusters for a channel. FREE (cached Tier 1)."""
+
+@tool("sync_channel")
+async def sync_channel(
+    channel_id: str,
+    max_messages: int = 5000,  # Safety limit to prevent cost explosion
+    since: str = None,         # ISO timestamp, defaults to last sync point
+) -> SyncResponse:
+    """Trigger ingestion for a channel. Runs in background.
+    Cost: ~$0.0025/message (text), ~$0.008/message (with media)."""
+
+@tool("get_sync_status")
+async def get_sync_status(
+    channel_id: str = None,    # None = all channels
+) -> SyncStatusResponse:
+    """Check sync progress and health status. FREE."""
+
+@tool("refresh_wiki")
+async def refresh_wiki(
+    channel_id: str,
+) -> RefreshResponse:
+    """Force wiki regeneration. Triggers full reconsolidation.
+    Cost: ~$0.01 for LLM synthesis."""
+```
+
+**MCP Resources** (read-only, URI-based access):
+
+```python
+@resource("wiki://{channel_id}")           # Full wiki markdown
+@resource("wiki://{channel_id}/overview")  # Tier 0 summary only
+@resource("wiki://{channel_id}/topics")    # Tier 1 cluster list
+```
+
+**Response schemas:**
+
+```python
+class AskResponse:
+    answer: str                    # Grounded response with inline citations
+    citations: list[Citation]      # Source facts with platform permalinks
+    route_used: str                # "semantic" | "graph" | "both"
+    confidence: float              # 0.0-1.0
+    degraded: bool                 # True if a component was unavailable
+    cost_usd: float                # Estimated cost of this query
+
+class Citation:
+    text: str                      # Original fact text
+    channel: str                   # Source channel name
+    user: str                      # Who said it
+    timestamp: str                 # When it was said
+    permalink: str                 # Platform message URL
+    tier: str                      # "atomic" | "topic" | "summary"
+
+class SyncResponse:
+    status: str                    # "started" | "already_running" | "queued"
+    channel_id: str
+    estimated_messages: int        # Approximate message count to process
+    job_id: str                    # For tracking via get_sync_status
+
+class WikiResponse:
+    content: str                   # Markdown wiki content
+    generated_at: str              # When this version was generated
+    is_stale: bool                 # True if wiki_dirty flag is set
+    channel_id: str
+```
+
+---
+
+## 9. Module Structure
+
+```
+src/beever_atlas/
+├── adapters/                    # Multi-platform ingestion
+│   ├── base.py                  # NormalizedMessage, BaseAdapter
+│   ├── slack_adapter.py         # slack-sdk
+│   ├── teams_adapter.py         # Microsoft Graph API
+│   └── discord_adapter.py       # discord.py
+│
+├── pipeline/                    # Ingestion (writes to BOTH stores)
+│   ├── preprocessor.py          # Stage 1
+│   ├── extractor.py             # Stage 2: facts + quality gate
+│   ├── entity_extractor.py      # Stage 3: entities → Neo4j
+│   ├── classifier.py            # Stage 4: tagging
+│   ├── embedder.py              # Stage 5: Jina v4
+│   ├── cross_batch_validator.py  # Stage 6: alias resolution + consistency
+│   ├── persister.py             # Stage 7: write Weaviate + Neo4j + MongoDB
+│   ├── outbox.py                # Write intent + idempotent fan-out
+│   ├── reconciler.py            # Retry incomplete cross-store writes
+│   └── contradiction_detector.py  # Background contradiction detection
+│
+├── stores/                      # Data store clients
+│   ├── weaviate_store.py        # Semantic memory (3-tier)
+│   ├── neo4j_store.py           # Graph memory (flexible)
+│   ├── mongo_store.py           # State + wiki cache
+│   └── entity_registry.py       # Canonical names + alias resolution
+│
+├── retrieval/                   # Query system
+│   ├── query_decomposer.py     # Complex question → parallel sub-queries
+│   ├── query_router.py          # LLM understanding + routing
+│   ├── semantic_retriever.py    # Weaviate 3-tier (improved)
+│   ├── graph_retriever.py       # Neo4j traversal + Weaviate enrichment
+│   ├── external_search.py       # Tavily web search (preserved from v1)
+│   ├── result_merger.py         # Merge + dedup + rank
+│   ├── temporal.py              # Temporal decay (ACTUALLY APPLIED)
+│   ├── consolidation.py         # Cluster building (ACTUALLY LINKS)
+│   └── response_generator.py    # Grounded response + citations
+│
+├── wiki/                        # Wiki from both memory systems
+│   ├── wiki_builder.py          # Weaviate tiers + Neo4j entities → markdown
+│   └── wiki_cache.py            # MongoDB cache
+│
+├── server/                      # Interfaces
+│   ├── tools.py                 # MCP tools
+│   ├── resources.py             # MCP resources (wiki://)
+│   └── api_routes.py            # REST API
+│
+└── infra/                        # Cross-cutting infrastructure
+    ├── health_registry.py        # Circuit breakers per dependency
+    ├── llm_provider.py           # LLM abstraction + fallback chain
+    ├── telemetry.py              # OpenTelemetry traces + metrics
+    ├── access_control.py         # Channel-level ACL from Slack membership
+    ├── dead_letter_queue.py      # Failed ingestion retry queue
+    └── consistency_checker.py    # Cross-store orphan detection
+```
+
+---
+
+## 10. Key Design Decisions
+
+| Decision | Choice | Rationale | Rejected Alternative |
+|----------|--------|-----------|---------------------|
+| Memory architecture | Dual (Weaviate + Neo4j) | Each does what it's best at — semantic vs. relational | Neo4j only (can't do hybrid BM25+vector), Weaviate only (can't do multi-hop graph) |
+| Weaviate tiers | Keep 3 tiers, fix bugs | Sound design; Tier 0+1 give free reads (wiki-first); just needs working cluster linking | Remove tiers (loses free wiki reads, loses topic scoping) |
+| Graph schema | Guided-flexible | Core types + LLM creates extensions; captures any relationship | Fixed schema (misses Budget, Team, Meeting...), Full triplets (too noisy) |
+| Relationships | Fully flexible | LLM extracts whatever verb phrase captures the meaning | Fixed relationship list (can't capture BLOCKED_BY, POSTPONED_UNTIL...) |
+| Query routing | Hybrid (route OR parallel) | Semantic-first saves cost (80%); parallel for ambiguous | Pure router (misclassification), Pure parallel (wasteful) |
+| Multi-platform | Python adapters | Chat SDK is TS-only, can't fetch history | Chat SDK only (no batch history) |
+| Quality gate | Reject at extraction | Prevent garbage from entering system | Post-hoc cleanup (harder) |
+| Cluster linking | Actually write cluster_id | v1's biggest bug — no-op | Keep as no-op (breaks everything) |
+
+---
+
+## 11. Open Questions
+
+1. **Entity extraction cost**: ~$0.001/message for flash-lite. 10K messages = ~$10 initial sync. Acceptable?
+2. **Graph type normalization**: How aggressively should we merge "Team"/"Group"/"Squad" into one type? LLM pass or rule-based?
+3. ~~**Consolidation frequency**~~: **RESOLVED** — Three triggers: after sync (incremental), daily 2 AM UTC (full), on-demand API. See §5.7.
+4. ~~**MCP surface**~~: **RESOLVED** — Graph queries abstracted behind `ask_questions`. 7 tools defined. See §8.1.
+5. **Chat SDK bridge**: Worth building the TypeScript webhook service for real-time ingestion in Phase 2?
+6. **Decomposition threshold**: When should queries be decomposed vs. sent as-is? Token length? LLM confidence?
+
+---
+
+## 12. Resilience & Degradation Design
+
+The v2 architecture depends on 6 external services: Weaviate, Neo4j, MongoDB, Gemini, Jina, and Tavily. Any component failure must degrade gracefully — not cause total system failure.
+
+### 12.1 Dependency Health Registry
+
+```python
+class DependencyHealth:
+    """Circuit breaker per external dependency (CLOSED → OPEN → HALF_OPEN)."""
+
+    DEPENDENCIES = {
+        "weaviate":  {"critical": True,  "timeout_s": 5},
+        "neo4j":     {"critical": False, "timeout_s": 5},
+        "mongodb":   {"critical": True,  "timeout_s": 5},
+        "gemini":    {"critical": True,  "timeout_s": 10},
+        "jina":      {"critical": False, "timeout_s": 10},
+        "tavily":    {"critical": False, "timeout_s": 5},
+    }
+
+    async def check(self, name: str) -> bool:
+        """Returns True if dependency is available."""
+        if self.states[name] == CircuitState.OPEN:
+            if time_since_open > RECOVERY_WINDOW:  # e.g., 30s
+                self.states[name] = CircuitState.HALF_OPEN
+                return True  # Probe with one request
+            return False
+        return True
+
+    def record_failure(self, name: str):
+        """After 3 consecutive failures, open the circuit."""
+        self.failure_counts[name] += 1
+        if self.failure_counts[name] >= 3:
+            self.states[name] = CircuitState.OPEN
+            logger.error(f"Circuit OPEN for {name}")
+
+    def record_success(self, name: str):
+        """Reset failure count, close circuit if half-open."""
+        self.failure_counts[name] = 0
+        if self.states[name] == CircuitState.HALF_OPEN:
+            self.states[name] = CircuitState.CLOSED
+```
+
+### 12.2 Degradation Matrix
+
+| Component Down | Ingestion Impact | Retrieval Impact | Behavior |
+|----------------|-----------------|------------------|----------|
+| **Neo4j** | Stage 3 skipped; facts stored in Weaviate only; entities queued for backfill | `route=graph` → reclassify as `route=semantic` | Wiki People/Decisions show "temporarily unavailable" |
+| **Gemini** | Messages queued in dead letter queue | Fall back to v1 regex classifier for routing; return cached wiki only | Alert fired; retry on recovery |
+| **Jina** | Embeddings queued; facts stored text-only in Weaviate | Existing embeddings work; new facts use BM25-only | Backfill embeddings when Jina recovers |
+| **Tavily** | No impact | Silently drop external sub-queries; return internal-only results | User sees "external search unavailable" note |
+| **Weaviate** | Full ingestion paused (queue in MongoDB) | Return cached wiki; graph-only for relational queries | Critical alert — system severely degraded |
+| **MongoDB** | Full system paused | Read-only from Weaviate/Neo4j if cached connections survive | Critical alert — system offline |
+
+### 12.3 LLM Provider Abstraction
+
+All LLM calls go through a provider abstraction layer with automatic failover:
+
+```python
+class LLMProvider:
+    """Unified LLM interface with circuit-breaker failover."""
+
+    TIERS = {
+        "fast":    {"primary": "gemini-flash-lite", "fallback": "claude-haiku"},
+        "quality": {"primary": "gemini-flash",      "fallback": "claude-sonnet"},
+    }
+
+    async def call(self, tier: str, prompt: str, **kwargs) -> str:
+        config = self.TIERS[tier]
+        # Try primary
+        if await self.health.check(config["primary"]):
+            try:
+                return await asyncio.wait_for(
+                    self._generate(config["primary"], prompt, **kwargs),
+                    timeout=10,
+                )
+            except (TimeoutError, APIError) as e:
+                self.health.record_failure(config["primary"])
+
+        # Try fallback
+        if config.get("fallback"):
+            return await self._generate(config["fallback"], prompt, **kwargs)
+
+        raise LLMUnavailableError(f"All providers failed for tier={tier}")
+```
+
+**Fallback chain per call site:**
+
+| Call Site | Primary | Fallback | Last Resort |
+|-----------|---------|----------|-------------|
+| Query Router | Gemini Flash Lite | Claude Haiku | v1 regex classifier |
+| Fact Extraction (Stage 2) | Gemini Flash Lite | Claude Haiku | Dead letter queue |
+| Entity Extraction (Stage 3) | Gemini Flash Lite | Claude Haiku | Skip (Weaviate-only) |
+| Classification (Stage 4) | Gemini Flash Lite | Rule-based tagger | Skip (no tags) |
+| Response Generation | Gemini Flash | Claude Sonnet | Return raw results |
+| Wiki Generation | Gemini Flash Lite | Claude Haiku | Serve stale cache |
+
+### 12.4 Ingestion Pipeline Resilience
+
+Each pipeline stage is independently skippable. If a non-critical stage fails, the pipeline continues:
+
+```python
+async def ingest_message(self, msg: NormalizedMessage):
+    # Stage 1: Preprocess (required)
+    preprocessed = await self.preprocessor.process(msg)
+
+    # Stage 2: Extract facts (required — queue to DLQ on failure)
+    try:
+        facts = await self.extractor.extract(preprocessed)
+    except LLMUnavailableError:
+        await self.dead_letter_queue.enqueue(msg)
+        return
+
+    # Stage 3: Entity extraction (optional — skip if Neo4j/LLM down)
+    entities = []
+    if await self.health.check("neo4j") and await self.health.check("gemini"):
+        try:
+            entities = await self.entity_extractor.extract(preprocessed, facts)
+        except Exception as e:
+            logger.warning(f"Entity extraction failed, continuing: {e}")
+            await self.backfill_queue.enqueue("entities", msg.id, preprocessed)
+
+    # Stage 4: Classify (optional — skip gracefully)
+    tags = await self._safe_classify(preprocessed, facts)
+
+    # Stage 5: Embed (optional — queue if Jina down)
+    embeddings = None
+    if await self.health.check("jina"):
+        embeddings = await self.embedder.embed(facts)
+    else:
+        await self.backfill_queue.enqueue("embeddings", msg.id, facts)
+
+    # Stage 7: Persist via outbox pattern
+    await self.persister.persist(facts, entities, embeddings, tags)
+```
+
+### 12.5 Write Safety — Outbox Pattern
+
+Stage 7 uses a MongoDB outbox pattern for cross-store write safety:
+
+```python
+class OutboxPersister:
+    """Two-phase persist: commit intent to MongoDB first, then fan out."""
+
+    async def persist(self, facts, entities, embeddings, tags) -> str:
+        # PHASE 1: Write intent (single MongoDB transaction)
+        intent = WriteIntent(
+            id=deterministic_uuid(facts),
+            facts=facts, entities=entities,
+            embeddings=embeddings, tags=tags,
+            status={"weaviate": "pending",
+                    "neo4j": "pending" if entities else "skipped",
+                    "state": "pending"},
+            retry_count=0,
+        )
+        await self.mongo.write_intents.insert_one(intent.dict())
+
+        # PHASE 2: Fan out (idempotent, independently retryable)
+        await self._fan_out(intent)
+        return intent.id
+
+    async def _fan_out(self, intent: WriteIntent):
+        # Weaviate — idempotent via deterministic UUID
+        if intent.status["weaviate"] == "pending":
+            try:
+                await self.weaviate.upsert(intent.facts, intent.embeddings)
+                await self._mark(intent.id, "weaviate", "done")
+            except Exception:
+                await self._mark(intent.id, "weaviate", "failed")
+
+        # Neo4j — idempotent via MERGE semantics
+        if intent.status["neo4j"] == "pending":
+            try:
+                for entity in intent.entities:
+                    await self.neo4j.upsert_entity(entity)
+                await self._mark(intent.id, "neo4j", "done")
+            except Exception:
+                await self._mark(intent.id, "neo4j", "failed")
+
+        # MongoDB sync state — final step
+        await self._update_sync_state(intent)
+        await self._mark(intent.id, "state", "done")
+```
+
+**Background reconciler** (runs every 15 minutes):
+
+```python
+class WriteReconciler:
+    """Retry incomplete cross-store writes."""
+
+    async def reconcile(self):
+        stale = await self.mongo.write_intents.find({
+            "$or": [
+                {"status.weaviate": {"$in": ["pending", "failed"]}},
+                {"status.neo4j": {"$in": ["pending", "failed"]}},
+            ],
+            "created_at": {"$lt": now() - timedelta(minutes=5)},
+            "retry_count": {"$lt": 5},
+        }).to_list()
+
+        for intent in stale:
+            await self.persister._fan_out(WriteIntent(**intent))
+            await self.mongo.write_intents.update_one(
+                {"id": intent["id"]}, {"$inc": {"retry_count": 1}})
+```
+
+---
+
+## 13. Observability & Operations
+
+### 13.1 Health Endpoints
+
+```python
+@app.get("/health")
+async def health_check():
+    checks = await asyncio.gather(
+        check_weaviate(),   # .is_ready()
+        check_neo4j(),      # driver.verify_connectivity()
+        check_mongodb(),    # ping
+        check_gemini(),     # list_models() with 5s timeout
+        check_jina(),       # embed test vector with 5s timeout
+    )
+    status = "healthy" if all(c.ok for c in checks) else \
+             "degraded" if any(c.ok for c in checks if c.critical) else \
+             "unhealthy"
+    return {"status": status,
+            "components": {c.name: c.dict() for c in checks}}
+```
+
+### 13.2 Key Metrics
+
+| Category | Metric | Type | Alert Threshold |
+|----------|--------|------|-----------------|
+| **Ingestion** | `ingestion.messages.processed` | Counter | Rate drops > 50% |
+| | `ingestion.quality_gate.rejected_ratio` | Gauge | > 60% |
+| | `ingestion.stage.duration_ms` | Histogram/stage | p95 > 5s |
+| | `ingestion.write_intent.pending_count` | Gauge | > 100 |
+| | `ingestion.dead_letter.count` | Counter | Any increase |
+| **Retrieval** | `retrieval.route.distribution` | Counter | graph > 40% |
+| | `retrieval.latency_ms` | Histogram/route | p95 > 3s |
+| | `retrieval.empty_results_ratio` | Gauge | > 30% |
+| **Stores** | `store.{name}.latency_ms` | Histogram | p95 > 2s |
+| | `store.{name}.error_rate` | Gauge | > 1% |
+| | `store.neo4j.entity_count` | Gauge | Growth > 1K/day |
+| | `store.orphan.count` | Gauge | Any increase |
+| **LLM** | `llm.{site}.latency_ms` | Histogram | p95 > 5s |
+| | `llm.{site}.error_rate` | Gauge | > 2% |
+| | `llm.{site}.token_cost` | Counter | Daily > budget |
+
+### 13.3 Distributed Tracing
+
+Every ingestion message and query carries a trace ID through all stages and stores:
+
+```python
+@tracer.start_as_current_span("ingest_message")
+async def process_message(msg: NormalizedMessage):
+    span = trace.get_current_span()
+    span.set_attribute("message.id", msg.id)
+    span.set_attribute("message.channel", msg.channel_id)
+    span.set_attribute("message.platform", msg.platform)
+
+    with tracer.start_as_current_span("stage_2_extract"):
+        facts = await extract(msg)
+    with tracer.start_as_current_span("stage_3_entities"):
+        entities = await extract_entities(msg, facts)
+    with tracer.start_as_current_span("stage_7_persist"):
+        await persist(facts, entities, embeddings)
+```
+
+### 13.4 Backup & Recovery
+
+| Store | Method | Frequency | Retention |
+|-------|--------|-----------|-----------|
+| Weaviate | `weaviate backup create` → S3 | Daily 3 AM UTC | 30 days |
+| Neo4j | `neo4j-admin dump` → S3 | Daily 3 AM UTC | 30 days |
+| MongoDB | `mongodump` → S3 | Daily 3 AM UTC | 30 days |
+
+### 13.5 Cross-Store Consistency Checks
+
+Weekly background job validates referential integrity:
+
+```python
+class ConsistencyChecker:
+    async def check_episodic_links(self):
+        """Verify Neo4j Event.weaviate_id → Weaviate object exists."""
+        event_ids = await self.neo4j.get_all_weaviate_ids()
+        for batch in chunks(event_ids, 100):
+            existing = await self.weaviate.batch_exists(batch)
+            orphaned = set(batch) - set(existing)
+            if orphaned:
+                metrics.record("store.orphan.episodic_links", len(orphaned))
+
+    async def check_entity_references(self):
+        """Verify Weaviate fact.graph_entity_ids → Neo4j nodes exist."""
+        facts = await self.weaviate.get_facts_with_graph_ids()
+        for fact in facts:
+            for neo4j_id in fact.graph_entity_ids:
+                if not await self.neo4j.node_exists(neo4j_id):
+                    metrics.record("store.orphan.entity_refs", 1)
+```
+
+---
+
+## 14. Access Control
+
+### 14.1 Channel-Level ACL
+
+Access control is inherited from the source platform's channel membership:
+
+```python
+class ChannelACL:
+    """Access control based on platform channel membership."""
+
+    # MongoDB collection: channel_acl
+    # {channel_id, platform, is_private, member_ids, last_synced}
+
+    async def sync_from_platform(self, channel_id: str, platform: str):
+        """Pull current membership from platform API."""
+        if platform == "slack":
+            members = await self.slack.conversations_members(channel=channel_id)
+            info = await self.slack.conversations_info(channel=channel_id)
+            is_private = info["channel"]["is_private"]
+        # ... similar for Teams, Discord
+
+        await self.collection.update_one(
+            {"channel_id": channel_id},
+            {"$set": {"is_private": is_private,
+                      "member_ids": members,
+                      "last_synced": datetime.utcnow()}},
+            upsert=True)
+
+    async def check_access(self, user_id: str, channel_id: str) -> bool:
+        acl = await self.collection.find_one({"channel_id": channel_id})
+        if not acl or not acl.get("is_private"):
+            return True  # Public channels visible to all workspace members
+        return user_id in acl.get("member_ids", [])
+
+    async def filter_results(self, user_id: str, results: list) -> list:
+        """Remove results from channels the user cannot access."""
+        accessible_cache = {}
+        filtered = []
+        for r in results:
+            ch = r.get("channel_id")
+            if ch not in accessible_cache:
+                accessible_cache[ch] = await self.check_access(user_id, ch)
+            if accessible_cache[ch]:
+                filtered.append(r)
+        return filtered
+```
+
+### 14.2 Integration Points
+
+- **API authentication**: Bearer token middleware validates user identity before any operation
+- **Retrieval pipeline**: `semantic_retriever` and `graph_retriever` call `acl.filter_results()` before returning
+- **Wiki builder**: Private channel sections show "[restricted]" for unauthorized users
+- **Neo4j traversal**: Global entities are visible, but relationships with `source_channel` from private channels are filtered
+- **ACL sync**: Membership is refreshed on each channel sync and cached for 1 hour
+
+```python
+@app.middleware("http")
+async def authenticate(request: Request, call_next):
+    token = request.headers.get("Authorization", "").replace("Bearer ", "")
+    if not token:
+        return JSONResponse(status_code=401, content={"error": "Missing auth token"})
+    user = await verify_workspace_token(token)
+    request.state.user_id = user.id
+    request.state.workspace_id = user.workspace_id
+    return await call_next(request)
+```
+
+---
+
+## Sources
+
+- [Vercel Chat SDK](https://chat-sdk.dev/) — [GitHub (vercel/chat)](https://github.com/vercel/chat)
+- [Chat SDK Adapters](https://chat-sdk.dev/docs/adapters) — [Changelog](https://vercel.com/changelog/chat-sdk)
+- [GraphRAG via Weaviate & Neo4j](https://weaviate.io/blog/graph-rag)
+- [H-MEM: Hierarchical Memory](https://arxiv.org/pdf/2507.22925)
+- [System-1/System-2 Graph Retrieval](https://arxiv.org/pdf/2602.15313)
+- [Zep Bi-Temporal Model](https://arxiv.org/pdf/2501.13956)
+- [Mem0/Mem0g](https://arxiv.org/pdf/2504.19413)
+- [Dynamic Knowledge Graphs](https://www.ijcai.org/proceedings/2025/0002.pdf)
+
+---
+
+*This proposal balances two complementary memory systems — Weaviate for semantic retrieval (improved 3-tier hierarchy handling 80% of queries cheaply) and Neo4j for flexible relational knowledge (handling the 20% that need entity relationships). The smart router optimizes for cost by defaulting to Weaviate-first, escalating to Neo4j only when relationships matter, and running both in parallel when the query is ambiguous.*
diff --git a/docs/v2/01-architecture-overview.md b/docs/v2/01-architecture-overview.md
new file mode 100644
index 00000000..f76c2ae8
--- /dev/null
+++ b/docs/v2/01-architecture-overview.md
@@ -0,0 +1,229 @@
+# Beever Atlas v2: Architecture Overview
+
+> **Status**: Implemented — core pipeline, dual-memory system, wiki generation, and web frontend are operational.
+> **Scope**: Production-ready knowledge intelligence system built on dual semantic + graph memory.
+
+---
+
+## 1. Executive Summary
+
+Beever Atlas v1 demonstrated that a wiki-first, hierarchical memory system for Slack channels is viable. However, the demo-stage implementation has 15 validated weaknesses: cluster linking is a no-op, the query classifier uses brittle regex, memory quality is 5.25/10, temporal decay is never applied, and there is no support for relational queries. See [`weakness-resolution-map.md`](weakness-resolution-map.md) for the full mapping of v1 weaknesses to v2 fixes.
+
+**Beever Atlas v2** redesigns the system around two complementary memory systems:
+
+- **Semantic Memory (Weaviate)** — Hierarchical 3-tier memory (improved from v1) handling factual, topic-based, and overview queries via hybrid BM25+vector search. Handles ~80% of queries. Cheap, fast. → [`02-semantic-memory.md`](02-semantic-memory.md)
+- **Graph Memory (Neo4j)** — Flexible knowledge graph capturing entity relationships and temporal evolution from conversations. Handles relational queries that semantic search can't answer. ~20% of queries. → [`03-graph-memory.md`](03-graph-memory.md)
+- **Smart Router** — LLM-powered query understanding that routes to Semantic, Graph, or both in parallel based on query type and cost optimization. → [`04-query-router.md`](04-query-router.md)
+
+**Design Principle**: Each memory system does what it's best at. They don't duplicate each other's work. Weaviate owns facts and topics. Neo4j owns entities and relationships. The router decides which to use.
+
+---
+
+## 1.1 Technology Stack
+
+| Layer | Technology | Purpose |
+|-------|-----------|---------|
+| **Agent Framework** | [Google ADK](https://google.github.io/adk-docs/) (Python) | Orchestrates all LLM-powered operations as composable agents (routing, extraction, response generation). Replaces direct LLM API calls. → [`13-adk-integration.md`](13-adk-integration.md) |
+| **Chat Bot** | [Vercel Chat SDK](https://chat-sdk.dev/) (TypeScript) | Real-time conversational interface across Slack, Teams, Discord. Handles mentions, follow-ups, action buttons. → [`13-adk-integration.md`](13-adk-integration.md) |
+| **Backend API** | FastAPI (Python) | MCP server + REST API. Shared service layer for both interfaces. → [`12-api-design.md`](12-api-design.md) |
+| **Semantic Store** | Weaviate 1.28 | 3-tier hierarchical memory with hybrid BM25+vector search. → [`02-semantic-memory.md`](02-semantic-memory.md) |
+| **Graph Store** | Neo4j 5.26 + APOC | Flexible knowledge graph with temporal tracking and multi-hop traversal. → [`03-graph-memory.md`](03-graph-memory.md) |
+| **State Store** | MongoDB 7.0 | Sync state, wiki cache, write intents (outbox), quality logs. → [`07-deployment.md`](07-deployment.md) |
+| **Session Store** | Redis 7 | Chat SDK conversation state. → [`13-adk-integration.md`](13-adk-integration.md) |
+| **Embeddings** | Jina v4 (2048-dim) | Multimodal named vectors (text, image, doc). → [`05-ingestion-pipeline.md`](05-ingestion-pipeline.md) |
+| **LLM (fast)** | Gemini 2.0 Flash Lite | Query routing, fact extraction, entity extraction, classification. Fallback: Claude Haiku 4.5 via LiteLLM. → [`08-resilience.md`](08-resilience.md) |
+| **LLM (quality)** | Gemini 2.0 Flash | Response generation, wiki synthesis. Fallback: Claude Sonnet 4.6 via LiteLLM. → [`08-resilience.md`](08-resilience.md) |
+| **Web Search** | Tavily API | External knowledge grounding (best practices, docs). → [`04-query-router.md`](04-query-router.md) |
+| **Frontend** | React 19 + TypeScript + Vite + TailwindCSS + shadcn/ui | Web dashboard for knowledge exploration, graph visualization, admin. → [`11-frontend-design.md`](11-frontend-design.md) |
+| **Graph Viz** | cytoscape.js | Interactive knowledge graph canvas in frontend. → [`11-frontend-design.md`](11-frontend-design.md) |
+| **Observability** | OpenTelemetry | Distributed tracing, metrics, health checks across all services. → [`09-observability.md`](09-observability.md) |
+| **Ingestion** | Python adapters (slack-sdk, MS Graph, discord.py) | Batch historical message fetch from all platforms. → [`05-ingestion-pipeline.md`](05-ingestion-pipeline.md) |
+
+```
+┌─────────────────────────────────────────────────────────────────────────┐
+│                        BEEVER ATLAS v2 OVERVIEW                        │
+│                                                                         │
+│                         ┌──────────────┐                                │
+│                         │  Smart Query │                                │
+│              ┌──────────│    Router    │──────────┐                     │
+│              │          └──────────────┘          │                     │
+│              ▼                                    ▼                     │
+│  ┌─────────────────────────┐     ┌─────────────────────────┐          │
+│  │   SEMANTIC MEMORY       │     │    GRAPH MEMORY         │          │
+│  │   (Weaviate)            │     │    (Neo4j)              │          │
+│  │                         │     │                         │          │
+│  │  Tier 0: Summary        │     │  Flexible entities:     │          │
+│  │  Tier 1: Topic Clusters │     │  Person, Decision,      │          │
+│  │  Tier 2: Atomic Facts   │     │  Project, Technology,   │          │
+│  │                         │     │  Team, Meeting, ...     │          │
+│  │  Hybrid BM25+Vector     │     │  Flexible relationships │          │
+│  │  Cross-modal (img/pdf)  │     │  Temporal tracking      │          │
+│  │  Wiki-first (free reads)│     │  Multi-hop traversal    │          │
+│  │                         │     │                         │          │
+│  │  "What was discussed?"  │     │  "Who decided what?"    │          │
+│  │  "Find docs about X"   │     │  "How did X evolve?"    │          │
+│  │  "Show me the overview" │     │  "What blocks project?" │          │
+│  │                         │     │                         │          │
+│  │  ~80% of queries        │     │  ~20% of queries        │          │
+│  │  < 200ms, low cost      │     │  200ms-1s, medium cost  │          │
+│  └────────────┬────────────┘     └────────────┬────────────┘          │
+│               │                                │                       │
+│               └────────────┬───────────────────┘                       │
+│                            ▼                                           │
+│                   ┌──────────────┐                                     │
+│                   │   Response   │                                     │
+│                   │  Generator   │──▶  Grounded answer + citations     │
+│                   └──────────────┘                                     │
+│                                                                         │
+│  ┌──────────┐    ┌──────────────┐    ┌──────────────┐                  │
+│  │  Slack   │    │  Ingestion   │    │   MongoDB    │                  │
+│  │  Teams   │───▶│  Pipeline    │    │  (state +    │                  │
+│  │  Discord │    │              │───▶│   wiki cache)│                  │
+│  └──────────┘    └──────┬───────┘    └──────────────┘                  │
+│                         │                                               │
+│                    Writes to BOTH                                       │
+│                  Weaviate AND Neo4j                                     │
+└─────────────────────────────────────────────────────────────────────────┘
+
+> **ADK Agent Layer:** All LLM-powered components above (Query Router, Response
+> Generator, Ingestion Pipeline) are implemented as [Google ADK](https://google.github.io/adk-docs/)
+> agents. The Query Router is the root `LlmAgent`, retrieval runs via `ParallelAgent`
+> (semantic + graph), ingestion via `SequentialAgent`, and consolidation via `LoopAgent`.
+> Store operations are wrapped as ADK `FunctionTool` instances. Model fallback is
+> handled by LiteLLM. See [`13-adk-integration.md`](13-adk-integration.md) for the
+> full agent hierarchy and tool mapping.
+```
+
+---
+
+## 2. v1 Weaknesses Summary
+
+Validated against the v1 codebase. Each weakness has a specific fix in v2. See [`weakness-resolution-map.md`](weakness-resolution-map.md) for full detail.
+
+### Critical
+| # | Weakness | v2 Fix |
+|---|----------|--------|
+| 1.11 | Cluster linking is a no-op | Actually write `cluster_id` to atomic memories in Weaviate |
+| 1.3 | Detail queries bypass hierarchy | Two-stage topic-first retrieval (Solution A) |
+| 1.13 | Memory quality 5.25/10 | Quality gate: reject vague facts, max 2 per message |
+| 1.10 | Brittle regex classifier | LLM-powered query understanding (flash-lite) |
+
+### High
+| # | Weakness | v2 Fix |
+|---|----------|--------|
+| 1.4 | Temporal decay never applied | Wire `apply_temporal_decay()` into retrieval ranking |
+| 1.1 | Top-down only retrieval | Bidirectional expansion (up + down) |
+| 1.2 | Meaningless expansion thresholds | Score-based expansion (`max_score < 0.6`) |
+| 1.6 | Slack only | Python adapter layer with NormalizedMessage |
+
+### Medium
+| # | Weakness | v2 Fix |
+|---|----------|--------|
+| 1.5 | No feedback loop | Citation tracking + retrieval quality metrics |
+| 1.7 | No real-time sync | Optional Chat SDK webhook bridge (Phase 2) |
+| 1.12 | No cross-channel search | Graph memory naturally spans channels |
+| 1.14 | No adaptive alpha | Wire `get_adaptive_alpha()` (pass `alpha=None`) |
+| 1.15 | No semantic dedup | Jaccard similarity dedup across tiers |
+
+---
+
+## 3. Dual-Memory Architecture
+
+### 3.1 Design Principle: Separation of Concerns
+
+Each memory system handles what it's naturally best at. **They do not duplicate each other.**
+
+| | Semantic Memory (Weaviate) | Graph Memory (Neo4j) |
+|---|---|---|
+| **What it stores** | Facts, summaries, topic clusters, multimodal content | Entities, relationships, temporal evolution |
+| **How it's structured** | 3-tier hierarchy (summary → topics → facts) | Flexible knowledge graph (nodes + edges) |
+| **How it's queried** | BM25 + vector hybrid search | Cypher graph traversal |
+| **What questions it answers** | "What was discussed about X?", "Show overview", "Find docs" | "Who decided X?", "What blocks Y?", "How did Z evolve?" |
+| **Query share** | ~80% (most questions are factual/topical) | ~20% (relational/temporal) |
+| **Cost** | Low (embedding search only) | Medium (graph traversal + Weaviate enrichment) |
+| **Latency** | < 200ms | 200ms-1s |
+
+**Why not just one?**
+- Weaviate can't do multi-hop traversal: "Person → works on → Project → has decision → blocked by → Constraint" requires a graph
+- Neo4j can't do fuzzy semantic search across 10K facts with BM25+vector hybrid ranking
+- Using both gives us the best of GraphRAG (from reference papers): vector search for finding relevant content + graph traversal for navigating relationships
+
+### 3.2 How the Two Memories Connect
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                  MEMORY INTERCONNECTION                               │
+│                                                                      │
+│  INGESTION (writes to BOTH):                                        │
+│                                                                      │
+│  Message: "Alice decided to use RS256 for JWT — blocked by          │
+│            Carol's security review"                                  │
+│       │                                                              │
+│       ├──▶ WEAVIATE: Atomic fact stored with embedding              │
+│       │    memory: "Alice decided to use RS256 for JWT,             │
+│       │             blocked by Carol's security review"             │
+│       │    id: uuid-abc-123                                          │
+│       │    graph_entity_ids: [neo4j-1, neo4j-2, neo4j-3]           │
+│       │                                                              │
+│       └──▶ NEO4J: Entities + relationships extracted                │
+│            Person(Alice) ──DECIDED──▶ Decision(Use RS256)           │
+│            Decision(Use RS256) ──USES──▶ Technology(JWT)            │
+│            Decision(Use RS256) ──BLOCKED_BY──▶ Person(Carol)        │
+│            All entities ──MENTIONED_IN──▶ Event(weaviate_id:        │
+│                                                uuid-abc-123)        │
+│                                                                      │
+│  QUERY (reads from ONE or BOTH):                                    │
+│                                                                      │
+│  "What was discussed about JWT?"                                    │
+│    → Router: SEMANTIC → Weaviate hybrid search → fast, cheap        │
+│                                                                      │
+│  "Who decided to use RS256?"                                        │
+│    → Router: GRAPH → Neo4j traversal:                               │
+│      Decision(RS256) ←DECIDED── Person(Alice)                       │
+│      → Follow episodic edge → Weaviate(uuid-abc-123) for full text │
+│                                                                      │
+│  "Tell me about the JWT migration"                                  │
+│    → Router: BOTH (ambiguous) → run in parallel:                    │
+│      Weaviate: semantic facts about JWT                             │
+│      Neo4j: entities related to JWT (people, decisions, blockers)   │
+│      → Merge, dedup, rank → comprehensive answer                   │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+The cross-reference mechanism:
+- Every Weaviate atomic fact stores `graph_entity_ids` — the Neo4j node IDs of entities mentioned in that fact
+- Every Neo4j entity node stores a `MENTIONED_IN` edge to an `Event` node, which holds the `weaviate_id` of the source fact
+- This bidirectional linking allows graph queries to pull full text from Weaviate, and semantic queries to optionally enrich results with graph context
+
+---
+
+## 4. Individual Component Docs
+
+Each memory system, pipeline stage, and operational concern is documented separately:
+
+### Data Layer
+- [`02-semantic-memory.md`](02-semantic-memory.md) — Weaviate 3-tier schema, retrieval improvements, temporal decay, quality boost
+- [`03-graph-memory.md`](03-graph-memory.md) — Neo4j flexible schema, entity scoping, traversal methods, episodic linking
+
+### Query & Retrieval
+- [`04-query-router.md`](04-query-router.md) — Query decomposition, LLM understanding, cost-optimized routing, external search (Tavily)
+- [`06-wiki-generation.md`](06-wiki-generation.md) — Wiki template, cost breakdown, consolidation → cache flow
+
+### Ingestion
+- [`05-ingestion-pipeline.md`](05-ingestion-pipeline.md) — 6-stage pipeline, multi-platform adapters, quality gates, entity extraction, contradiction detection, outbox pattern
+
+### Interfaces
+- [`12-api-design.md`](12-api-design.md) — MCP tools + REST API spec, response schemas, rate limiting, error handling
+- [`11-frontend-design.md`](11-frontend-design.md) — React web dashboard, pages, component architecture, interaction flows
+- [`13-adk-integration.md`](13-adk-integration.md) — Google ADK agent hierarchy, Vercel Chat SDK bot, model config, ADK tools
+
+### Operations
+- [`07-deployment.md`](07-deployment.md) — Docker Compose, MCP tool spec, module structure
+- [`08-resilience.md`](08-resilience.md) — Circuit breakers, degradation matrix, LLM fallback chain, outbox write safety
+- [`09-observability.md`](09-observability.md) — Health endpoints, metrics, distributed tracing, backups, cross-store consistency
+- [`10-access-control.md`](10-access-control.md) — Channel ACL from platform membership, auth middleware, private channel filtering
+
+### Context & Decisions
+- [`decisions.md`](decisions.md) — Key design decisions, open questions, research paper integration
+- [`weakness-resolution-map.md`](weakness-resolution-map.md) — v1 → v2 weakness fix mapping (all 15 weaknesses, all 8 solutions)
+- [`reference-papers.md`](reference-papers.md) — Detailed analysis of 9 research papers/frameworks informing the design
diff --git a/docs/v2/02-semantic-memory.md b/docs/v2/02-semantic-memory.md
new file mode 100644
index 00000000..3122903d
--- /dev/null
+++ b/docs/v2/02-semantic-memory.md
@@ -0,0 +1,273 @@
+# Semantic Memory: Weaviate 3-Tier Design
+
+## Context
+
+Beever Atlas uses a dual-memory architecture. This document specifies the **semantic memory layer** (Weaviate), which handles approximately **80% of all queries** — factual lookups, topical questions, and multimodal content retrieval. The other 20% of queries (relational, temporal, multi-hop) are handled by the graph memory layer; see [`03-graph-memory.md`](./03-graph-memory.md).
+
+These two stores are complementary, not redundant. Weaviate cannot perform multi-hop graph traversal ("who decided X, and what blocked them?"), and Neo4j cannot do fuzzy BM25+vector hybrid ranking across tens of thousands of atomic facts. For how data enters both stores simultaneously during ingestion, see [`05-ingestion-pipeline.md`](./05-ingestion-pipeline.md).
+
+---
+
+## 3.2 Semantic Memory: Weaviate (3-Tier, Improved)
+
+The v1 hierarchical design was sound — the implementation was broken. v2 keeps the 3-tier architecture but fixes every weakness.
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│              SEMANTIC MEMORY: WEAVIATE (3-Tier)                      │
+│                                                                      │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 0: Channel Summary                                      │  │
+│  │  • Channel-level overview ("what's happening?")               │  │
+│  │  • Updated by consolidation service                           │  │
+│  │  • Used for wiki overview section                             │  │
+│  │  • Query: "Catch me up", "Overview", "Status update"          │  │
+│  │  • Access: FREE (cached, no LLM needed)                       │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                              │                                       │
+│                    consolidates from                                  │
+│                              ▼                                       │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 1: Topic Clusters                                       │  │
+│  │  • Grouped memories by topic (authentication, deployment...)  │  │
+│  │  • Each cluster has: summary, member_ids, topic_tags          │  │
+│  │  • member_ids ACTUALLY LINKED to Tier 2 atomics (v1 fix!)    │  │
+│  │  • Used for topic-level questions and wiki topic sections     │  │
+│  │  • Query: "Tell me about auth", "What about deployment?"     │  │
+│  │  • Access: FREE (cached, no LLM needed)                       │  │
+│  │                                                                │  │
+│  │  v2 FIXES:                                                     │  │
+│  │  ✓ _link_memories_to_cluster() actually writes cluster_id    │  │
+│  │  ✓ MERGE-based dedup prevents duplicate clusters              │  │
+│  │  ✓ Two-stage topic-first retrieval (coarse → fine)           │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                              │                                       │
+│                    consolidates from                                  │
+│                              ▼                                       │
+│  ┌───────────────────────────────────────────────────────────────┐  │
+│  │  TIER 2: Atomic Facts                                         │  │
+│  │  • Individual facts with full metadata and citations          │  │
+│  │  • Named vectors: text (2048-dim), image, doc (Jina v4)      │  │
+│  │  • Cross-modal search (text query → find images/PDFs)         │  │
+│  │  • Quality-scored at extraction (v2: reject < 0.5)            │  │
+│  │  • Linked to Neo4j via graph_entity_ids                       │  │
+│  │  • Query: "What exactly did Alice say?", "Find the diagram"  │  │
+│  │  • Access: PAID (uses embedding for search)                   │  │
+│  └───────────────────────────────────────────────────────────────┘  │
+│                                                                      │
+│  Wiki-First Cost Optimization (preserved from v1):                  │
+│  • Tier 0 + Tier 1 reads = FREE (pre-generated, cached)            │
+│  • Tier 2 search = CHEAP (embedding only, ~$0.001)                  │
+│  • LLM synthesis = PAID (only when needed, ~$0.02)                  │
+│  • Average query cost: ~$0.01 (5x cheaper than competitors)         │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Weaviate Schema
+
+```python
+properties = [
+    # === Core ===
+    Property(name="memory", data_type=DataType.TEXT),
+    Property(name="channel_id", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="source", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="platform", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="timestamp", data_type=DataType.NUMBER),
+
+    # === Hierarchy (FIXED in v2) ===
+    Property(name="tier", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="cluster_id", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="member_ids", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="member_count", data_type=DataType.INT),
+
+    # === Graph Linkage (NEW) ===
+    Property(name="graph_entity_ids", data_type=DataType.TEXT_ARRAY,
+             description="Neo4j node IDs extracted from this memory"),
+
+    # === Quality (NEW) ===
+    Property(name="quality_score", data_type=DataType.NUMBER),
+
+    # === Temporal (NEW) ===
+    Property(name="valid_at", data_type=DataType.DATE),
+    Property(name="invalid_at", data_type=DataType.DATE),
+
+    # === Tagging ===
+    Property(name="topic_tags", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="entity_tags", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="action_tags", data_type=DataType.TEXT_ARRAY, skip_vectorization=True),
+    Property(name="importance", data_type=DataType.TEXT, skip_vectorization=True),
+
+    # === Citations ===
+    Property(name="message_ts", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="thread_ts", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="user_name", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="slack_user_id", data_type=DataType.TEXT, skip_vectorization=True),
+
+    # === Files ===
+    Property(name="file_id", data_type=DataType.TEXT, skip_vectorization=True),
+    Property(name="filename", data_type=DataType.TEXT, skip_vectorization=True),
+]
+# Named vectors: text_vector, image_vector, doc_vector (2048-dim Jina v4)
+```
+
+---
+
+## Retrieval Improvements (All 15 Weaknesses Fixed)
+
+> **ADK Implementation:** The retrieval methods below are exposed as ADK `FunctionTool` instances on the `semantic_agent`: `search_weaviate_hybrid`, `get_tier0_summary`, `get_tier1_clusters`. The agent orchestrates the multi-step retrieval flow (coarse-to-fine, expansion, decay, dedup). See [`13-adk-integration.md`](13-adk-integration.md).
+
+```python
+class ImprovedSemanticRetriever:
+    """Weaviate retrieval with all v1 weaknesses fixed."""
+
+    async def retrieve(self, query: str, channel_id: str | None,
+                       query_understanding: QueryUnderstanding) -> list[dict]:
+
+        depth = query_understanding.semantic_depth  # "overview", "topic", "detail", "auto"
+
+        if depth == "overview":
+            # Tier 0 → optional expand to Tier 1
+            memories = await self._retrieve_summary(channel_id, query)
+            if self._should_expand(memories, "down"):
+                memories += await self._retrieve_clusters(channel_id, query)
+
+        elif depth == "topic":
+            # FIX 1.3 + 1.11: Two-stage topic-first retrieval
+            # Stage 1 (coarse): Find relevant topic clusters
+            clusters = await self._retrieve_clusters(
+                channel_id, query,
+                topic_filter=query_understanding.topics,
+                alpha=None,  # FIX 1.14: Adaptive alpha
+            )
+            # Stage 2 (fine): Search atomics WITHIN matched clusters
+            if clusters:
+                member_ids = self._collect_member_ids(clusters)
+                atomics = await self._retrieve_atomics_scoped(
+                    channel_id, query, member_ids,
+                    alpha=None,  # FIX 1.14
+                )
+                memories = clusters + atomics
+            else:
+                # No matching clusters → fall back to global atomic search
+                memories = await self._retrieve_atomics(channel_id, query)
+
+            # FIX 1.1: Bidirectional — expand UP if results are weak
+            if self._should_expand(memories, "up"):
+                summaries = await self._retrieve_summary(channel_id, query)
+                memories = self._merge_and_rerank(memories, summaries)
+
+        else:  # detail
+            # Direct atomic search, with optional upward expansion
+            memories = await self._retrieve_atomics(
+                channel_id, query, alpha=None,  # FIX 1.14
+            )
+            # FIX 1.1: Can expand UP to clusters for broader context
+            if self._should_expand(memories, "up"):
+                clusters = await self._retrieve_clusters(channel_id, query)
+                memories = self._merge_and_rerank(memories, clusters)
+
+        # FIX 1.4: Apply temporal decay to ranking
+        self._apply_temporal_decay(memories)
+
+        # FIX 1.13: Quality-weighted ranking boost
+        self._apply_quality_boost(memories)
+
+        # FIX 1.15: Semantic dedup across tiers
+        memories = self._semantic_dedup(memories)
+
+        return memories[:max_results]
+
+    def _should_expand(self, memories: list, direction: str) -> bool:
+        """FIX 1.2: Score-based expansion, not count-based."""
+        if not memories:
+            return True
+        scores = [m.get("score", 0) for m in memories]
+        return max(scores) < 0.6 or (sum(scores) / len(scores)) < 0.4
+
+    def _apply_temporal_decay(self, memories: list) -> None:
+        """FIX 1.4: Actually apply the existing temporal decay function."""
+        for m in memories:
+            days_ago = self._days_since(m.get("timestamp"))
+            m["score"] = self.temporal_decay.apply(m["score"], days_ago, m)
+        memories.sort(key=lambda m: m.get("score", 0), reverse=True)
+
+    def _apply_quality_boost(self, memories: list) -> None:
+        """FIX 1.13: Quality-weighted ranking — good memories score higher."""
+        for m in memories:
+            quality = m.get("quality_score", 0.5)
+            m["score"] = m["score"] * (0.7 + 0.3 * quality)
+        memories.sort(key=lambda m: m.get("score", 0), reverse=True)
+
+    def _semantic_dedup(self, memories: list, threshold=0.85) -> list:
+        """FIX 1.15: Remove near-duplicates across tiers."""
+        unique = []
+        for mem in memories:
+            is_dup = any(
+                self._jaccard_similarity(mem["memory"], e["memory"]) > threshold
+                for e in unique
+            )
+            if not is_dup:
+                unique.append(mem)
+        return unique
+```
+
+---
+
+## 4.3 Temporal Decay Configuration
+
+```python
+class TemporalDecay:
+    """Ebbinghaus-based temporal decay with exemptions and reinforcement."""
+    DEFAULT_DECAY_RATE = 0.1
+
+    # Facts with these action_tags decay at half rate
+    SLOW_DECAY_TAGS = {"decision", "architecture", "policy", "deadline"}
+
+    # Facts with these importance levels are exempt from decay
+    EXEMPT_IMPORTANCE = {"high", "critical"}
+
+    def apply(self, score: float, days_ago: float, fact: dict) -> float:
+        """Apply temporal decay to a retrieval score."""
+        if fact.get("importance") in self.EXEMPT_IMPORTANCE:
+            return score  # No decay for high-importance facts
+
+        rate = self.DEFAULT_DECAY_RATE
+        # Half decay for architectural decisions
+        if any(tag in self.SLOW_DECAY_TAGS for tag in fact.get("action_tags", [])):
+            rate *= 0.5
+
+        # Citation reinforcement: cited facts decay slower
+        citation_count = fact.get("citation_count", 0)
+        if citation_count > 0:
+            rate = rate / (1 + 0.1 * citation_count)
+
+        decay = math.exp(-rate * (days_ago / 30))
+        return score * decay
+```
+
+**Decay behavior at `DECAY_RATE = 0.1`:**
+
+| Fact Age | Score Multiplier | Effect |
+|----------|-----------------|--------|
+| 1 day | 0.997 | Essentially no decay |
+| 7 days | 0.977 | Minimal (~2% reduction) |
+| 30 days | 0.905 | Mild (~10% reduction) |
+| 90 days | 0.741 | Moderate (~26% reduction) |
+| 180 days | 0.549 | Significant (~45% reduction) |
+| 365 days | 0.295 | Strong (~70% reduction) |
+
+**Exemptions:**
+- Facts tagged `importance: "high"` or `"critical"` → no decay
+- Facts tagged `action_tags: ["decision", "architecture", "policy"]` → half decay rate (0.05)
+- Facts cited 5+ times → effective rate drops to ~0.067
+
+**Configuration:**
+```python
+# In config.py Settings
+decay_rate: float = 0.1
+decay_slow_tags: list[str] = ["decision", "architecture", "policy", "deadline"]
+decay_exempt_importance: list[str] = ["high", "critical"]
+decay_reinforcement_factor: float = 0.1
+```
diff --git a/docs/v2/03-graph-memory.md b/docs/v2/03-graph-memory.md
new file mode 100644
index 00000000..bc4482b6
--- /dev/null
+++ b/docs/v2/03-graph-memory.md
@@ -0,0 +1,328 @@
+# Graph Memory: Neo4j Flexible Knowledge Graph
+
+## Context
+
+Beever Atlas uses a dual-memory architecture. This document specifies the **graph memory layer** (Neo4j), which handles approximately **20% of all queries** — relational questions, temporal evolution tracking, and multi-hop traversal. The other 80% of queries (factual, topical, multimodal) are handled by the semantic memory layer; see [`02-semantic-memory.md`](./02-semantic-memory.md).
+
+Neo4j handles what Weaviate fundamentally cannot: multi-hop traversal ("Person → works on → Project → has decision → blocked by → Constraint"), temporal chains ("how did this decision evolve?"), and precision relational lookups ("who owns this project?"). Graph results are routinely enriched by following episodic edges back to Weaviate to retrieve the original fact text and Slack citations. For how data enters both stores simultaneously during ingestion, see [`05-ingestion-pipeline.md`](./05-ingestion-pipeline.md).
+
+---
+
+## 3.3 Graph Memory: Neo4j (Flexible)
+
+The graph memory captures **relationship meaning** from conversations — things that semantic search fundamentally cannot handle.
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                GRAPH MEMORY: Neo4j (Flexible)                        │
+│                                                                      │
+│  PURPOSE: Capture WHO did WHAT, WHEN, and HOW things RELATE         │
+│                                                                      │
+│  ┌────────────────────────────────────────────────────────────────┐ │
+│  │  GUIDED-FLEXIBLE ENTITY SCHEMA                                 │ │
+│  │                                                                │ │
+│  │  All nodes share a base:                                       │ │
+│  │  ┌──────────────────────────────────┐                         │ │
+│  │  │  name:        str    (required)  │                         │ │
+│  │  │  entity_type: str    (required)  │                         │ │
+│  │  │  description: str    (optional)  │                         │ │
+│  │  │  channel:     str               │                         │ │
+│  │  │  platform:    str               │                         │ │
+│  │  │  properties:  dict   (flexible) │                         │ │
+│  │  │  created_at:  datetime          │                         │ │
+│  │  │  updated_at:  datetime          │                         │ │
+│  │  └──────────────────────────────────┘                         │ │
+│  │                                                                │ │
+│  │  Core types (LLM prefers these):                              │ │
+│  │  Person, Decision, Project, Technology                        │ │
+│  │                                                                │ │
+│  │  Extension types (LLM creates as needed):                     │ │
+│  │  Team, Meeting, Artifact, Constraint, Budget, Deadline, ...   │ │
+│  │                                                                │ │
+│  │  Event node (episodic anchor):                                │ │
+│  │  ┌──────────────────────────────────┐                         │ │
+│  │  │  weaviate_id: str  → links to   │                         │ │
+│  │  │               Weaviate atomic    │                         │ │
+│  │  │  timestamp:   datetime           │                         │ │
+│  │  │  channel:     str               │                         │ │
+│  │  └──────────────────────────────────┘                         │ │
+│  └────────────────────────────────────────────────────────────────┘ │
+│                                                                      │
+│  ┌────────────────────────────────────────────────────────────────┐ │
+│  │  FLEXIBLE RELATIONSHIPS                                        │ │
+│  │                                                                │ │
+│  │  NOT a fixed list — LLM extracts whatever relationship        │ │
+│  │  best captures the meaning:                                    │ │
+│  │                                                                │ │
+│  │  Common patterns:                                              │ │
+│  │  Person  ──DECIDED──▶       Decision                          │ │
+│  │  Person  ──WORKS_ON──▶      Project                           │ │
+│  │  Person  ──MEMBER_OF──▶     Team                              │ │
+│  │  Decision──AFFECTS──▶       Project                           │ │
+│  │  Decision──SUPERSEDES──▶    Decision  (temporal evolution)    │ │
+│  │  Decision──BLOCKED_BY──▶    Constraint                        │ │
+│  │  Decision──USES──▶          Technology                        │ │
+│  │  Project ──DEPENDS_ON──▶    Project                           │ │
+│  │  Meeting ──PRODUCED──▶      Decision                          │ │
+│  │  Any     ──MENTIONED_IN──▶  Event     (episodic link)        │ │
+│  │  Any     ──ALIAS_OF──▶     Any       (entity dedup)          │ │
+│  │                                                                │ │
+│  │  Bidirectional edges (auto-created during ingestion):         │ │
+│  │  DECIDED ↔ DECIDED_BY, BLOCKED_BY ↔ BLOCKS,                  │ │
+│  │  WORKS_ON ↔ HAS_MEMBER, OWNS ↔ OWNED_BY                      │ │
+│  │                                                                │ │
+│  │  LLM can create ANY relationship type. The graph adapts       │ │
+│  │  to whatever patterns exist in the organization's             │ │
+│  │  conversations.                                                │ │
+│  │                                                                │ │
+│  │  Temporal properties on ALL relationships:                    │ │
+│  │  • valid_from:  datetime                                      │ │
+│  │  • valid_until: datetime (null = currently valid)             │ │
+│  │  • created_at:  datetime (bi-temporal tracking)               │ │
+│  │  • confidence:  float                                         │ │
+│  └────────────────────────────────────────────────────────────────┘ │
+│                                                                      │
+│  EPISODIC LINKING (graph ↔ Weaviate):                               │
+│  • Every graph entity connects to Event nodes                       │
+│  • Event.weaviate_id → points to atomic fact in Weaviate           │
+│  • Enables: graph traversal → find entities → follow episodic      │
+│    edges → retrieve original fact text + Slack citations            │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Entity Scoping Rules
+
+Global entities (Person, Technology, Project, Team) are **MERGED by name only** — the same entity spans all channels. Channel-scoped entities (Decision, Meeting, Artifact) are **MERGED by name + channel**.
+
+| Entity Type | Scope | Merge Key | Rationale |
+|---|---|---|---|
+| Person | global | name | Alice is Alice everywhere |
+| Technology | global | name | React is React everywhere |
+| Project | global | name | Project names span channels |
+| Team | global | name | Teams span channels |
+| Decision | channel | name + channel | Decisions are channel-contextual |
+| Meeting | channel | name + channel | Meetings are channel-contextual |
+| Artifact | channel | name + channel | Docs are channel-contextual |
+| _(extension types)_ | channel | name + channel | Default for LLM-created types |
+
+---
+
+## Neo4j Implementation
+
+```python
+class Neo4jStore:
+    """Flexible graph memory — any entity type, any relationship type.
+
+    Entity scoping: Global entities (Person, Technology, Project, Team) are
+    MERGED by name only — the same entity spans all channels. Channel-scoped
+    entities (Decision, Meeting, Artifact) are MERGED by name + channel.
+    """
+
+    QUERY_TIMEOUT_MS = 5000  # Hard limit on all graph queries
+
+    # Cross-channel scoping rules
+    ENTITY_SCOPING = {
+        "Person":     "global",    # Alice is Alice everywhere
+        "Technology": "global",    # React is React everywhere
+        "Project":    "global",    # Project names span channels
+        "Team":       "global",    # Teams span channels
+        "Decision":   "channel",   # Decisions are channel-contextual
+        "Meeting":    "channel",   # Meetings are channel-contextual
+        "Artifact":   "channel",   # Docs are channel-contextual
+        # Extension types default to "channel"
+    }
+
+    async def upsert_entity(self, entity: dict) -> str:
+        """Create/update entity with scope-aware MERGE."""
+        entity_type = entity["type"]
+        scope = self.ENTITY_SCOPING.get(entity_type, "channel")
+
+        if scope == "global":
+            # Global: MERGE on name only, track channels as array
+            cypher = f"""
+                MERGE (n:{entity_type} {{name: $name}})
+                ON CREATE SET n += $props, n.created_at = datetime(),
+                              n.channels = [$channel],
+                              n.quality_score = $quality_score
+                ON MATCH SET n += $props, n.updated_at = datetime(),
+                             n.channels = CASE
+                               WHEN NOT $channel IN n.channels
+                               THEN n.channels + $channel
+                               ELSE n.channels END,
+                             n.quality_score = CASE
+                               WHEN $quality_score > n.quality_score
+                               THEN $quality_score ELSE n.quality_score END
+                RETURN id(n) as node_id
+            """
+        else:
+            # Channel-scoped: MERGE on name + channel
+            cypher = f"""
+                MERGE (n:{entity_type} {{name: $name, channel: $channel}})
+                ON CREATE SET n += $props, n.created_at = datetime(),
+                              n.quality_score = $quality_score
+                ON MATCH SET n += $props, n.updated_at = datetime(),
+                             n.quality_score = CASE
+                               WHEN $quality_score > n.quality_score
+                               THEN $quality_score ELSE n.quality_score END
+                RETURN id(n) as node_id
+            """
+        return await self.execute(cypher,
+            name=entity["name"], channel=entity.get("channel"),
+            quality_score=entity.get("quality_score", 0.5),
+            props={k: v for k, v in entity.get("properties", {}).items()
+                   if v is not None},
+        )
+
+    async def upsert_relationship(self, rel: dict) -> None:
+        """Create relationship with scope-aware matching + provenance."""
+        rel_type = rel["type"]
+        source_scope = self.ENTITY_SCOPING.get(rel.get("source_type"), "channel")
+        target_scope = self.ENTITY_SCOPING.get(rel.get("target_type"), "channel")
+
+        source_match = "{name: $source}" if source_scope == "global" \
+                       else "{name: $source, channel: $channel}"
+        target_match = "{name: $target}" if target_scope == "global" \
+                       else "{name: $target, channel: $channel}"
+
+        cypher = f"""
+            MATCH (s {source_match})
+            MATCH (t {target_match})
+            MERGE (s)-[r:{rel_type}]->(t)
+            SET r.context = $context,
+                r.source_channel = $channel,
+                r.valid_from = coalesce($valid_from, datetime()),
+                r.created_at = datetime(),
+                r.confidence = $confidence,
+                r.evidence = $evidence,
+                r.source_message_id = $source_message_id,
+                r.source_fact_id = $source_fact_id,
+                r.extracted_at = datetime()
+        """
+        await self.execute(cypher, **rel)
+
+    async def create_episodic_link(self, entity_name: str, weaviate_id: str,
+                                    channel: str, timestamp: float) -> None:
+        """Link a graph entity to its source fact in Weaviate."""
+        # Try global match first, then channel-scoped
+        await self.execute("""
+            MATCH (n)
+            WHERE n.name = $name
+              AND (n.channel = $channel OR $channel IN n.channels)
+            MERGE (e:Event {weaviate_id: $wid})
+            ON CREATE SET e.channel = $channel, e.timestamp = $ts
+            MERGE (n)-[:MENTIONED_IN]->(e)
+        """, name=entity_name, channel=channel, wid=weaviate_id, ts=timestamp)
+
+    async def traverse(self, start_entities: list[str], channel: str = None,
+                       max_hops: int = 2) -> list[dict]:
+        """Bounded, directed traversal with APOC path expansion."""
+        return await self.execute_with_timeout("""
+            MATCH (start)
+            WHERE start.name IN $entities
+              AND ($channel IS NULL
+                   OR start.channel = $channel
+                   OR $channel IN start.channels)
+            CALL apoc.path.expandConfig(start, {
+                minLevel: 1,
+                maxLevel: $max_hops,
+                uniqueness: 'NODE_GLOBAL',
+                limit: 50,
+                relationshipFilter: '>'
+            }) YIELD path
+            WHERE all(r IN relationships(path) WHERE
+                r.valid_until IS NULL OR r.valid_until > datetime())
+            RETURN path
+        """, entities=start_entities, channel=channel, max_hops=max_hops)
+
+    async def temporal_chain(self, entity_name: str, channel: str = None) -> list[dict]:
+        """Bounded SUPERSEDES chain (max 5 hops, distinct per level)."""
+        return await self.execute_with_timeout("""
+            MATCH (d:Decision)
+            WHERE d.name CONTAINS $name
+              AND ($channel IS NULL OR d.channel = $channel
+                   OR $channel IN d.channels)
+            MATCH path = (d)-[:SUPERSEDES*0..5]->(older:Decision)
+            WITH DISTINCT older, path
+            RETURN path ORDER BY older.valid_from DESC
+            LIMIT 20
+        """, name=entity_name, channel=channel)
+
+    async def comprehensive_traverse(self, start_entities: list[str],
+                                      channel: str = None,
+                                      max_hops: int = 3,
+                                      max_nodes: int = 200) -> dict:
+        """Collect-all traversal: gather ALL relationships within N hops,
+        then let the LLM analyze relevance. Inspired by Forensic Eyes'
+        Phase 16 pattern — avoids brittleness from pre-filtering edge types.
+
+        Use for complex graph queries where relationship types are diverse
+        and pre-filtering risks missing cross-cutting context.
+
+        Returns structured subgraph JSON for LLM analysis.
+        """
+        return await self.execute_with_timeout("""
+            MATCH (start)
+            WHERE start.name IN $entities
+              AND ($channel IS NULL
+                   OR start.channel = $channel
+                   OR $channel IN start.channels)
+            CALL apoc.path.expandConfig(start, {
+                minLevel: 1,
+                maxLevel: $max_hops,
+                uniqueness: 'NODE_GLOBAL',
+                limit: $max_nodes
+            }) YIELD path
+            WITH path, relationships(path) AS rels, nodes(path) AS ns
+            WHERE all(r IN rels WHERE
+                r.valid_until IS NULL OR r.valid_until > datetime())
+            UNWIND rels AS r
+            WITH DISTINCT r, startNode(r) AS src, endNode(r) AS tgt,
+                 type(r) AS rel_type
+            RETURN src.name AS source, src.entity_type AS source_type,
+                   tgt.name AS target, tgt.entity_type AS target_type,
+                   rel_type, r.context AS context,
+                   r.confidence AS confidence,
+                   r.evidence AS evidence,
+                   r.source_message_id AS source_message_id
+            ORDER BY r.confidence DESC
+        """, entities=start_entities, channel=channel,
+             max_hops=max_hops, max_nodes=max_nodes)
+
+    async def get_episodic_weaviate_ids(self, node_ids: list[int]) -> list[str]:
+        """Get Weaviate IDs for enriching graph results with full text."""
+        return await self.execute("""
+            MATCH (n)-[:MENTIONED_IN]->(e:Event)
+            WHERE id(n) IN $ids
+            RETURN e.weaviate_id
+        """, ids=node_ids)
+
+    async def execute_with_timeout(self, cypher: str, **params) -> list[dict]:
+        """Execute with transaction timeout — returns [] on timeout."""
+        try:
+            async with self.driver.session() as session:
+                result = await session.run(cypher, **params,
+                                            timeout=self.QUERY_TIMEOUT_MS)
+                return await result.data()
+        except TransientError:
+            logger.warning(f"Graph traversal timed out: {cypher[:80]}...")
+            return []  # Retriever falls back to semantic-only
+```
+
+---
+
+## Method Reference
+
+| Method | Purpose | Returns on timeout |
+|---|---|---|
+| `upsert_entity` | Create/update node with scope-aware MERGE | n/a (write) |
+| `upsert_relationship` | Create edge with provenance fields | n/a (write) |
+| `create_episodic_link` | Bind entity → Event → Weaviate atomic | n/a (write) |
+| `traverse` | Bounded N-hop APOC path expansion | not applicable (non-timeout path) |
+| `temporal_chain` | SUPERSEDES chain up to 5 hops | `[]` |
+| `comprehensive_traverse` | Collect-all subgraph for LLM analysis | `[]` |
+| `get_episodic_weaviate_ids` | Fetch Weaviate IDs from Event nodes | n/a (fast lookup) |
+| `execute_with_timeout` | Underlying runner with 5s hard limit | `[]` |
+
+All traversal methods return `[]` on `TransientError` (timeout), allowing the query router to fall back to semantic-only results rather than failing the request.
diff --git a/docs/v2/04-query-router.md b/docs/v2/04-query-router.md
new file mode 100644
index 00000000..c4f92d8b
--- /dev/null
+++ b/docs/v2/04-query-router.md
@@ -0,0 +1,270 @@
+# Smart Query Router
+
+> **Status**: Design spec — the query routing logic and agents described here are **in development**. The ingestion pipeline is implemented (see `05-ingestion-pipeline.md`); the Q&A routing layer is next. Only a placeholder agent (`agents/query/echo.py`) currently exists.
+
+Queries arrive from the API layer and pass through three steps before any retrieval happens: decomposition into sub-queries, LLM-powered understanding of each sub-query, and routing to one or both memory stores (or external search). Results from all branches are merged into a single ranked response.
+
+Underlying stores: see [`02-semantic-memory.md`](./02-semantic-memory.md) (Weaviate) and [`03-graph-memory.md`](./03-graph-memory.md) (Neo4j).
+
+> **ADK Implementation:** The entire query flow is orchestrated by the `query_router_agent` (an ADK `LlmAgent`), which delegates to a `retrieval_pipeline` (`ParallelAgent` running `semantic_agent` + `graph_agent`) and a `response_agent`. The behavioral specs below describe *what* each step does; the ADK agent hierarchy in [`13-adk-integration.md`](13-adk-integration.md) describes *how* they are orchestrated.
+
+---
+
+## 4.0 Query Decomposition
+
+Complex questions are decomposed into focused parallel sub-queries before routing. This was a key v1 feature that the v2 router must preserve and enhance.
+
+```python
+class QueryDecomposer:
+    """Decompose complex questions into parallel sub-queries.
+
+    Example:
+    "What auth method did we decide on and how does it compare to best practices?"
+    → internal_queries:
+        - {"query": "authentication decision JWT", "focus": "decision"}
+        - {"query": "OAuth implementation alice", "focus": "implementation"}
+    → external_queries:
+        - {"query": "JWT vs OAuth best practices 2025", "focus": "comparison"}
+    """
+
+    async def decompose(self, question: str) -> QueryPlan:
+        """Break down a question into internal + external sub-queries."""
+        # Fast path: simple questions → single internal query, no decomposition
+        if self._is_simple(question):
+            return QueryPlan(
+                internal_queries=[{"query": question, "focus": "direct"}],
+                external_queries=[],
+            )
+
+        # Complex questions → LLM decomposition (flash-lite)
+        plan = await self._llm_decompose(question)
+        return plan  # 2-4 internal + 0-2 external queries
+
+DECOMPOSITION_PROMPT = """
+You are a query decomposition specialist. Break down this question into
+focused sub-queries that can be executed in parallel.
+
+OUTPUT JSON:
+{
+    "internal_queries": [
+        {"query": "specific search terms", "focus": "what this targets"}
+    ],
+    "external_queries": [
+        {"query": "web search terms", "focus": "what to learn from web"}
+    ]
+}
+
+RULES:
+1. Generate 2-4 focused internal queries for different aspects
+2. Generate 0-2 external queries ONLY if best practices / documentation
+   comparison is needed
+3. Internal queries should be keyword-focused (not full sentences)
+4. If the question is simple, a single internal query suffices
+"""
+```
+
+The decomposed sub-queries are then each routed independently through the Query Understanding step below, enabling parallel execution across both memory systems AND external search.
+
+---
+
+## 4.1 ADK Agent-Powered Query Understanding
+
+Replaces the brittle regex classifier (weakness 1.10) with the `query_router_agent` (ADK `LlmAgent`, ~$0.001/query using flash-lite). The prompt below serves as the agent's system instruction:
+
+```python
+QUERY_UNDERSTANDING_PROMPT = """
+Classify this query for a team communication knowledge base.
+
+Query: {query}
+Channel: {channel_name}
+
+Determine:
+1. route: One of:
+   - "semantic": Looking for facts, discussions, topics, documents
+     Examples: "What was discussed about auth?", "Find deployment docs", "Overview"
+   - "graph": Looking for entity relationships, people, decisions, temporal changes
+     Examples: "Who decided X?", "What is Alice working on?", "What blocks project Y?"
+   - "both": Could benefit from both fact retrieval AND relationship context
+     Examples: "Tell me about the JWT migration", "What happened with the auth project?"
+2. semantic_depth: "overview" | "topic" | "detail" (for Weaviate tier routing)
+3. entities: Named entities mentioned (people, projects, technologies)
+4. topics: Topic areas referenced
+5. temporal_scope: "recent" | "any" | "historical"
+6. confidence: 0.0-1.0
+
+Output JSON.
+"""
+```
+
+---
+
+## 4.2 Routing Strategy: Cost-Optimized
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                       SMART QUERY ROUTER                             │
+│                                                                      │
+│  User Query                                                          │
+│      │                                                               │
+│      ▼                                                               │
+│  ┌──────────────────────────────────────┐                           │
+│  │  QUERY UNDERSTANDING (LLM flash-lite)│  ~$0.001/query            │
+│  │                                      │                           │
+│  │  route: semantic | graph | both      │                           │
+│  │  semantic_depth: overview|topic|detail│                           │
+│  │  entities: ["Alice", "JWT"]          │                           │
+│  │  topics: ["authentication"]          │                           │
+│  │  confidence: 0.0-1.0                 │                           │
+│  └──────┬──────────┬──────────┬─────────┘                           │
+│         │          │          │                                      │
+│    route=semantic  │     route=both                                  │
+│    conf > 0.7      │     OR conf ≤ 0.7                              │
+│         │     route=graph    │                                      │
+│         │     conf > 0.7     │                                      │
+│         ▼          ▼         ▼                                      │
+│  ┌──────────┐ ┌────────┐ ┌────────────────┐                       │
+│  │ SEMANTIC │ │ GRAPH  │ │ BOTH PARALLEL  │                       │
+│  │ ONLY     │ │ ONLY   │ │                │                       │
+│  │          │ │        │ │ Semantic  Graph│                       │
+│  │ Weaviate │ │ Neo4j  │ │ search + trav. │                       │
+│  │ 3-tier   │ │ + Weav.│ │ in parallel   │                       │
+│  │ retrieval│ │ enrich │ │                │                       │
+│  │          │ │        │ │ Merge results  │                       │
+│  │ $0.001   │ │ $0.005 │ │ $0.006        │                       │
+│  │ < 200ms  │ │ ~500ms │ │ ~500ms        │                       │
+│  └────┬─────┘ └───┬────┘ └───────┬────────┘                       │
+│       │           │              │                                  │
+│       │    ┌──────┘              │                                  │
+│       │    │ Fallback: if graph  │                                  │
+│       │    │ results insufficient│                                  │
+│       │    │ → also run semantic │                                  │
+│       │    │                     │                                  │
+│       └────┴─────────┬───────────┘                                  │
+│                      ▼                                               │
+│  ┌──────────────────────────────────────┐                           │
+│  │  RESULT MERGER + RESPONSE GENERATOR  │                           │
+│  │                                      │                           │
+│  │  1. Deduplicate by weaviate_id      │                           │
+│  │  2. Boost cross-validated results   │                           │
+│  │  3. Apply temporal decay            │                           │
+│  │  4. Quality-score weighted ranking  │                           │
+│  │  5. Generate grounded response      │                           │
+│  │     via response_agent (ADK)        │                           │
+│  └──────────────────────────────────────┘                           │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+### Routing Decision Table
+
+| Query Pattern | Route | Why | Cost | Latency |
+|---|---|---|---|---|
+| "What was discussed about auth?" | Semantic | Factual lookup → Weaviate excels | $0.001 | < 200ms |
+| "Show me the overview" | Semantic (Tier 0) | Cached summary → FREE | $0 | < 50ms |
+| "Tell me about deployment" | Semantic (Tier 1) | Topic cluster → FREE | $0 | < 50ms |
+| "Find the architecture diagram" | Semantic (cross-modal) | Image search → Weaviate only | $0.001 | < 200ms |
+| "Who decided to use JWT?" | Graph | Person→Decision traversal | $0.005 | ~500ms |
+| "What is Alice working on?" | Graph | Person→Project traversal | $0.005 | ~500ms |
+| "How did the auth approach evolve?" | Graph (temporal) | Decision→SUPERSEDES chain | $0.005 | ~500ms |
+| "What blocks the migration?" | Graph | Project→BLOCKED_BY traversal | $0.005 | ~500ms |
+| "Tell me about the JWT migration" | Both (parallel) | Needs facts AND relationships | $0.006 | ~500ms |
+| "What happened with auth last week?" | Both (parallel) | Temporal + factual | $0.006 | ~500ms |
+
+---
+
+## 4.3 External Search (Tavily)
+
+The v1 external search via Tavily is preserved in v2. It handles factual queries that require web knowledge (best practices, documentation, industry comparisons) — things NOT in the team's Slack history.
+
+```python
+class ExternalSearchService:
+    """Web search via Tavily API for grounding with external knowledge.
+
+    Why Tavily:
+    - Cost-effective: 1,000 free credits/month vs $35/1K (Google)
+    - Multiple tools: search, extract, crawl
+    - No model restrictions: works with any LLM
+    - Designed for AI/RAG: optimized for LLM consumption
+    """
+
+    async def search(self, query: str, search_depth: str = "basic",
+                     max_results: int = 5, include_answer: bool = True,
+                     include_domains: list[str] | None = None,
+                     exclude_domains: list[str] | None = None,
+                     ) -> ExternalSearchResponse:
+        """Search the web. Returns results + optional AI-generated answer."""
+        ...
+
+    async def search_documentation(self, query: str,
+                                    technology: str | None = None,
+                                    max_results: int = 5,
+                                    ) -> ExternalSearchResponse:
+        """Optimized for finding API docs, tutorials, official docs."""
+        ...
+
+    async def extract_content(self, urls: list[str]) -> dict[str, str]:
+        """Extract clean content from specific URLs."""
+        ...
+```
+
+**Integration with Query Decomposition:**
+
+When the `QueryDecomposer` produces `external_queries`, they are executed via Tavily in parallel with internal queries:
+
+```
+Complex Query → QueryDecomposer
+  ├─ internal_queries → [routed to Semantic/Graph in parallel]
+  └─ external_queries → [executed via Tavily in parallel]
+      → Results merged into response context
+```
+
+**Routing decision:** The router classifies `external` queries via the decomposer, not via the query understanding LLM. Only queries that need web knowledge (comparisons, docs, best practices) generate external sub-queries.
+
+| Config | Default |
+|--------|---------|
+| `TAVILY_API_KEY` | Required for external search |
+| `ENABLE_EXTERNAL_SEARCH` | `true` |
+| `TAVILY_SEARCH_DEPTH` | `"basic"` (1 credit) or `"advanced"` (2 credits) |
+| `TAVILY_MAX_RESULTS` | `5` |
+
+---
+
+## 4.4 Graph Retrieval with Weaviate Enrichment
+
+When the router selects Graph, Neo4j finds the relationships, then follows **episodic edges** back to Weaviate for the actual source text and citations:
+
+```python
+class GraphRetriever:
+    """System-2: Neo4j traversal + Weaviate enrichment."""
+
+    async def retrieve(self, query: str, channel_id: str | None,
+                       understanding: QueryUnderstanding) -> list[dict]:
+
+        # Step 1: Resolve entities from query to Neo4j nodes
+        matched = await self.neo4j.fuzzy_match_entities(
+            understanding.entities, channel_id
+        )
+        if not matched:
+            return []  # No entities found → fallback to semantic
+
+        # Step 2: Graph traversal (1-2 hops)
+        if understanding.temporal_scope == "historical":
+            paths = await self.neo4j.temporal_chain(matched[0], channel_id)
+        else:
+            paths = await self.neo4j.traverse(
+                [m.name for m in matched], channel_id, max_hops=2
+            )
+
+        # Step 3: Follow episodic edges → get Weaviate memory IDs
+        node_ids = self._extract_node_ids(paths)
+        weaviate_ids = await self.neo4j.get_episodic_weaviate_ids(node_ids)
+
+        # Step 4: Fetch full memories from Weaviate (text + citations)
+        memories = await self.weaviate.fetch_by_ids(weaviate_ids)
+
+        # Step 5: Combine graph structure + memory content
+        return self._merge_graph_and_memories(paths, memories)
+```
+
+The episodic edge pattern is what makes graph queries grounded: Neo4j provides structure and relationships, but the actual text and citations always come from Weaviate. Neither store is queried in isolation for graph-routed requests.
+
+> **ADK Implementation:** The `GraphRetriever` methods above are wrapped as ADK `FunctionTool` instances (`traverse_neo4j`, `temporal_chain`) on the `graph_agent` sub-agent. The Weaviate enrichment step uses `search_weaviate_hybrid`. See [`13-adk-integration.md`](13-adk-integration.md) for the full tool mapping.
diff --git a/docs/v2/05-ingestion-pipeline.md b/docs/v2/05-ingestion-pipeline.md
new file mode 100644
index 00000000..378dce2d
--- /dev/null
+++ b/docs/v2/05-ingestion-pipeline.md
@@ -0,0 +1,436 @@
+# Ingestion Pipeline
+
+Messages from any platform enter the pipeline as a `NormalizedMessage` and pass through **6 stages** before being written to both Weaviate and Neo4j. The pipeline is the single write path for all memory — nothing is written to the stores directly.
+
+Target stores: see [`02-semantic-memory.md`](./02-semantic-memory.md) (Weaviate) and [`03-graph-memory.md`](./03-graph-memory.md) (Neo4j).
+
+> **ADK Implementation:** The 6-stage pipeline is orchestrated by the `create_ingestion_pipeline` factory (an ADK `SequentialAgent`), which chains: `PreprocessorAgent` → parallel(`FactExtractorAgent`, `EntityExtractorAgent`) → parallel(`EmbedderAgent`, `CrossBatchValidatorAgent`) → `PersisterAgent`. Store operations are wrapped as ADK `FunctionTool` instances. For large syncs, the Gemini Batch API can be used instead via `BatchPipelineRunner` in `services/batch_pipeline.py`. See [`13-adk-integration.md`](13-adk-integration.md) for the full agent hierarchy.
+
+---
+
+## 5.1 Multi-Platform Adapters
+
+**Chat SDK Evaluation**: The [Vercel Chat SDK](https://chat-sdk.dev/) is TypeScript-only and designed for bot webhooks — it **cannot fetch message history**. We use Python adapters for batch ingestion, with optional Chat SDK for real-time (Phase 2).
+
+```python
+@dataclass
+class NormalizedMessage:
+    """Unified message model across all platforms."""
+    content: str
+    author: AuthorInfo
+    platform: Platform           # slack | teams | discord
+    channel_id: str
+    channel_name: str
+    message_id: str
+    timestamp: datetime
+    thread_id: str | None = None
+    attachments: list[Attachment] = field(default_factory=list)
+    reactions: list[str] = field(default_factory=list)
+    reply_count: int = 0
+    raw_metadata: dict = field(default_factory=dict)
+
+class BaseAdapter(ABC):
+    @abstractmethod
+    async def fetch_history(self, channel_id, since=None, limit=500) -> list[NormalizedMessage]: ...
+
+class SlackAdapter(BaseAdapter):    # slack-sdk (Python)
+class TeamsAdapter(BaseAdapter):    # Microsoft Graph API
+class DiscordAdapter(BaseAdapter):  # discord.py
+```
+
+---
+
+## 5.2 Pipeline: Writes to Both Memory Systems
+
+```
+┌─────────────────────────────────────────────────────────────────────┐
+│                      INGESTION PIPELINE (6 Stages)                   │
+│                                                                      │
+│  NormalizedMessage (from any adapter)                                │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 1: PREPROCESS                                                 │
+│  • Slack mrkdwn → markdown, thread context assembly                  │
+│  • Bot/system message filtering                                      │
+│  • Media processing: images (Gemini vision), PDFs (pypdf),          │
+│    large PDFs chunked into virtual messages                          │
+│         │                                                            │
+│         ▼  ┌──────────────────────────────────────────────────────┐  │
+│  STAGE 2: │  PARALLEL EXTRACTION                                 │  │
+│         │ │                                                        │  │
+│         │ │  FactExtractorAgent (ADK / Gemini Flash Lite)         │  │
+│         │ │  • Extract atomic facts from message + media context  │  │
+│         │ │  • Quality gate: score ≥ 0.5, max 2 facts/message    │  │
+│         │ │                                                        │  │
+│         │ │  EntityExtractorAgent (ADK / Gemini Flash Lite)       │  │
+│         │ │  • Extract entities + relationships (guided-flexible)  │  │
+│         │ │  • Entity quality gate: confidence ≥ 0.6              │  │
+│         │ │  • Filter hypotheticals & sarcasm                     │  │
+│         │ └──────────────────────────────────────────────────────┘  │
+│         │                                                            │
+│         ▼  ┌──────────────────────────────────────────────────────┐  │
+│  STAGE 3: │  PARALLEL ENRICHMENT                                 │  │
+│         │ │                                                        │  │
+│         │ │  EmbedderAgent                                        │  │
+│         │ │  • Jina v4 embeddings (2048-dim, named vectors)       │  │
+│         │ │  • Multimodal: separate text + image vectors          │  │
+│         │ │                                                        │  │
+│         │ │  CrossBatchValidatorAgent                             │  │
+│         │ │  • Resolve entity aliases across message batches      │  │
+│         │ │  • Validate relationship consistency                  │  │
+│         │ │  • Merge alias variants discovered across chunks      │  │
+│         │ └──────────────────────────────────────────────────────┘  │
+│         │                                                            │
+│         ▼                                                            │
+│  STAGE 4: PERSIST (Outbox Pattern)                                   │
+│  │                                                                   │
+│  ├──▶ MONGODB: Write intent document (atomic)                        │
+│  │    {fact, entities, embeddings, status: {weaviate: pending, ...}} │
+│  │                                                                   │
+│  ├──▶ WEAVIATE: Upsert atomic fact (idempotent, deterministic UUID)  │
+│  │    Mark intent.status.weaviate = "done"                           │
+│  │                                                                   │
+│  ├──▶ NEO4J: MERGE entities + relationships (idempotent via MERGE)   │
+│  │    Mark intent.status.neo4j = "done" (skip if Neo4j unavailable)  │
+│  │                                                                   │
+│  └──▶ MONGODB: Update sync state, mark intent complete               │
+│       Background reconciler retries "pending"/"failed" every 15min   │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+> **Batch API mode**: For large initial syncs, the pipeline can run via Gemini Batch API (`use_batch_api=true` on the sync endpoint). `BatchPipelineRunner` submits extraction batches asynchronously, polls for completion, and retries failed batches with smaller sizes. Progress is tracked in MongoDB and visible in the sync status API.
+
+---
+
+## 5.3 Entity Extraction Prompt (Guided-Flexible)
+
+```python
+ENTITY_EXTRACTION_PROMPT = """
+Extract entities and relationships from this message.
+
+CORE ENTITY TYPES (prefer these when applicable):
+- Person: individual (fields: name, role, team)
+- Decision: concrete choice (fields: summary, status, rationale, date)
+- Project: initiative (fields: name, status, description)
+- Technology: tool/framework (fields: name, category)
+
+EXTENSION TYPES (use when content doesn't fit core types):
+- Create any type: Team, Meeting, Artifact, Constraint, Deadline, Budget, ...
+
+RELATIONSHIPS:
+- Use descriptive verb phrases: DECIDED, WORKS_ON, BLOCKED_BY, OWNS, ...
+- NOT limited to a fixed set — use whatever captures the meaning
+- Include temporal context when available
+
+EXISTING ENTITIES (reuse names to avoid duplicates):
+{existing_entities}
+
+OUTPUT JSON:
+{
+  "entities": [{"type": "...", "name": "...", "properties": {...},
+                "aliases": ["alternative name 1", "@slack_handle", ...]}],
+  "relationships": [{"source": "...", "type": "...", "target": "...",
+                      "context": "...", "temporal": "current|supersedes:<old>",
+                      "evidence": "exact quote or paraphrase from message",
+                      "confidence": 0.0-1.0}],
+  "confidence": 0.0-1.0
+}
+
+ALIAS RULES:
+- Map all name variants to a canonical form: "Alice", "@alice", "alice.chen" → "Alice Chen"
+- Include Slack handles, nicknames, abbreviated names as aliases
+- For projects: "Atlas", "beever-atlas", "the atlas project" → canonical name
+"""
+```
+
+---
+
+## 5.4 Quality Gate (MemoryQualityGate)
+
+Applied at Stage 2. Rejects low-signal facts before embedding to keep Weaviate clean.
+
+```python
+class MemoryQualityGate:
+    MIN_LENGTH = 40
+    MAX_FACTS_PER_MESSAGE = 2
+    MIN_QUALITY_SCORE = 0.5
+    VAGUE_PATTERNS = ["the user", "the process", "this was", "it was",
+                      "the output", "as mentioned", "was adjusted"]
+
+    def score_fact(self, fact: str) -> float:
+        score = 1.0
+        if len(fact) < self.MIN_LENGTH: score -= 0.3
+        for p in self.VAGUE_PATTERNS:
+            if p in fact.lower(): score -= 0.2
+        if any(w[0].isupper() for w in fact.split()[1:] if len(w) > 1): score += 0.1
+        if fact.startswith(("It ", "This ", "That ")): score -= 0.15
+        return max(0.0, min(1.0, score))
+```
+
+Facts scoring below `MIN_QUALITY_SCORE` (0.5) are dropped. Each message produces at most 2 facts to prevent over-extraction from verbose messages.
+
+---
+
+## 5.5 Entity Quality Gate (EntityQualityGate)
+
+Applied at Stage 3. Prevents graph pollution from low-confidence or hypothetical entities.
+
+```python
+class EntityQualityGate:
+    """Quality gate for entity extraction — prevents graph pollution.
+
+    Inspired by Forensic Eyes' per-category confidence thresholds:
+    higher bars for high-stakes relationships, lower for casual mentions.
+    """
+    MIN_ENTITY_CONFIDENCE = 0.6
+
+    # Per-relationship-type confidence thresholds
+    # Higher bar for relationships with greater semantic commitment
+    RELATIONSHIP_CONFIDENCE = {
+        "DECIDED":      0.7,   # Decisions must be clearly stated
+        "OWNS":         0.6,   # Ownership/responsibility requires clarity
+        "LEADS":        0.6,   # Leadership roles require clarity
+        "BLOCKED_BY":   0.6,   # Blockers must be explicit
+        "SUPERSEDES":   0.7,   # Temporal evolution must be unambiguous
+        "WORKS_ON":     0.4,   # Work associations are common and casual
+        "MENTIONS":     0.3,   # Low bar — just needs to be real
+        "MEMBER_OF":    0.4,   # Team membership is usually clear
+        "USES":         0.4,   # Technology usage is common
+        "DEPENDS_ON":   0.5,   # Dependencies should be stated
+        "_DEFAULT":     0.5,   # Fallback for LLM-created relationship types
+    }
+
+    HYPOTHETICAL_PATTERNS = [
+        "maybe", "might", "could", "should we", "what if",
+        "let's just", "hypothetically", "joking", "kidding",
+    ]
+
+    def filter_entities(self, extraction_result: dict,
+                         source_message: str) -> dict:
+        """Reject low-confidence entities and hypothetical references."""
+        if extraction_result.get("confidence", 0) < self.MIN_ENTITY_CONFIDENCE:
+            return {"entities": [], "relationships": []}
+
+        # Raise threshold for hypothetical/sarcastic messages
+        msg_lower = source_message.lower()
+        threshold = 0.8 if any(p in msg_lower for p in self.HYPOTHETICAL_PATTERNS) \
+                       else self.MIN_ENTITY_CONFIDENCE
+
+        valid_entities = [
+            e for e in extraction_result.get("entities", [])
+            if self._score_entity(e) >= threshold
+        ]
+
+        # Only keep relationships where both endpoints survived filtering
+        valid_names = {e["name"] for e in valid_entities}
+        valid_rels = [
+            r for r in extraction_result.get("relationships", [])
+            if r["source"] in valid_names and r["target"] in valid_names
+               and r.get("confidence", 0.5) >= self.RELATIONSHIP_CONFIDENCE.get(
+                   r.get("type", ""), self.RELATIONSHIP_CONFIDENCE["_DEFAULT"])
+        ]
+
+        return {"entities": valid_entities, "relationships": valid_rels}
+
+    def _score_entity(self, entity: dict) -> float:
+        score = entity.get("confidence", 0.5)
+        if entity.get("properties", {}).get("role"): score += 0.1
+        if entity.get("properties", {}).get("date"): score += 0.1
+        if entity["name"].lower() in ("it", "this", "that", "someone"): score -= 0.5
+        return max(0.0, min(1.0, score))
+```
+
+---
+
+## 5.6 Contradiction Detection
+
+Contradictory facts are detected and resolved via SUPERSEDES chains. This runs as a **background job every 15 minutes** (not blocking ingestion).
+
+```python
+class ContradictionDetector:
+    """Detect and resolve contradictory facts via LLM comparison."""
+
+    SIMILARITY_RANGE = (0.70, 0.95)  # Cosine similarity range for candidates
+    CONFIDENCE_THRESHOLD = 0.8       # Auto-supersede above this
+
+    async def detect_batch(self):
+        """Process recently ingested facts for contradictions."""
+        recent = await self.weaviate.get_facts_since(
+            minutes_ago=15, has_contradiction_check=False)
+
+        for fact in recent:
+            await self._check_contradictions(fact)
+            await self.weaviate.mark_contradiction_checked(fact.id)
+
+    async def _check_contradictions(self, new_fact: dict):
+        # METHOD 1: Cosine similarity scan (catches rephrased contradictions)
+        similar = await self.weaviate.search_similar(
+            new_fact["memory"],
+            channel_id=new_fact["channel_id"],
+            min_similarity=self.SIMILARITY_RANGE[0],
+            max_similarity=self.SIMILARITY_RANGE[1],
+            exclude_id=new_fact["id"],
+            limit=5,
+        )
+
+        # METHOD 2: Entity-scoped scan (catches same-topic contradictions
+        # regardless of text similarity — e.g., "Alice is auth lead" vs "Bob is auth lead")
+        if new_fact.get("graph_entity_ids"):
+            entity_related = await self.neo4j.get_facts_for_entities(
+                new_fact["graph_entity_ids"],
+                exclude_weaviate_id=new_fact["id"])
+            similar.extend(entity_related)
+
+        # LLM comparison for each candidate pair
+        for candidate in similar:
+            result = await self._llm_compare(new_fact, candidate)
+            if result["classification"] == "CONTRADICTORY" \
+               and result["confidence"] > self.CONFIDENCE_THRESHOLD:
+                await self._supersede(older=candidate, newer=new_fact,
+                                       reason=result["reason"])
+
+    async def _supersede(self, older, newer, reason):
+        # Mark old fact as invalidated in Weaviate
+        await self.weaviate.update(older["id"], {
+            "invalid_at": datetime.utcnow().isoformat(),
+            "superseded_by": newer["id"],
+            "supersession_reason": reason,
+        })
+
+        # Create SUPERSEDES edge in Neo4j if both have graph entities
+        if newer.get("graph_entity_ids") and older.get("graph_entity_ids"):
+            await self.neo4j.create_supersedes_edge(
+                newer_entity_ids=newer["graph_entity_ids"],
+                older_entity_ids=older["graph_entity_ids"],
+                reason=reason)
+```
+
+**Contradiction comparison prompt:**
+
+```python
+CONTRADICTION_PROMPT = """Compare these two facts from the same channel:
+
+EXISTING (created {old_timestamp}):
+"{old_memory}"
+
+NEW (created {new_timestamp}):
+"{new_memory}"
+
+Classify the relationship:
+- CONTRADICTORY: The new fact replaces or invalidates the old fact
+- PROGRESSIVE: The new fact builds on or extends the old fact (not a contradiction)
+- INDEPENDENT: Different topics, no relationship
+
+Examples:
+- "We use JWT with HS256" → "We switched to RS256 for JWT" = CONTRADICTORY
+- "We use PostgreSQL for users" → "We use MongoDB for analytics" = INDEPENDENT
+- "Alice is exploring Kubernetes" → "Alice deployed to Kubernetes" = PROGRESSIVE
+- "Alice is auth lead" → "Bob is the new auth lead" = CONTRADICTORY
+- "Sprint deadline is March 15" → "Sprint deadline extended to March 22" = CONTRADICTORY
+
+Respond in JSON: {"classification": "...", "confidence": 0.0-1.0, "reason": "..."}"""
+```
+
+**Cost:** ~$0.001 per comparison (Gemini Flash Lite). Typically 0-5 comparisons per new fact. Negligible at scale.
+
+**Retrieval integration:** The `ImprovedSemanticRetriever` filters by `invalid_at IS NULL` — superseded facts are automatically excluded from results without any retrieval code changes.
+
+---
+
+## 5.7 Consolidation Schedule & Triggers
+
+Consolidation builds Tier 0 (channel summaries) and Tier 1 (topic clusters) from Tier 2 (atomic facts). Without consolidation, the wiki has nothing to serve and the "80% free reads" promise doesn't work.
+
+> **ADK Implementation:** Consolidation is orchestrated by the `consolidation_agent` (an ADK `LoopAgent`) containing `cluster_assigner` and `health_checker` sub-agents. See [`13-adk-integration.md`](13-adk-integration.md).
+
+**Three trigger types:**
+
+```python
+class ConsolidationService:
+    """Manages cluster building, summary updates, and wiki refresh."""
+
+    # TRIGGER 1: After sync (incremental — new facts only)
+    async def on_sync_complete(self, channel_id: str):
+        """Runs automatically when a channel sync finishes."""
+        unclustered = await self.weaviate.get_unclustered_facts(channel_id)
+        if not unclustered:
+            return
+
+        touched = await self._assign_to_clusters(channel_id, unclustered)
+        await self._update_cluster_summaries(channel_id, touched)
+        await self._update_channel_summary(channel_id)
+        await self.mongo.mark_wiki_dirty(channel_id)
+
+    # TRIGGER 2: Scheduled full rebuild (daily 2 AM UTC)
+    @scheduled(cron="0 2 * * *")
+    async def daily_full_consolidation(self):
+        """Re-evaluates all clusters: coherence, split/merge, summaries."""
+        for channel_id in await self.get_active_channels():
+            await self._full_reconsolidate(channel_id)
+            await self._rebuild_wiki(channel_id)
+
+    # TRIGGER 3: On-demand via API
+    async def manual_trigger(self, channel_id: str):
+        """Manual refresh for admin use or after bulk operations."""
+        await self._full_reconsolidate(channel_id)
+        await self._rebuild_wiki(channel_id)
+
+    async def _assign_to_clusters(self, channel_id, new_facts) -> set:
+        """Incremental: assign new facts to existing or new clusters."""
+        existing = await self.weaviate.get_tier1_clusters(channel_id)
+        touched = set()
+
+        for fact in new_facts:
+            best_match, best_score = None, 0.0
+            for cluster in existing:
+                score = await self._topic_similarity(fact, cluster)
+                if score > best_score:
+                    best_match, best_score = cluster, score
+
+            if best_score > 0.6:
+                await self.weaviate.link_fact_to_cluster(fact.id, best_match.id)
+                touched.add(best_match.id)
+            else:
+                # New cluster seed — promoted when 3+ members accumulate
+                new_id = await self.weaviate.create_cluster_seed(channel_id, fact)
+                touched.add(new_id)
+
+        return touched
+```
+
+**Cluster health rules** (applied during daily full reconsolidation):
+
+| Condition | Action |
+|-----------|--------|
+| Cluster > 100 members | Split via k-means on embeddings into 2-3 sub-clusters |
+| Two clusters have summary cosine > 0.85 | Merge into single cluster |
+| Cluster coherence score < 0.4 | Re-cluster members from scratch |
+| Cluster has 0 members | Delete cluster |
+
+**Wiki dirty flag** — ensures wiki reflects latest changes:
+
+```python
+# In wiki_cache.py
+async def get_wiki(self, channel_id: str) -> str:
+    cached = await self.cache.find_one({"channel_id": channel_id})
+    dirty = await self.dirty_flags.find_one({"channel_id": channel_id})
+
+    if cached and (not dirty or not dirty.get("dirty")):
+        return cached["content"]  # FREE read — no LLM cost
+
+    # Regenerate: consolidation or entity changes made wiki stale
+    wiki = await self.builder.build(channel_id)
+    await self.cache.update_one(
+        {"channel_id": channel_id},
+        {"$set": {"content": wiki, "generated_at": datetime.utcnow()}},
+        upsert=True)
+    await self.dirty_flags.update_one(
+        {"channel_id": channel_id}, {"$set": {"dirty": False}})
+    return wiki
+```
+
+**What triggers `mark_wiki_dirty`:**
+- After sync → consolidation assigns new facts to clusters
+- Entity extraction writes new Person/Decision/Technology to Neo4j
+- Contradiction detector supersedes a fact
+- Manual reconsolidation trigger
diff --git a/docs/v2/06-wiki-generation.md b/docs/v2/06-wiki-generation.md
new file mode 100644
index 00000000..0351b02f
--- /dev/null
+++ b/docs/v2/06-wiki-generation.md
@@ -0,0 +1,861 @@
+# Wiki Generation
+
+The wiki combines both memory systems to produce a comprehensive, **pageable, hierarchical knowledge base** for each channel — similar to how DeepWiki generates multi-page documentation for repositories. Each channel wiki consists of **fixed structural pages** (always present) and **agent-generated topic pages** (dynamically created based on channel content). Large channels naturally get deeper wikis with more pages and sub-sections.
+
+> **Status**: Implemented. The WikiCompiler, WikiBuilder, and WikiCache are all operational. Sub-topic depth (3+ levels) and some fixed pages (Glossary, Tech Stack, Projects) are planned for a future phase.
+Every wiki page supports rich content: Mermaid diagrams, charts, tables, lists, callout boxes, inline citations with original message permalinks, entity chips, and embedded media references.
+
+Design informed by research across 14+ platforms: DeepWiki, Notion, Confluence, Guru, Tettra, Slite, Glean, Dashworks, Devin Wiki, Mem.ai, Sana AI, Microsoft Copilot Pages, Google NotebookLM, Dust.tt, and Slack AI.
+
+---
+
+## Architecture: Fixed Pages + Agent-Generated Pages
+
+The wiki has two kinds of pages:
+
+| Kind | Pages | How they're built |
+|------|-------|-------------------|
+| **Fixed** | Overview, People, Decisions, Recent Activity, FAQ, Glossary, Resources & Media | Template-driven — `wiki_builder.py` fills structured templates from Weaviate + Neo4j data |
+| **Agent-generated** | Topic pages (e.g., "Authentication", "Infrastructure") and their sub-pages | `consolidation_agent` (ADK) analyzes Tier 1 clusters and generates page structure, deciding how to split large topics into sub-pages |
+
+This hybrid approach means:
+- Small channels (50 messages, 2 topics) get a compact wiki: Overview + 2 topic pages + People + Decisions
+- Large channels (10,000 messages, 20 topics) get a deep wiki: Overview + 20 topic pages (some with sub-pages) + People + Decisions + Tech Stack + Projects + full Glossary + rich FAQ
+
+The agent decides the depth — not the template.
+
+---
+
+## Wiki Structure & Navigation
+
+### Sidebar Navigation (DeepWiki-style)
+
+Every channel wiki has a persistent left sidebar showing the page hierarchy:
+
+```
+#backend-engineering Wiki
+─────────────────────────
+
+1. Overview                    ← fixed
+2. Topics                      ← section header
+   2.1 Authentication          ← agent-generated
+     2.1.1 JWT Migration       ← agent-generated sub-page
+     2.1.2 OAuth Integration   ← agent-generated sub-page
+   2.2 Infrastructure          ← agent-generated
+     2.2.1 AWS EKS Setup       ← agent-generated sub-page
+     2.2.2 Terraform Modules   ← agent-generated sub-page
+   2.3 CI/CD Pipeline          ← agent-generated
+   2.4 API Design              ← agent-generated
+3. People & Experts            ← fixed
+4. Decisions                   ← fixed
+5. Tech Stack                  ← fixed
+6. Projects                    ← fixed
+7. Recent Activity             ← fixed
+8. FAQ                         ← fixed
+9. Glossary                    ← fixed
+10. Resources & Media          ← fixed
+```
+
+- Numbered hierarchy like DeepWiki
+- Current page highlighted in sidebar
+- Collapsible sections (topics collapse/expand)
+- Page count badge next to "Topics" showing total topic pages
+- Desktop: persistent 220px sidebar. Mobile: slide-out drawer
+
+### URL Structure
+
+```
+/channels/:id/wiki                     → Overview (default landing)
+/channels/:id/wiki/people              → People & Experts
+/channels/:id/wiki/decisions           → Decisions
+/channels/:id/wiki/tech-stack          → Tech Stack
+/channels/:id/wiki/projects            → Projects
+/channels/:id/wiki/activity            → Recent Activity
+/channels/:id/wiki/faq                 → FAQ
+/channels/:id/wiki/glossary            → Glossary
+/channels/:id/wiki/resources           → Resources & Media
+/channels/:id/wiki/topics/:slug        → Topic page (e.g., /topics/authentication)
+/channels/:id/wiki/topics/:slug/:sub   → Sub-page (e.g., /topics/authentication/jwt-migration)
+```
+
+---
+
+## Fixed Pages — Detailed Spec
+
+### Page 1: Overview (landing page)
+
+The entry point for the channel wiki. Provides a high-level summary and navigation into deeper content.
+
+```
+┌──────────────────────────────────────────────────────────────────┐
+│  Wiki Sidebar (220px)  │  #backend-engineering                   │
+│                        │  Overview                               │
+│  1. Overview ←         │─────────────────────────────────────────│
+│  2. Topics (8)         │                                         │
+│     2.1 Authentication │  Slack · 42 members · 3,241 messages    │
+│     2.2 Infrastructure │  Last synced: 2h ago · Wiki: Fresh ●    │
+│     2.3 CI/CD          │                                         │
+│     2.4 API Design     │  The backend engineering channel is     │
+│     ...                │  the primary hub for API design,        │
+│  3. People             │  infrastructure decisions, and          │
+│  4. Decisions          │  deployment workflows...                │
+│  5. Tech Stack         │                                         │
+│  6. Projects           │  [donut chart: topic distribution]      │
+│  7. Activity           │                                         │
+│  8. FAQ                │  ─────────────────────────────────────  │
+│  9. Glossary           │  Key Highlights                         │
+│  10. Resources         │  • 8 active decisions                   │
+│                        │  • 12 team members identified           │
+│                        │  • 5 active projects                    │
+│                        │  • 23 documents & links shared          │
+│                        │                                         │
+│                        │  ─────────────────────────────────────  │
+│                        │  Topic Overview                         │
+│                        │                                         │
+│                        │  [mermaid: topic relationship graph]    │
+│                        │                                         │
+│                        │  ┌──────────┐ ┌──────────┐ ┌────────┐ │
+│                        │  │Auth (23) │ │Infra (15)│ │CI/CD(8)│ │
+│                        │  │JWT, OAuth│ │EKS, TF   │ │GHA     │ │
+│                        │  │→ Read    │ │→ Read    │ │→ Read  │ │
+│                        │  └──────────┘ └──────────┘ └────────┘ │
+│                        │                                         │
+│                        │  ─────────────────────────────────────  │
+│                        │  Recent Changes (last 7 days)           │
+│                        │  • +8 facts, +1 decision, +2 entities  │
+│                        │  → View full activity                   │
+└──────────────────────────────────────────────────────────────────┘
+```
+
+**Content:**
+- Channel name, platform icon, one-line description
+- Metadata row: member count, messages ingested, last synced, wiki freshness badge
+- 2-3 paragraph auto-generated summary (Tier 0)
+- **Donut chart**: Topic distribution by memory count
+- **Key highlights**: Counts of decisions, people, projects, resources
+- **Topic overview**: Mermaid graph of topic relationships + topic cards linking to topic pages
+- **Recent changes summary**: Brief diff linking to Activity page
+
+**Source**: Weaviate Tier 0 (FREE) + MongoDB channel metadata + Neo4j counts
+
+---
+
+### Page 3: People & Experts
+
+```
+┌─────────────────────────────────────────────────────────────────┐
+│  Sidebar  │  People & Experts                                   │
+│           │─────────────────────────────────────────────────────│
+│           │  [bar chart: messages per week by person]            │
+│           │                                                      │
+│           │  Decision Makers                                     │
+│           │  ┌────────────────────────────────────────────────┐ │
+│           │  │ Alice · Auth Lead                               │ │
+│           │  │ Expertise: Auth  API  OAuth                     │ │
+│           │  │ Decisions: JWT migration [1], Rate limiting [4] │ │
+│           │  │ Active 2 days ago                                │ │
+│           │  └────────────────────────────────────────────────┘ │
+│           │                                                      │
+│           │  Active Contributors                                 │
+│           │  ┌────────────────────────────────────────────────┐ │
+│           │  │ Bob · 12 msgs/week                              │ │
+│           │  │ Topics: Infra  CI/CD  Terraform                 │ │
+│           │  │ Active today                                     │ │
+│           │  └────────────────────────────────────────────────┘ │
+│           │                                                      │
+│           │  Subject Experts (3+ topics)                         │
+│           │  ...                                                 │
+│           │                                                      │
+│           │  ─────────────────────────────────────────────────  │
+│           │  Sources                                             │
+│           │  [1] @alice · Mar 20 · View ↗                       │
+└─────────────────────────────────────────────────────────────────┘
+```
+
+**Content:**
+- **Bar chart**: Messages per week by contributor (last 30 days)
+- Grouped by role (auto-detected from Neo4j edges):
+  - **Decision Makers** — people with `DECIDED` edges
+  - **Active Contributors** — people with high `MENTIONED_IN` frequency
+  - **Subject Experts** — people linked to 3+ topics
+- Each person card: name, role, expertise topic chips (link to topic pages), key decisions, last active
+- Bottom citations section
+
+**Source**: Neo4j `Person → MENTIONED_IN, DECIDED, WORKS_ON` edges
+
+---
+
+### Page 4: Decisions
+
+**Content:**
+- **Mermaid flowchart**: Supersede chains (green = active, red = superseded)
+
+```mermaid
+graph TB
+    D1[❌ Use HS256<br/>Feb 28 · Alice] -->|superseded Mar 20| D2[✅ Use RS256<br/>Mar 20 · Alice]
+    D3[❌ Manual deploy<br/>Jan 10 · Bob] -->|superseded Feb 15| D4[✅ ArgoCD GitOps<br/>Feb 15 · Bob]
+    style D1 fill:#fee2e2,stroke:#ef4444
+    style D3 fill:#fee2e2,stroke:#ef4444
+    style D2 fill:#dcfce7,stroke:#22c55e
+    style D4 fill:#dcfce7,stroke:#22c55e
+```
+
+- Filter toggles: Active only | All | By person
+- Vertical timeline (newest first), each entry:
+  - Date, decision title, who decided, status badge (**Active** / **Superseded** / **Pending**)
+  - What it superseded (strikethrough)
+  - Affected topics (link to topic pages) and technologies as chips
+  - Source citation link
+- Bottom citations section
+
+**Source**: Neo4j `Decision` nodes + `SUPERSEDES` edges
+
+---
+
+### Page 5: Tech Stack
+
+**Content:**
+- Table/grid of Technology nodes scoped to this channel
+- Each entry: technology name, category (language/framework/service/tool), who champions it, related decisions, first mentioned, related topic page link
+- Example: `JWT (RS256) — Auth · Championed by Alice · Decided Mar 20 [1] · See: Authentication`
+
+**Source**: Neo4j `Technology` nodes + relationships
+
+---
+
+### Page 6: Projects
+
+**Content:**
+- **Mermaid graph**: Project dependency graph (green = active, yellow = in progress, red = blocked)
+
+```mermaid
+graph TD
+    P1[🟢 JWT Migration<br/>Alice] --> P2[🟡 Redis Upgrade<br/>Bob]
+    P3[🔴 Rate Limiting<br/>Bob] --> P2
+    P2 --> P4[🟢 EKS v1.28<br/>Charlie]
+    style P1 fill:#dcfce7,stroke:#22c55e
+    style P2 fill:#fef9c3,stroke:#eab308
+    style P3 fill:#fee2e2,stroke:#ef4444
+    style P4 fill:#dcfce7,stroke:#22c55e
+```
+
+- Cards per project: name, lead, status (Active/Completed/Blocked), BLOCKED_BY chips, related decisions/people/technologies, link to related topic pages
+
+**Source**: Neo4j `Project` nodes + `BLOCKED_BY, WORKS_ON, USES` edges
+
+---
+
+### Page 7: Recent Activity
+
+**Content:**
+- **Area chart**: 7-day knowledge growth trend (facts/decisions/entities stacked)
+
+```chart
+{
+  "type": "area",
+  "title": "Knowledge growth (last 7 days)",
+  "data": [
+    { "date": "Apr 01", "facts": 5, "decisions": 1, "entities": 0 },
+    { "date": "Apr 02", "facts": 2, "decisions": 0, "entities": 2 },
+    { "date": "Apr 03", "facts": 8, "decisions": 0, "entities": 1 },
+    { "date": "Apr 04", "facts": 4, "decisions": 1, "entities": 0 },
+    { "date": "Apr 05", "facts": 2, "decisions": 0, "entities": 0 },
+    { "date": "Apr 06", "facts": 6, "decisions": 1, "entities": 2 },
+    { "date": "Apr 07", "facts": 3, "decisions": 0, "entities": 1 }
+  ],
+  "xKey": "date",
+  "series": ["facts", "decisions", "entities"],
+  "colors": ["#6366f1", "#f59e0b", "#22c55e"]
+}
+```
+
+- "What changed since last refresh" diff callout at top
+- Grouped by day: new facts (count + highlights), new decisions, new entities, contradictions resolved
+- Each item links to its topic page
+
+**Source**: Weaviate Tier 2 facts filtered by timestamp
+
+---
+
+### Page 8: FAQ
+
+**Content:**
+- 5-10 auto-generated Q&A pairs from common topics
+- Each Q&A pair with source citations
+- Popular questions from the Ask tab promoted here over time
+- Each answer can link to the relevant topic page for deeper reading
+
+**Source**: LLM-generated by `consolidation_agent` from Tier 1 topics + Tier 2 facts
+
+---
+
+### Page 9: Glossary
+
+**Content:**
+- Alphabetical list of channel-specific terms/acronyms
+- Each entry: term, definition (1-2 sentences), who uses it most, first mentioned date, source citation, link to relevant topic page
+- Example: `CQRS — Command Query Responsibility Segregation. Used by @bob. First mentioned Jan 15. See: Infrastructure [4]`
+
+**Source**: LLM extraction during wiki generation from Tier 2 facts + entity metadata
+
+---
+
+### Page 10: Resources & Media
+
+**Content — grouped by type:**
+
+| Type | Display | Source Field |
+|------|---------|-------------|
+| **Documents** (PDF, DOCX) | Sortable table — name, author, date, topics, view link | `source_media_type = "pdf" \| "doc"` |
+| **Images & Diagrams** | Thumbnail grid (3-col desktop, 2 tablet, 1 mobile) with lightbox. AI-generated alt text from Gemini vision | `source_media_type = "image"` |
+| **Links** | Table — title/URL, author, date, related topics | `source_link_urls` |
+| **Videos** | Table — name, duration, author, transcription summary | `source_media_type = "video"` |
+| **Audio** | Table — name, duration, author, transcription summary | `source_media_type = "audio"` |
+
+- Each media item links to the topic page where it was discussed
+- Filter by type, date, topic
+
+**Source**: Weaviate Tier 2 facts filtered by `source_media_urls != [] OR source_link_urls != []`
+
+---
+
+## Agent-Generated Topic Pages
+
+These are the core knowledge pages — one per Tier 1 topic cluster. The `consolidation_agent` decides:
+1. How many topic pages to create (one per Tier 1 cluster)
+2. Whether a topic is large enough to split into sub-pages
+3. What sub-page structure makes sense for each topic
+
+### Topic Page Structure
+
+Every topic page follows this template:
+
+```
+┌─────────────────────────────────────────────────────────────────┐
+│  Sidebar              │  Authentication                         │
+│                       │  23 memories · 3 sub-pages              │
+│  1. Overview          │─────────────────────────────────────────│
+│  2. Topics (8)        │                                         │
+│     2.1 Auth ←        │  Overview                               │
+│       2.1.1 JWT Migr  │  Team discussed JWT with RS256, migrated│
+│       2.1.2 OAuth     │  from sessions in Q3 2024. Key people:  │
+│       2.1.3 Session   │  @alice (lead), @bob (reviewer)...      │
+│     2.2 Infra         │                                         │
+│     ...               │  [mermaid: sub-topic relationship map]  │
+│  3. People            │                                         │
+│  4. Decisions         │  ─────────────────────────────────────  │
+│                       │  Key Facts                               │
+│                       │  • Alice proposed RS256 over HS256 for   │
+│                       │    asymmetric verification [1]           │
+│                       │  • Migration completed March 20 with     │
+│                       │    zero downtime [2]                     │
+│                       │  • Refresh token rotation enabled with   │
+│                       │    7-day expiry [3]                      │
+│                       │  • Session-based auth fully deprecated   │
+│                       │    after Q3 migration [4]                │
+│                       │  • Rate limiting added to auth endpoints │
+│                       │    to prevent brute force [5]            │
+│                       │                                          │
+│                       │  ─────────────────────────────────────  │
+│                       │  Related Decisions                       │
+│                       │  • ✅ Use RS256 — Alice, Mar 20 [1]     │
+│                       │  • ❌ ~~Use HS256~~ — superseded         │
+│                       │  → View all decisions                    │
+│                       │                                          │
+│                       │  ─────────────────────────────────────  │
+│                       │  Related People                          │
+│                       │  @alice (lead) · @bob (reviewer)         │
+│                       │  → View all people                       │
+│                       │                                          │
+│                       │  ─────────────────────────────────────  │
+│                       │  Related Media                           │
+│                       │  📄 JWT-spec-v3.pdf · Alice · Mar 15    │
+│                       │  🔗 auth0.com/docs/jwt · Alice · Mar 18 │
+│                       │  🖼️ [auth-flow-diagram.png thumbnail]   │
+│                       │  → View all resources                    │
+│                       │                                          │
+│                       │  ─────────────────────────────────────  │
+│                       │  Sub-pages                               │
+│                       │  → 2.1.1 JWT Migration (12 memories)    │
+│                       │  → 2.1.2 OAuth Integration (7 memories) │
+│                       │  → 2.1.3 Session Deprecation (4 memor.) │
+│                       │                                          │
+│                       │  ─────────────────────────────────────  │
+│                       │  Sources                                 │
+│                       │  [1] @alice · Mar 20 · View ↗           │
+│                       │  [2] @alice · Mar 20 · View ↗           │
+│                       │  [3] @bob · Mar 18 · View ↗             │
+└─────────────────────────────────────────────────────────────────┘
+```
+
+**Every topic page contains:**
+
+1. **Topic overview** — 1-2 paragraph summary of the topic
+2. **Key facts** — All important facts (not just top 3 — this is a dedicated page, show them all). Sorted by importance/quality score
+3. **Sub-topic relationship diagram** (Mermaid) — if the topic has sub-pages
+4. **Related decisions** — Decision nodes linked to this topic, with status badges. Links to Decisions page
+5. **Related people** — Person nodes active in this topic. Links to People page
+6. **Related media** — Documents, images, links shared in context of this topic. Shows thumbnails inline for images. Links to Resources page
+7. **Sub-page links** — Navigation to sub-pages if the topic was split
+8. **Source citations** — Bottom panel with all citations for this page
+
+### Sub-Page Structure
+
+Sub-pages are the deepest level. They follow a simpler template:
+
+1. **Summary** — 1-2 sentences
+2. **All facts** — Every atomic fact in this sub-topic, with citations
+3. **Related media** — Documents/images/links specific to this sub-topic, rendered inline (image thumbnails, PDF preview links, video embeds)
+4. **Source citations**
+
+### When Does the Agent Create Sub-Pages?
+
+The `consolidation_agent` creates sub-pages when:
+- A Tier 1 cluster has **15+ atomic facts** (too many for one page)
+- The facts naturally group into **2+ distinct sub-themes** (detected by the agent)
+- The sub-themes have **5+ facts each** (worth their own page)
+
+Small topics (< 15 facts) stay as a single page with no sub-pages.
+
+---
+
+## Per-Page Features (Apply to ALL Pages)
+
+### Source Citations
+
+Every page has its own citations section at the bottom:
+```
+Sources
+[1] @alice in #backend · Mar 20 · View ↗
+[2] @alice in #backend · Mar 20 · View ↗
+[3] @bob in #backend · Mar 18 · 📄[JWT-spec.pdf] · View ↗
+```
+
+- Inline `[1]` markers on every factual claim
+- Hover citation → tooltip with message excerpt
+- Click citation → opens original Slack/Teams/Discord message permalink
+- Media-sourced citations show a type badge: 📄 (doc), 🔗 (link), 🖼️ (image), 🎬 (video), 🎙️ (audio)
+
+### Rich Content Rendering
+
+Every wiki page supports these content types:
+
+| Type | Syntax | Library | Used In |
+|------|--------|---------|---------|
+| **Mermaid diagrams** | ` ```mermaid ` code blocks | `mermaid` (client-side) | Topic graphs, Decision flows, Project deps |
+| **Charts** | JSON spec in ` ```chart ` blocks | `recharts` | Overview (donut), People (bar), Activity (area) |
+| **GFM tables** | Standard `\| col \|` tables | `react-markdown` + `remark-gfm` | People, Tech, Decisions, Media |
+| **Lists** | Bullet/numbered markdown | `react-markdown` | Facts, Glossary, FAQ |
+| **Callout boxes** | `> [!NOTE]` / `> [!TIP]` / `> [!WARNING]` | Custom remark plugin | Stale warnings, key insights |
+| **Citation links** | `[1]` inline markers | Custom remark plugin | All pages |
+| **Entity chips** | `@alice`, `#topic`, `$technology` | Custom remark plugin | All pages — clickable, navigate to relevant page |
+| **Media badges** | `📄[filename]`, `🔗[domain]` | Custom remark plugin | Citations with media sources |
+| **Image thumbnails** | `![alt](url){.thumbnail}` | Custom remark plugin | Topic pages, Resources page |
+
+### Media Inline Rendering
+
+When a page references media, it's shown inline — not just as a link:
+
+- **Images**: Rendered as thumbnails in a responsive grid. Click → lightbox with full-size view. AI-generated alt text from Gemini vision description
+- **PDFs**: Preview card with title, page count, and "Open" link
+- **Links**: Rich preview card with title, domain, and favicon (if available)
+- **Videos**: Thumbnail + duration badge + transcription excerpt
+- **Audio**: Play indicator + duration + transcription excerpt
+
+### Freshness Badge
+
+Every page shows when it was last generated:
+- "Generated 2h ago" (green ● ) — fresh
+- "Stale — new data available" (amber ● ) — `wiki_dirty` is true
+- "Refresh Wiki" button on Overview page triggers full regeneration
+
+### Entity Chips as Cross-Links
+
+`@alice` → navigates to People page, scrolls to Alice's card
+`#authentication` → navigates to Authentication topic page
+`$JWT` → navigates to Tech Stack page, highlights JWT entry
+`Decision: Use RS256` → navigates to Decisions page, highlights that entry
+
+---
+
+## Rendering Stack
+
+### Markdown Renderer
+
+```
+WikiMarkdown.tsx (enhanced react-markdown)
+  ├── remark-gfm          → tables, strikethrough, task lists
+  ├── MermaidBlock.tsx     → ```mermaid → SVG diagram
+  ├── ChartBlock.tsx       → ```chart → recharts component (bar/area/donut)
+  ├── CalloutBox.tsx       → > [!NOTE] → styled card
+  ├── EntityChip.tsx       → @person #topic $tech → clickable chip with navigation
+  ├── CitationLink.tsx     → [1] → hover preview + click permalink
+  ├── MediaBadge.tsx       → 📄[file] → inline media type indicator
+  └── MediaEmbed.tsx       → image thumbnails, PDF preview cards, video thumbnails
+```
+
+### Frontend Tech Stack
+
+- `react-markdown` + `remark-gfm` — base markdown rendering (tables, lists, strikethrough)
+- `mermaid` — client-side diagram rendering
+- `recharts` — chart rendering from `chart` code blocks
+- Custom remark plugins — citations, entity chips, callouts, chart blocks, media embeds
+
+---
+
+## Backend Generation Flow
+
+```
+wiki_builder.py
+
+  Phase 1: Gather data
+  ─────────────────────
+  1.  Read Tier 0 summary (FREE)                    → Overview page data
+  2.  Read Tier 1 clusters (FREE)                    → Topic page list
+  3.  For each cluster: fetch ALL Tier 2 facts       → Topic page content
+  4.  Query Neo4j: Person nodes + edges              → People page data
+  5.  Query Neo4j: Decision nodes + SUPERSEDES       → Decisions page data
+  6.  Query Neo4j: Technology nodes + edges           → Tech Stack page data
+  7.  Query Neo4j: Project nodes + BLOCKED_BY        → Projects page data
+  8.  Fetch recent 7-day Tier 2 facts                → Activity page data
+  9.  Query Weaviate: facts with media/link fields   → Resources page data
+
+  Phase 2: Agent structures topic pages
+  ─────────────────────────────────────
+  10. consolidation_agent (ADK LoopAgent) receives all data and:
+      a. Determines topic page structure (which topics get sub-pages)
+      b. Generates topic page content (overview, key facts, related sections)
+      c. Generates sub-page content where needed
+      d. Generates Mermaid diagrams from graph data
+      e. Generates chart JSON specs from aggregated stats
+      f. Generates FAQ Q&A pairs from common patterns
+      g. Generates Glossary terms from jargon scan
+      h. Builds citation index for every page
+      i. Adds media badges to media-sourced citations
+
+  Phase 3: Assemble and cache
+  ────────────────────────────
+  11. Assemble WikiResponse with all pages
+  12. Write to MongoDB wiki_cache
+  13. Clear wiki_dirty flag
+```
+
+**Estimated cost**: ~$0.03-0.08 per wiki generation (slightly higher than single-page due to richer topic pages)
+
+**`wiki_dirty` flag** set when:
+- Consolidation assigns new facts to clusters after a sync
+- Entity extraction writes new Person/Decision/Technology nodes to Neo4j
+- The contradiction detector supersedes an existing fact
+- A manual reconsolidation is triggered via `refresh_wiki`
+
+---
+
+## API Endpoints
+
+| Method | Path | Purpose |
+|--------|------|---------|
+| `GET` | `/api/channels/:id/wiki` | Wiki structure + Overview page content |
+| `GET` | `/api/channels/:id/wiki/pages/:page_id` | Specific page content |
+| `POST` | `/api/channels/:id/wiki/refresh` | Force wiki regeneration |
+| `GET` | `/api/channels/:id/wiki/structure` | Sidebar navigation tree (lightweight) |
+
+### API Design Rationale
+
+Unlike the single-page design where one endpoint returned everything, the pageable wiki uses **lazy page loading**:
+
+1. `GET /wiki` returns the sidebar structure + Overview page (the landing page)
+2. Navigating to another page calls `GET /wiki/pages/:page_id` — loads only that page's content
+3. `GET /wiki/structure` returns just the sidebar tree (used for navigation without loading page content)
+
+This means large channel wikis don't load 50 pages of content upfront — only the page you're viewing.
+
+---
+
+## Response Schema
+
+### Python (Backend)
+
+```python
+class WikiPage(BaseModel):
+    id: str                          # "overview", "people", "decisions", "topic-authentication",
+                                     # "topic-authentication--jwt-migration" (sub-page)
+    slug: str                        # URL-safe: "authentication", "jwt-migration"
+    title: str                       # "Authentication"
+    page_type: str                   # "fixed" | "topic" | "sub-topic"
+    parent_id: str | None            # None for top-level, parent page ID for sub-pages
+    section_number: str              # "1", "2.1", "2.1.1"
+    content: str                     # Enhanced Markdown (mermaid/chart/callout/media blocks)
+    summary: str                     # 1-2 sentence summary for sidebar tooltip and cards
+    memory_count: int                # Number of facts on this page
+    last_updated: datetime
+    citations: list[Citation]
+    children: list[WikiPageRef]      # Sub-page references (id, title, slug, memory_count)
+
+class WikiPageRef(BaseModel):
+    id: str
+    title: str
+    slug: str
+    section_number: str
+    memory_count: int
+
+class WikiStructure(BaseModel):
+    """Sidebar navigation tree — lightweight, no page content."""
+    channel_id: str
+    channel_name: str
+    platform: str
+    generated_at: datetime
+    is_stale: bool
+    pages: list[WikiPageNode]
+
+class WikiPageNode(BaseModel):
+    id: str
+    title: str
+    slug: str
+    section_number: str
+    page_type: str                   # "fixed" | "topic" | "sub-topic"
+    memory_count: int
+    children: list[WikiPageNode]     # Recursive for sub-pages
+
+class WikiResponse(BaseModel):
+    """Full response from GET /wiki — structure + overview page."""
+    channel_id: str
+    channel_name: str
+    platform: str
+    generated_at: datetime
+    is_stale: bool
+    structure: WikiStructure         # Sidebar tree
+    overview: WikiPage               # Overview page content (landing)
+    metadata: WikiMetadata
+
+class WikiMetadata(BaseModel):
+    member_count: int
+    message_count: int
+    memory_count: int
+    entity_count: int
+    media_count: int
+    page_count: int                  # Total wiki pages
+    generation_cost_usd: float
+    generation_duration_ms: int
+
+class Citation(BaseModel):
+    id: str                          # "[1]"
+    author: str
+    channel: str
+    timestamp: datetime
+    text_excerpt: str                # First 100 chars of original message
+    permalink: str                   # Slack/Teams/Discord message URL
+    media_type: str | None           # "pdf", "image", "link", "video", "audio", None
+    media_name: str | None           # Filename or domain for media-sourced citations
+```
+
+### TypeScript (Frontend)
+
+```typescript
+interface WikiResponse {
+  channel_id: string;
+  channel_name: string;
+  platform: "slack" | "teams" | "discord";
+  generated_at: string;
+  is_stale: boolean;
+  structure: WikiStructure;
+  overview: WikiPage;
+  metadata: WikiMetadata;
+}
+
+interface WikiStructure {
+  channel_id: string;
+  channel_name: string;
+  platform: string;
+  generated_at: string;
+  is_stale: boolean;
+  pages: WikiPageNode[];
+}
+
+interface WikiPageNode {
+  id: string;
+  title: string;
+  slug: string;
+  section_number: string;
+  page_type: "fixed" | "topic" | "sub-topic";
+  memory_count: number;
+  children: WikiPageNode[];
+}
+
+interface WikiPage {
+  id: string;
+  slug: string;
+  title: string;
+  page_type: "fixed" | "topic" | "sub-topic";
+  parent_id: string | null;
+  section_number: string;
+  content: string;                   // Enhanced Markdown
+  summary: string;
+  memory_count: number;
+  last_updated: string;
+  citations: WikiCitation[];
+  children: WikiPageRef[];
+}
+
+interface WikiPageRef {
+  id: string;
+  title: string;
+  slug: string;
+  section_number: string;
+  memory_count: number;
+}
+
+interface WikiMetadata {
+  member_count: number;
+  message_count: number;
+  memory_count: number;
+  entity_count: number;
+  media_count: number;
+  page_count: number;
+  generation_cost_usd: number;
+  generation_duration_ms: number;
+}
+
+interface WikiCitation {
+  id: string;
+  author: string;
+  channel: string;
+  timestamp: string;
+  text_excerpt: string;
+  permalink: string;
+  media_type?: "pdf" | "image" | "link" | "video" | "audio";
+  media_name?: string;
+}
+```
+
+---
+
+## Component Architecture
+
+```
+web/src/components/wiki/
+  │
+  │  ── Layout ──
+  ├── WikiLayout.tsx          # Two-column: sidebar + page content area
+  ├── WikiSidebar.tsx         # Navigation tree: numbered pages, collapsible topics, active highlight
+  ├── WikiBreadcrumb.tsx      # Breadcrumb: Wiki > Topics > Authentication > JWT Migration
+  ├── FreshnessBadge.tsx      # Wiki staleness indicator + refresh button
+  │
+  │  ── Fixed Pages ──
+  ├── OverviewPage.tsx        # Landing: summary, stats, topic cards, highlights, recent changes
+  ├── PeoplePage.tsx          # Grouped by role + bar chart + person cards
+  ├── DecisionsPage.tsx       # Mermaid flow + timeline + filters
+  ├── TechStackPage.tsx       # Technology grid
+  ├── ProjectsPage.tsx        # Project cards + mermaid dependency graph
+  ├── ActivityPage.tsx        # Area chart + daily grouped activity
+  ├── FAQPage.tsx             # Q&A pairs with citations
+  ├── GlossaryPage.tsx        # Alphabetical term list
+  ├── ResourcesPage.tsx       # Media grouped by type (docs, images, links, videos, audio)
+  │
+  │  ── Agent-Generated Pages ──
+  ├── TopicPage.tsx           # Topic page: overview, facts, related items, sub-page links
+  ├── SubTopicPage.tsx        # Sub-page: summary, all facts, media
+  │
+  │  ── Shared Components ──
+  ├── PersonCard.tsx          # Person card with role, expertise chips, decisions
+  ├── DecisionEntry.tsx       # Timeline entry with status badge + supersedes
+  ├── TopicCard.tsx           # Topic overview card (used on Overview page)
+  ├── ProjectCard.tsx         # Project card with blockers
+  ├── CitationPanel.tsx       # Bottom citations section for any page
+  ├── MediaThumbnail.tsx      # Image thumbnail with lightbox + alt text
+  ├── MediaPreviewCard.tsx    # PDF/video/audio preview card with metadata
+  ├── MediaBadge.tsx          # Inline media type badge (📄🔗🖼️🎬🎙️)
+  │
+  │  ── Markdown Rendering ──
+  ├── WikiMarkdown.tsx        # Enhanced markdown renderer (all plugins)
+  ├── MermaidBlock.tsx        # ```mermaid → SVG diagram
+  ├── ChartBlock.tsx          # ```chart → recharts (bar/area/donut)
+  ├── CalloutBox.tsx          # > [!NOTE] → styled card
+  ├── EntityChip.tsx          # @person #topic $tech → clickable navigation chip
+  ├── CitationLink.tsx        # [1] → hover preview + click permalink
+  └── MediaEmbed.tsx          # Inline image/PDF/video rendering
+```
+
+### Hooks
+
+```
+web/src/hooks/
+  ├── useWiki.ts              # GET /wiki → structure + overview, cache with TanStack Query
+  ├── useWikiPage.ts          # GET /wiki/pages/:id → single page content, cache per page
+  ├── useWikiStructure.ts     # GET /wiki/structure → sidebar tree (lightweight)
+  └── useWikiRefresh.ts       # POST /wiki/refresh → trigger regeneration + poll until done
+```
+
+---
+
+## Cost Breakdown
+
+| Data | Source | Cost |
+|------|--------|------|
+| Overview summary | Weaviate Tier 0 | FREE (cached) |
+| Topic list | Weaviate Tier 1 clusters | FREE (cached) |
+| Topic facts (all) | Weaviate Tier 2 per cluster | ~$0.002 |
+| People | Neo4j Person queries | ~$0.001 |
+| Decisions | Neo4j Decision + SUPERSEDES | ~$0.001 |
+| Tech Stack | Neo4j Technology queries | ~$0.001 |
+| Projects | Neo4j Project queries | ~$0.001 |
+| Recent activity | Weaviate Tier 2 (7 days) | ~$0.001 |
+| Media facts | Weaviate Tier 2 (media filter) | ~$0.001 |
+| LLM synthesis | `consolidation_agent` — topic structure, FAQ, glossary, diagrams | ~$0.02-0.06 |
+
+**Total per wiki generation**: ~$0.03-0.08
+
+---
+
+## Phasing
+
+| Phase | What Ships |
+|-------|-----------|
+| **Phase 1 (MVP)** | Overview page, Topic pages (flat — no sub-pages yet), People, Decisions, Activity, Citations, Sidebar navigation |
+| **Phase 1.5** | Sub-page splitting for large topics, Tech Stack, Projects, FAQ, Resources & Media |
+| **Phase 2** | Glossary, Knowledge Gaps (unanswered Ask questions → "Needs Article"), Cross-Channel References |
+
+---
+
+## Design Tokens
+
+| Element | Color |
+|---------|-------|
+| Primary text | `slate-900` / `slate-50` (dark mode) |
+| Accent (links, active sidebar) | `indigo-600` |
+| Topic chips | `indigo-100` text on `indigo-50` bg |
+| Person: Decision Maker | `blue-500` |
+| Person: Contributor | `green-500` |
+| Person: Expert | `purple-500` |
+| Decision: Active | `emerald-500` |
+| Decision: Superseded | `slate-400` + strikethrough |
+| Decision: Pending | `amber-500` |
+| Project: Active | `emerald-500` |
+| Project: In Progress | `amber-500` |
+| Project: Blocked | `red-500` |
+| Citations | `indigo-600` |
+| Fresh badge | `emerald-500` |
+| Stale badge | `amber-500` |
+| Callout NOTE | `blue-50` bg, `blue-600` border |
+| Callout TIP | `emerald-50` bg, `emerald-600` border |
+| Callout WARNING | `amber-50` bg, `amber-600` border |
+| Sidebar active page | `indigo-50` bg, `indigo-600` left border |
+| Sidebar hover | `slate-50` bg |
+| Typography: UI | Inter |
+| Typography: code | JetBrains Mono |
+
+---
+
+## Key Files to Create/Modify
+
+| File | Purpose |
+|------|---------|
+| `src/beever_atlas/services/wiki_builder.py` | **NEW** — Backend generation flow (phases 1-3) |
+| `src/beever_atlas/services/wiki_cache.py` | **NEW** — MongoDB wiki cache (per-page storage + structure) |
+| `src/beever_atlas/stores/weaviate_store.py` | **MODIFY** — Add `fetch_all_facts_per_cluster()`, `fetch_media_facts()` |
+| `src/beever_atlas/stores/neo4j_store.py` | **MODIFY** — Add Technology/Project queries |
+| `src/beever_atlas/server/app.py` | **MODIFY** — Add wiki API routes (structure, pages) |
+| `src/beever_atlas/models/domain.py` | **MODIFY** — Add WikiResponse, WikiPage, WikiStructure, Citation models |
+| `web/src/components/wiki/*` | **NEW** — All wiki components (layout, pages, shared, markdown) |
+| `web/src/hooks/useWiki*.ts` | **NEW** — Wiki data hooks (structure, page, refresh) |
+| `web/src/lib/types.ts` | **MODIFY** — Add TypeScript wiki types |
+| `web/src/pages/wiki/*` | **NEW** — Wiki route pages |
diff --git a/docs/v2/07-deployment.md b/docs/v2/07-deployment.md
new file mode 100644
index 00000000..87a46d0b
--- /dev/null
+++ b/docs/v2/07-deployment.md
@@ -0,0 +1,247 @@
+# Deployment & Module Structure
+
+## Docker Compose
+
+```yaml
+# docker-compose.yml (v2)
+services:
+  beever-atlas:          # Python/FastAPI (MCP + REST)
+    build: .
+    ports: ["8000:8000"]
+    depends_on: [weaviate, neo4j, mongodb]
+
+  web:                   # React frontend
+    build: ./web
+    ports: ["3000:80"]
+
+  weaviate:              # Semantic memory
+    image: cr.weaviate.io/semitechnologies/weaviate:1.28.0
+    ports: ["8080:8080", "50051:50051"]
+    volumes: [weaviate_data:/var/lib/weaviate]
+
+  neo4j:                 # Graph memory
+    image: neo4j:5.26-community
+    ports: ["7474:7474", "7687:7687"]
+    environment:
+      NEO4J_AUTH: neo4j/beever_atlas_dev
+      NEO4J_PLUGINS: '["apoc"]'
+    volumes: [neo4j_data:/data]
+
+  mongodb:               # State + cache
+    image: mongo:7.0
+    ports: ["27017:27017"]
+    volumes: [mongo_data:/data/db]
+
+  redis:                   # Chat SDK session state
+    image: redis:7-alpine
+    ports: ["6379:6379"]
+
+  bot:                     # Chat SDK bot (TypeScript)
+    build: ./bot
+    depends_on: [redis, beever-atlas]
+    environment:
+      BEEVER_API_URL: http://beever-atlas:8000
+      REDIS_URL: redis://redis:6379
+
+volumes:
+  weaviate_data:
+  neo4j_data:
+  mongo_data:
+```
+
+---
+
+## MCP Tool Specification
+
+**Design decision:** Graph queries are abstracted behind `ask_questions`. The smart router decides when to use Neo4j — users don't need to know about the dual-memory architecture.
+
+### 7 Tools
+
+```python
+@tool("ask_questions")
+async def ask_questions(
+    question: str,           # Natural language query
+    channel_id: str = None,  # Target channel (None = cross-channel search, ACL-filtered)
+    include_citations: bool = True,
+    max_results: int = 10,
+) -> AskResponse:
+    """Ask a question about channel knowledge. Routes automatically
+    to semantic search, graph traversal, or both based on query type.
+    Cost: $0.001-$0.006 depending on route."""
+
+@tool("search_memories")
+async def search_memories(
+    query: str,              # Search query
+    channel_id: str,
+    tier: str = "all",       # "all" | "summary" | "topic" | "atomic"
+    limit: int = 15,
+    include_images: bool = False,
+) -> SearchResponse:
+    """Direct hybrid search — bypasses router for power users.
+    Cost: ~$0.001"""
+
+@tool("get_wiki")
+async def get_wiki(
+    channel_id: str,
+    section: str = "all",    # "all"|"overview"|"topics"|"people"|"decisions"|"recent"
+) -> WikiResponse:
+    """Read cached wiki content. FREE for cached sections.
+    Returns stale data if wiki is dirty — use refresh_wiki to force update."""
+
+@tool("get_topics")
+async def get_topics(
+    channel_id: str,
+) -> TopicsResponse:
+    """List topic clusters for a channel. FREE (cached Tier 1)."""
+
+@tool("sync_channel")
+async def sync_channel(
+    channel_id: str,
+    max_messages: int = 5000,  # Safety limit to prevent cost explosion
+    since: str = None,         # ISO timestamp, defaults to last sync point
+) -> SyncResponse:
+    """Trigger ingestion for a channel. Runs in background.
+    Cost: ~$0.0025/message (text), ~$0.008/message (with media)."""
+
+@tool("get_sync_status")
+async def get_sync_status(
+    channel_id: str = None,    # None = all channels
+) -> SyncStatusResponse:
+    """Check sync progress and health status. FREE."""
+
+@tool("refresh_wiki")
+async def refresh_wiki(
+    channel_id: str,
+) -> RefreshResponse:
+    """Force wiki regeneration. Triggers full reconsolidation.
+    Cost: ~$0.01 for LLM synthesis."""
+```
+
+---
+
+## MCP Resources
+
+Read-only, URI-based access to wiki content:
+
+```python
+@resource("wiki://{channel_id}")           # Full wiki markdown
+@resource("wiki://{channel_id}/overview")  # Tier 0 summary only
+@resource("wiki://{channel_id}/topics")    # Tier 1 cluster list
+```
+
+---
+
+## Response Schemas
+
+```python
+class AskResponse:
+    answer: str                    # Grounded response with inline citations
+    citations: list[Citation]      # Source facts with platform permalinks
+    route_used: str                # "semantic" | "graph" | "both"
+    confidence: float              # 0.0-1.0
+    degraded: bool                 # True if a component was unavailable
+    cost_usd: float                # Estimated cost of this query
+
+class Citation:
+    text: str                      # Original fact text
+    channel: str                   # Source channel name
+    user: str                      # Who said it
+    timestamp: str                 # When it was said
+    permalink: str                 # Platform message URL
+    tier: str                      # "atomic" | "topic" | "summary"
+
+class SyncResponse:
+    status: str                    # "started" | "already_running" | "queued"
+    channel_id: str
+    estimated_messages: int        # Approximate message count to process
+    job_id: str                    # For tracking via get_sync_status
+
+class WikiResponse:
+    content: str                   # Markdown wiki content
+    generated_at: str              # When this version was generated
+    is_stale: bool                 # True if wiki_dirty flag is set
+    channel_id: str
+```
+
+---
+
+## Module Structure
+
+```
+src/beever_atlas/
+├── agents/                      # ADK agent definitions (see 13-adk-integration.md)
+│   ├── ingestion/               # 6-stage ingestion pipeline
+│   │   ├── pipeline.py          # create_ingestion_pipeline() SequentialAgent factory
+│   │   ├── preprocessor.py      # Stage 1 — mrkdwn, threads, media
+│   │   ├── fact_extractor.py    # Stage 2 (parallel) — fact extraction
+│   │   ├── entity_extractor.py  # Stage 2 (parallel) — entity + relation extraction
+│   │   ├── embedder.py          # Stage 3 (parallel) — Jina v4 embeddings
+│   │   ├── cross_batch_validator.py  # Stage 3 (parallel) — alias resolution
+│   │   ├── persister.py         # Stage 4 — outbox write to Weaviate + Neo4j + MongoDB
+│   │   ├── contradiction_detector.py
+│   │   └── coreference_resolver.py
+│   ├── consolidation/           # Consolidation agents
+│   │   └── summarizer.py        # LlmAgent for cluster/channel summaries
+│   ├── media/                   # Media processing agents
+│   │   └── document_digester.py # PDF + image processing
+│   ├── query/                   # Q&A routing agents (in development)
+│   │   └── echo.py
+│   ├── tools.py                 # ADK FunctionTool wrappers for store operations
+│   └── runner.py                # ADK Runner initialization
+│
+├── adapters/                    # Multi-platform ingestion adapters
+│   ├── base.py                  # NormalizedMessage, BaseAdapter
+│   ├── slack_adapter.py         # slack-sdk
+│   ├── teams_adapter.py         # Microsoft Graph API
+│   └── discord_adapter.py       # discord.py
+│
+├── api/                         # FastAPI route handlers (REST API)
+│   ├── ask.py                   # Streaming Q&A (SSE)
+│   ├── channels.py              # Channel listing + history
+│   ├── connections.py           # Platform connection CRUD
+│   ├── graph.py                 # Entity + relationship endpoints
+│   ├── memories.py              # Fact search + listing
+│   ├── search.py                # Cross-channel search
+│   ├── stats.py                 # Aggregate stats
+│   ├── sync.py                  # Sync trigger + status
+│   ├── topics.py                # Topic cluster endpoints
+│   └── wiki.py                  # Wiki retrieval + refresh
+│
+├── services/                    # Core business logic
+│   ├── batch_pipeline.py        # Gemini Batch API orchestrator
+│   ├── batch_processor.py       # Per-batch message processor
+│   ├── consolidation.py         # Topic clustering + channel summaries
+│   ├── media_processor.py       # Image/PDF/video processing
+│   ├── reconciler.py            # Retry incomplete cross-store writes
+│   ├── scheduler.py             # Background sync scheduling
+│   └── sync_runner.py           # Sync job coordinator
+│
+├── stores/                      # Data store clients
+│   ├── weaviate_store.py        # Semantic memory (3-tier)
+│   ├── neo4j_store.py           # Graph memory (flexible)
+│   ├── nebula_store.py          # Graph memory (NebulaGraph alternative)
+│   ├── mongodb_store.py         # State + wiki cache
+│   ├── entity_registry.py       # Canonical names + alias resolution
+│   ├── graph_protocol.py        # Shared graph store protocol
+│   └── null_graph.py            # No-op graph store (mock/dev mode)
+│
+├── retrieval/                   # Query retrieval layer (in development)
+│   └── __init__.py              # Planned for Q&A agent phase
+│
+├── wiki/                        # Wiki generation
+│   ├── builder.py               # Orchestrates full wiki build (WikiBuilder)
+│   ├── compiler.py              # LLM page generation (WikiCompiler)
+│   └── cache.py                 # MongoDB wiki cache (WikiCache)
+│
+├── server/                      # Server entry point
+│   └── app.py                   # FastAPI app, lifespan, CORS, router registration
+│
+├── llm/                         # LLM provider abstraction
+│   └── provider.py              # LLMProvider — resolves models from env vars
+│
+├── models/                      # Pydantic domain models
+│
+└── infra/                       # Cross-cutting infrastructure
+    ├── health_registry.py        # Circuit breakers per dependency
+    └── telemetry.py              # OpenTelemetry traces + metrics
+```
diff --git a/docs/v2/08-resilience.md b/docs/v2/08-resilience.md
new file mode 100644
index 00000000..6d40b988
--- /dev/null
+++ b/docs/v2/08-resilience.md
@@ -0,0 +1,204 @@
+# Resilience & Degradation Design
+
+The v2 architecture depends on 6 external services: Weaviate, Neo4j, MongoDB, Gemini, Jina, and Tavily. Any component failure must degrade gracefully — not cause total system failure.
+
+---
+
+## 12.1 Dependency Health Registry
+
+Each external dependency gets a circuit breaker with three states: `CLOSED` (healthy), `OPEN` (failing, requests blocked), and `HALF_OPEN` (probing for recovery).
+
+```python
+class DependencyHealth:
+    """Circuit breaker per external dependency (CLOSED → OPEN → HALF_OPEN)."""
+
+    DEPENDENCIES = {
+        "weaviate":  {"critical": True,  "timeout_s": 5},
+        "neo4j":     {"critical": False, "timeout_s": 5},
+        "mongodb":   {"critical": True,  "timeout_s": 5},
+        "gemini":    {"critical": True,  "timeout_s": 10},
+        "jina":      {"critical": False, "timeout_s": 10},
+        "tavily":    {"critical": False, "timeout_s": 5},
+        "redis":     {"critical": False, "timeout_s": 2},   # Chat SDK state
+    }
+
+    async def check(self, name: str) -> bool:
+        """Returns True if dependency is available."""
+        if self.states[name] == CircuitState.OPEN:
+            if time_since_open > RECOVERY_WINDOW:  # e.g., 30s
+                self.states[name] = CircuitState.HALF_OPEN
+                return True  # Probe with one request
+            return False
+        return True
+
+    def record_failure(self, name: str):
+        """After 3 consecutive failures, open the circuit."""
+        self.failure_counts[name] += 1
+        if self.failure_counts[name] >= 3:
+            self.states[name] = CircuitState.OPEN
+            logger.error(f"Circuit OPEN for {name}")
+
+    def record_success(self, name: str):
+        """Reset failure count, close circuit if half-open."""
+        self.failure_counts[name] = 0
+        if self.states[name] == CircuitState.HALF_OPEN:
+            self.states[name] = CircuitState.CLOSED
+```
+
+Implemented in `src/beever_atlas/infra/health_registry.py`.
+
+---
+
+## 12.2 Degradation Matrix
+
+| Component Down | Ingestion Impact | Retrieval Impact | Behavior |
+|----------------|-----------------|------------------|----------|
+| **Neo4j** | Stage 3 skipped; facts stored in Weaviate only; entities queued for backfill | `route=graph` → reclassify as `route=semantic` | Wiki People/Decisions show "temporarily unavailable" |
+| **Gemini** | Messages queued in dead letter queue | ADK agents fall back to Claude models via LiteLLM; if all LLMs fail, return cached wiki only | Alert fired; retry on recovery |
+| **Redis** | No impact (batch ingestion unaffected) | No impact (MCP queries unaffected) | Chat SDK bot offline; users see "bot unavailable" in Slack/Teams/Discord |
+| **Jina** | Embeddings queued; facts stored text-only in Weaviate | Existing embeddings work; new facts use BM25-only | Backfill embeddings when Jina recovers |
+| **Tavily** | No impact | Silently drop external sub-queries; return internal-only results | User sees "external search unavailable" note |
+| **Weaviate** | Full ingestion paused (queue in MongoDB) | Return cached wiki; graph-only for relational queries | Critical alert — system severely degraded |
+| **MongoDB** | Full system paused | Read-only from Weaviate/Neo4j if cached connections survive | Critical alert — system offline |
+
+---
+
+## 12.3 LLM Fallback via ADK + LiteLLM
+
+All LLM calls are handled by [Google ADK](https://google.github.io/adk-docs/) agents. Model fallback is configured via ADK's native [LiteLLM](https://docs.litellm.ai/) integration rather than a custom provider class. Circuit breakers (Section 12.1) still apply at the dependency health level.
+
+Each ADK agent is configured with a primary model. When the primary is unavailable (timeout, rate limit, circuit open), LiteLLM transparently routes to the fallback model:
+
+| Agent Tier | Primary | Fallback (via LiteLLM) |
+|-----------|---------|------------------------|
+| Fast (routing, extraction, classification) | `gemini-2.0-flash-lite` | `anthropic/claude-haiku-4-5` |
+| Quality (response generation, wiki synthesis) | `gemini-2.0-flash` | `anthropic/claude-sonnet-4-6` |
+
+### Fallback Chain Per ADK Agent
+
+Each call site below corresponds to a specific ADK agent. The primary/fallback models are configured on the agent definition, with LiteLLM handling the failover transparently.
+
+| ADK Agent | Primary | Fallback | Last Resort |
+|-----------|---------|----------|-------------|
+| `query_router_agent` | Gemini Flash Lite | Claude Haiku | Regex fast-path classifier |
+| `fact_extractor_agent` (Stage 2) | Gemini Flash Lite | Claude Haiku | Dead letter queue |
+| `entity_extractor_agent` (Stage 3) | Gemini Flash Lite | Claude Haiku | Skip (Weaviate-only) |
+| Classification (Stage 4) | Gemini Flash Lite | Rule-based tagger | Skip (no tags) |
+| `response_agent` | Gemini Flash | Claude Sonnet | Return raw results |
+| `consolidation_agent` (Wiki) | Gemini Flash Lite | Claude Haiku | Serve stale cache |
+
+Model configuration is defined in the ADK agent declarations. See `src/beever_atlas/agents/` and [`13-adk-integration.md`](13-adk-integration.md).
+
+---
+
+## 12.4 Ingestion Pipeline Resilience
+
+Each pipeline stage is independently skippable. If a non-critical stage fails, the pipeline continues:
+
+```python
+async def ingest_message(self, msg: NormalizedMessage):
+    # Stage 1: Preprocess (required)
+    preprocessed = await self.preprocessor.process(msg)
+
+    # Stage 2a: Extract facts (required — queue to DLQ on failure)
+    try:
+        facts = await self.extractor.extract(preprocessed)
+    except LLMUnavailableError:
+        await self.dead_letter_queue.enqueue(msg)
+        return
+
+    # Stage 2b: Entity extraction (optional — skip if Neo4j/LLM down)
+    entities = []
+    if await self.health.check("neo4j") and await self.health.check("gemini"):
+        try:
+            entities = await self.entity_extractor.extract(preprocessed, facts)
+        except Exception as e:
+            logger.warning(f"Entity extraction failed, continuing: {e}")
+            await self.backfill_queue.enqueue("entities", msg.id, preprocessed)
+
+    # Stage 3: Embed (optional — queue if Jina down)
+    embeddings = None
+    if await self.health.check("jina"):
+        embeddings = await self.embedder.embed(facts)
+    else:
+        await self.backfill_queue.enqueue("embeddings", msg.id, facts)
+
+    # Stage 4: Persist via outbox pattern
+    await self.persister.persist(facts, entities, embeddings)
+```
+
+---
+
+## 12.5 Write Safety — Outbox Pattern
+
+Stage 7 uses a MongoDB outbox pattern for cross-store write safety. Writes are committed as a single intent document first, then fanned out to each store independently and idempotently.
+
+```python
+class OutboxPersister:
+    """Two-phase persist: commit intent to MongoDB first, then fan out."""
+
+    async def persist(self, facts, entities, embeddings, tags) -> str:
+        # PHASE 1: Write intent (single MongoDB transaction)
+        intent = WriteIntent(
+            id=deterministic_uuid(facts),
+            facts=facts, entities=entities,
+            embeddings=embeddings, tags=tags,
+            status={"weaviate": "pending",
+                    "neo4j": "pending" if entities else "skipped",
+                    "state": "pending"},
+            retry_count=0,
+        )
+        await self.mongo.write_intents.insert_one(intent.dict())
+
+        # PHASE 2: Fan out (idempotent, independently retryable)
+        await self._fan_out(intent)
+        return intent.id
+
+    async def _fan_out(self, intent: WriteIntent):
+        # Weaviate — idempotent via deterministic UUID
+        if intent.status["weaviate"] == "pending":
+            try:
+                await self.weaviate.upsert(intent.facts, intent.embeddings)
+                await self._mark(intent.id, "weaviate", "done")
+            except Exception:
+                await self._mark(intent.id, "weaviate", "failed")
+
+        # Neo4j — idempotent via MERGE semantics
+        if intent.status["neo4j"] == "pending":
+            try:
+                for entity in intent.entities:
+                    await self.neo4j.upsert_entity(entity)
+                await self._mark(intent.id, "neo4j", "done")
+            except Exception:
+                await self._mark(intent.id, "neo4j", "failed")
+
+        # MongoDB sync state — final step
+        await self._update_sync_state(intent)
+        await self._mark(intent.id, "state", "done")
+```
+
+### Background Write Reconciler
+
+Runs every 15 minutes to retry any incomplete cross-store writes:
+
+```python
+class WriteReconciler:
+    """Retry incomplete cross-store writes."""
+
+    async def reconcile(self):
+        stale = await self.mongo.write_intents.find({
+            "$or": [
+                {"status.weaviate": {"$in": ["pending", "failed"]}},
+                {"status.neo4j": {"$in": ["pending", "failed"]}},
+            ],
+            "created_at": {"$lt": now() - timedelta(minutes=5)},
+            "retry_count": {"$lt": 5},
+        }).to_list()
+
+        for intent in stale:
+            await self.persister._fan_out(WriteIntent(**intent))
+            await self.mongo.write_intents.update_one(
+                {"id": intent["id"]}, {"$inc": {"retry_count": 1}})
+```
+
+Implemented in `src/beever_atlas/services/reconciler.py`. Outbox intent documents are persisted via `services/batch_processor.py` and `agents/ingestion/persister.py`.
diff --git a/docs/v2/09-observability.md b/docs/v2/09-observability.md
new file mode 100644
index 00000000..f7576e1b
--- /dev/null
+++ b/docs/v2/09-observability.md
@@ -0,0 +1,125 @@
+# Observability & Operations
+
+---
+
+## Health Endpoints
+
+```python
+@app.get("/health")
+async def health_check():
+    checks = await asyncio.gather(
+        check_weaviate(),   # .is_ready()
+        check_neo4j(),      # driver.verify_connectivity()
+        check_mongodb(),    # ping
+        check_gemini(),     # list_models() with 5s timeout
+        check_jina(),       # embed test vector with 5s timeout
+        check_redis(),      # PING (Chat SDK session state)
+    )
+    status = "healthy" if all(c.ok for c in checks) else \
+             "degraded" if any(c.ok for c in checks if c.critical) else \
+             "unhealthy"
+    return {"status": status,
+            "components": {c.name: c.dict() for c in checks}}
+```
+
+Returns `"healthy"` when all components pass, `"degraded"` when at least one critical component is up, and `"unhealthy"` when all critical components are down.
+
+---
+
+## Key Metrics
+
+| Category | Metric | Type | Alert Threshold |
+|----------|--------|------|-----------------|
+| **Ingestion** | `ingestion.messages.processed` | Counter | Rate drops > 50% |
+| | `ingestion.quality_gate.rejected_ratio` | Gauge | > 60% |
+| | `ingestion.stage.duration_ms` | Histogram/stage | p95 > 5s |
+| | `ingestion.write_intent.pending_count` | Gauge | > 100 |
+| | `ingestion.dead_letter.count` | Counter | Any increase |
+| **Retrieval** | `retrieval.route.distribution` | Counter | graph > 40% |
+| | `retrieval.latency_ms` | Histogram/route | p95 > 3s |
+| | `retrieval.empty_results_ratio` | Gauge | > 30% |
+| **Stores** | `store.{name}.latency_ms` | Histogram | p95 > 2s |
+| | `store.{name}.error_rate` | Gauge | > 1% |
+| | `store.neo4j.entity_count` | Gauge | Growth > 1K/day |
+| | `store.orphan.count` | Gauge | Any increase |
+| **LLM** | `llm.{site}.latency_ms` | Histogram | p95 > 5s |
+| | `llm.{site}.error_rate` | Gauge | > 2% |
+| | `llm.{site}.token_cost` | Counter | Daily > budget |
+
+Metrics are emitted via OpenTelemetry from `src/beever_atlas/infra/telemetry.py`.
+
+---
+
+## Distributed Tracing
+
+Every ingestion message and query carries a trace ID through all stages and stores:
+
+```python
+@tracer.start_as_current_span("ingest_message")
+async def process_message(msg: NormalizedMessage):
+    span = trace.get_current_span()
+    span.set_attribute("message.id", msg.id)
+    span.set_attribute("message.channel", msg.channel_id)
+    span.set_attribute("message.platform", msg.platform)
+
+    with tracer.start_as_current_span("stage_2_extract"):
+        facts = await extract(msg)
+    with tracer.start_as_current_span("stage_3_entities"):
+        entities = await extract_entities(msg, facts)
+    with tracer.start_as_current_span("stage_7_persist"):
+        await persist(facts, entities, embeddings)
+```
+
+This ensures full end-to-end visibility: a single trace shows the message moving from ingestion through all pipeline stages into both Weaviate and Neo4j.
+
+---
+
+## Backup & Recovery
+
+| Store | Method | Frequency | Retention |
+|-------|--------|-----------|-----------|
+| Weaviate | `weaviate backup create` → S3 | Daily 3 AM UTC | 30 days |
+| Neo4j | `neo4j-admin dump` → S3 | Daily 3 AM UTC | 30 days |
+| MongoDB | `mongodump` → S3 | Daily 3 AM UTC | 30 days |
+
+---
+
+## Cross-Store Consistency Checks
+
+A weekly background job validates referential integrity between stores, detecting orphaned references before they affect query results:
+
+```python
+class ConsistencyChecker:
+    async def check_episodic_links(self):
+        """Verify Neo4j Event.weaviate_id → Weaviate object exists."""
+        event_ids = await self.neo4j.get_all_weaviate_ids()
+        for batch in chunks(event_ids, 100):
+            existing = await self.weaviate.batch_exists(batch)
+            orphaned = set(batch) - set(existing)
+            if orphaned:
+                metrics.record("store.orphan.episodic_links", len(orphaned))
+
+    async def check_entity_references(self):
+        """Verify Weaviate fact.graph_entity_ids → Neo4j nodes exist."""
+        facts = await self.weaviate.get_facts_with_graph_ids()
+        for fact in facts:
+            for neo4j_id in fact.graph_entity_ids:
+                if not await self.neo4j.node_exists(neo4j_id):
+                    metrics.record("store.orphan.entity_refs", 1)
+```
+
+Orphan counts feed directly into the `store.orphan.count` metric. Any increase triggers an alert. Implemented in `src/beever_atlas/infra/consistency_checker.py`.
+
+---
+
+## ADK Agent Tracing
+
+ADK agents emit OpenTelemetry spans automatically for each agent invocation, tool call, and model request. These integrate with the existing telemetry pipeline — no additional instrumentation is needed for the agent layer.
+
+Each span includes:
+- Agent name and type (`LlmAgent`, `SequentialAgent`, `ParallelAgent`, `LoopAgent`)
+- Tool invocations with input/output (e.g., `search_weaviate_hybrid`, `traverse_neo4j`)
+- Model used (primary or LiteLLM fallback)
+- Token counts and latency per model call
+
+This provides full end-to-end visibility from query receipt → agent orchestration → store operations → response generation. See [`13-adk-integration.md`](13-adk-integration.md) for the agent hierarchy.
diff --git a/docs/v2/10-access-control.md b/docs/v2/10-access-control.md
new file mode 100644
index 00000000..48e810bc
--- /dev/null
+++ b/docs/v2/10-access-control.md
@@ -0,0 +1,91 @@
+# Access Control
+
+Access control in Beever Atlas is **membership-based**: users can only see data from channels they are a member of on the originating platform. This applies to both private AND public channels — a user who has not joined `#backend` cannot see its memories, even if the channel is public.
+
+This is critical for cross-channel search: when `channel_id` is omitted from a query, the system searches across ALL channels the user is a member of, not all channels in the workspace.
+
+---
+
+## ChannelACL
+
+```python
+class ChannelACL:
+    """Access control based on platform channel membership.
+
+    Membership-based model: users can only access channels they are a member of.
+    This applies to BOTH private and public channels. A user who hasn't joined
+    a public channel cannot see its memories through Beever Atlas.
+
+    This is stricter than Slack's native model (where public channels are readable
+    by all workspace members) but matches the user expectation that "I should only
+    see data from channels I'm in."
+    """
+
+    # MongoDB collection: channel_acl
+    # {channel_id, platform, is_private, member_ids, last_synced}
+
+    async def sync_from_platform(self, channel_id: str, platform: str):
+        """Pull current membership from platform API."""
+        if platform == "slack":
+            members = await self.slack.conversations_members(channel=channel_id)
+            info = await self.slack.conversations_info(channel=channel_id)
+            is_private = info["channel"]["is_private"]
+        # ... similar for Teams, Discord
+
+        await self.collection.update_one(
+            {"channel_id": channel_id},
+            {"$set": {"is_private": is_private,
+                      "member_ids": members,
+                      "last_synced": datetime.utcnow()}},
+            upsert=True)
+
+    async def check_access(self, user_id: str, channel_id: str) -> bool:
+        """Check if user is a member of the channel. Applies to ALL channels."""
+        acl = await self.collection.find_one({"channel_id": channel_id})
+        if not acl:
+            return False  # Unknown channel → deny
+        return user_id in acl.get("member_ids", [])
+
+    async def get_accessible_channels(self, user_id: str) -> list[str]:
+        """Get all channel_ids the user is a member of.
+        Used for cross-channel search when channel_id is omitted."""
+        docs = await self.collection.find(
+            {"member_ids": user_id}
+        ).to_list()
+        return [d["channel_id"] for d in docs]
+
+    async def filter_results(self, user_id: str, results: list) -> list:
+        """Remove results from channels the user is not a member of."""
+        accessible = set(await self.get_accessible_channels(user_id))
+        return [r for r in results if r.get("channel_id") in accessible]
+```
+
+Implemented in `src/beever_atlas/infra/access_control.py`.
+
+---
+
+## Integration Points
+
+- **API authentication**: Bearer token middleware validates user identity before any operation
+- **Retrieval pipeline**: `semantic_agent` and `graph_agent` (ADK) call `acl.filter_results()` via their tool implementations before returning results
+- **Wiki builder**: Private channel sections display `[restricted]` for unauthorized users instead of content
+- **Neo4j traversal**: Global entities (Person, Technology, etc.) are visible to all, but relationships carrying `source_channel` from a private channel are filtered
+- **ACL sync**: Membership is refreshed on each channel sync and cached for 1 hour
+
+---
+
+## Auth Middleware
+
+```python
+@app.middleware("http")
+async def authenticate(request: Request, call_next):
+    token = request.headers.get("Authorization", "").replace("Bearer ", "")
+    if not token:
+        return JSONResponse(status_code=401, content={"error": "Missing auth token"})
+    user = await verify_workspace_token(token)
+    request.state.user_id = user.id
+    request.state.workspace_id = user.workspace_id
+    return await call_next(request)
+```
+
+All routes receive `request.state.user_id` and `request.state.workspace_id` after the middleware runs. ACL checks downstream use `user_id` to gate results from private channels.
diff --git a/docs/v2/11-frontend-design.md b/docs/v2/11-frontend-design.md
new file mode 100644
index 00000000..9a0b2564
--- /dev/null
+++ b/docs/v2/11-frontend-design.md
@@ -0,0 +1,709 @@
+# Frontend Design: Web Dashboard
+
+> **Status**: Implemented (MVP) — core wiki, ask, memories, graph, and connections pages are live
+> **Stack**: React 19 + TypeScript + Vite + TailwindCSS + shadcn/ui
+> **Backend**: FastAPI on port 8000, React dev server on port 3000
+
+---
+
+## 1. Overview
+
+The Beever Atlas web dashboard is a **channel-first** knowledge exploration UI. Users see only channels they've joined. Each channel is a self-contained workspace with wiki, Q&A agent, memory browser, and knowledge graph.
+
+**Primary users**: Team leads, engineering managers, anyone browsing channel knowledge.
+
+**Core UX principle**: Click a channel → explore everything about it (wiki, ask questions, browse memories, view graph). Global cross-channel search is secondary.
+
+**Tech stack**:
+- React 19 + TypeScript — component framework
+- Vite — build tool and dev server
+- TailwindCSS + shadcn/ui — styling and component primitives
+- TanStack Query (React Query) — server state, caching, polling
+- React Router v7 — client-side routing
+- cytoscape.js — graph canvas rendering (Graph tab)
+- react-markdown + remark-gfm — wiki markdown rendering (tables, lists, strikethrough)
+- mermaid — wiki diagram rendering (topic graphs, decision flows, project dependencies)
+- recharts — wiki chart rendering (contribution bars, activity trends, topic distribution)
+- Custom remark plugins — citation links, entity chips, callout boxes, chart blocks
+
+---
+
+## 2. Pages & Layout
+
+### Site Layout
+
+```
+┌─────────────────────────────────────────────────────┐
+│ Sidebar (240px)  │  Header (page title + health)    │
+│                  │──────────────────────────────────│
+│ [Logo]           │                                   │
+│                  │  Page Content                     │
+│ Dashboard        │                                   │
+│ Channels         │                                   │
+│  #backend        │                                   │
+│  #frontend       │                                   │
+│  #design         │                                   │
+│ Settings         │                                   │
+│                  │                                   │
+│ [Health Badge]   │                                   │
+└─────────────────────────────────────────────────────┘
+```
+
+Sidebar shows **only channels the user has joined** (via `GET /api/channels` which is ACL-filtered). Channels are listed directly in the sidebar nav for quick switching.
+
+---
+
+### 2.1 Dashboard (Home) — `/`
+
+Overview of system health, stats, and **global cross-channel search** (Phase 2 feature).
+
+**Content**:
+- Stat cards: total channels synced, total memories, total entities, system health
+- Recent activity feed: latest sync completions, new decisions, new entities across all joined channels
+- **Global search bar** (cross-channel) — searches across all joined channels. `Cmd+K` shortcut opens it from anywhere. *(Phase 2: initially shows "Coming soon — use per-channel Ask for now")*
+- System health badge (polls `GET /api/health` every 30s)
+
+**API calls**: `GET /api/health`, `GET /api/stats`, `GET /api/sync/status`
+
+---
+
+### 2.2 Channel List — `/channels`
+
+Simple list of all joined channels with sync status. Clicking a channel navigates to its workspace.
+
+**Channel list row**: channel name + platform icon, message count, last sync, memory count, sync status badge (idle / syncing / error), "Sync Now" button.
+
+**API calls**: `GET /api/channels`, `GET /api/sync/status`
+
+---
+
+### 2.3 Channel Workspace — `/channels/:id` (THE main page)
+
+This is the core of the application. Each channel is a full workspace with **5 tabs**:
+
+```
+┌─────────────────────────────────────────────────────┐
+│  #backend-engineering                    [Sync Now]  │
+│  ─────────────────────────────────────────────────── │
+│  [Wiki] [Ask] [Memories] [Graph] [Settings]          │
+│  ═══════                                             │
+│                                                       │
+│  Tab content area                                     │
+│                                                       │
+└─────────────────────────────────────────────────────┘
+```
+
+#### Tab 1: Wiki — `/channels/:id/wiki` (default tab)
+
+A **pageable, hierarchical knowledge base** for the channel — similar to DeepWiki. Consists of fixed structural pages (always present) and agent-generated topic pages (dynamically created based on channel content). See [`06-wiki-generation.md`](06-wiki-generation.md) for full spec.
+
+**Layout** (two-column: sidebar navigation + page content):
+```
+┌─────────────────────────────────────────────────────┐
+│  #backend-engineering Wiki              [Refresh]    │
+│  ─────────────────────────────────────────────────── │
+│  ┌──────────┐  ┌──────────────────────────────────┐ │
+│  │ PAGES    │  │ Wiki > Topics > Authentication    │ │
+│  │          │  │                                    │ │
+│  │ 1. Overview│ │ Authentication                    │ │
+│  │ 2. Topics │ │ 23 memories · 3 sub-pages          │ │
+│  │  2.1 Auth←│ │                                    │ │
+│  │    2.1.1  │ │ Team discussed JWT with RS256...   │ │
+│  │    2.1.2  │ │                                    │ │
+│  │  2.2 Infra│ │ [mermaid: sub-topic graph]         │ │
+│  │  2.3 CI/CD│ │                                    │ │
+│  │ 3. People │ │ Key Facts                          │ │
+│  │ 4. Decide.│ │ • Alice proposed RS256... [1]      │ │
+│  │ 5. Tech   │ │ • Migration completed... [2]       │ │
+│  │ 6. Project│ │                                    │ │
+│  │ 7. Activit│ │ Related Decisions                  │ │
+│  │ 8. FAQ    │ │ ✅ Use RS256 — Alice [1]           │ │
+│  │ 9. Glossar│ │                                    │ │
+│  │ 10.Resourc│ │ Related Media                      │ │
+│  │          │  │ 📄 JWT-spec.pdf · 🔗 auth0.com    │ │
+│  │          │  │                                    │ │
+│  │          │  │ Sub-pages                          │ │
+│  │          │  │ → 2.1.1 JWT Migration (12 mem.)    │ │
+│  │          │  │ → 2.1.2 OAuth Integration (7 mem.) │ │
+│  │          │  │                                    │ │
+│  │          │  │ Sources                            │ │
+│  │          │  │ [1] @alice · Mar 20 · View ↗      │ │
+│  └──────────┘  └──────────────────────────────────┘ │
+└─────────────────────────────────────────────────────┘
+```
+
+**Pages** (fixed + agent-generated):
+
+| # | Page | Type | Phase | Source |
+|---|------|------|-------|--------|
+| 1 | **Overview** — summary, stats, topic cards, highlights | Fixed | MVP | Weaviate T0 + MongoDB |
+| 2 | **Topic pages** (e.g., "Authentication") — all facts, diagrams, related items | Agent-generated | MVP | Weaviate T1/T2 |
+| 2.x | **Sub-pages** (e.g., "JWT Migration") — deep-dive into sub-themes | Agent-generated | 1.5 | Weaviate T2 |
+| 3 | **People & Experts** — grouped by role, bar chart | Fixed | MVP | Neo4j |
+| 4 | **Decisions** — mermaid flow + timeline + filters | Fixed | MVP | Neo4j |
+| 5 | **Tech Stack** — technology grid | Fixed | 1.5 | Neo4j |
+| 6 | **Projects** — cards + dependency graph | Fixed | 1.5 | Neo4j |
+| 7 | **Recent Activity** — area chart + daily groups | Fixed | MVP | Weaviate T2 |
+| 8 | **FAQ** — auto-generated Q&A | Fixed | 1.5 | LLM generation |
+| 9 | **Glossary** — channel jargon definitions | Fixed | 2 | LLM extraction |
+| 10 | **Resources & Media** — docs, images, links, videos | Fixed | 1.5 | Weaviate T2 media |
+
+**Sidebar navigation**: DeepWiki-style numbered hierarchy. Current page highlighted. Topics section collapsible. Page count badge.
+
+**Cross-cutting**: Inline `[1]` citations on every fact → hover preview + click permalink. Media badges (📄🔗🖼️🎬🎙️) on media-sourced citations. Entity chips (`@alice`, `#topic`, `$tech`) as clickable cross-page navigation. Rich content: Mermaid diagrams, recharts charts, GFM tables, callout boxes.
+
+**Lazy page loading**: `GET /wiki` returns sidebar structure + Overview. Other pages loaded on navigation via `GET /wiki/pages/:page_id`.
+
+**Stale indicator**: Freshness badge on sidebar. Yellow banner + "Refresh Wiki" button when `is_stale === true`.
+
+**API calls**: `GET /api/channels/:id/wiki` (structure + overview), `GET /api/channels/:id/wiki/pages/:page_id` (single page), `GET /api/channels/:id/wiki/structure` (sidebar only), `POST /api/channels/:id/wiki/refresh`
+
+#### Tab 2: Ask — `/channels/:id/ask`
+
+Natural language Q&A agent **with streaming**. This is the primary interaction model.
+
+**Layout**:
+```
+┌─────────────────────────────────────────────────┐
+│  Ask about #backend-engineering                  │
+│                                                   │
+│  ┌─────────────────────────────────────────────┐ │
+│  │ [User question input]              [Submit] │ │
+│  └─────────────────────────────────────────────┘ │
+│                                                   │
+│  ┌─ Agent Response ────────────────────────────┐ │
+│  │                                              │ │
+│  │  ▸ Thinking... (collapsible CoT)            │ │
+│  │    "Analyzing query... route=graph,          │ │
+│  │     entities=[Alice, JWT]..."                │ │
+│  │                                              │ │
+│  │  ▸ Tool: search_weaviate_hybrid             │ │
+│  │    → 5 results found                         │ │
+│  │                                              │ │
+│  │  ▸ Tool: traverse_neo4j                     │ │
+│  │    → Person(Alice) → DECIDED → Decision(...) │ │
+│  │                                              │ │
+│  │  ── Response ──────────────────────────────  │ │
+│  │  Alice decided to use RS256 for JWT in the   │ │
+│  │  March sprint [1]. This was blocked by...    │ │
+│  │                                              │ │
+│  │  ── Citations ─────────────────────────────  │ │
+│  │  [1] 📝 Fact: "Alice decided RS256..."       │ │
+│  │      🔗 Graph: Person(Alice)→Decision(RS256) │ │
+│  │      💬 Original: slack.com/archives/...      │ │
+│  │                                              │ │
+│  │  Route: graph | Confidence: 92% | $0.005     │ │
+│  └──────────────────────────────────────────────┘ │
+│                                                   │
+│  ┌─ Previous Questions ────────────────────────┐ │
+│  │  • What auth method did we decide on?        │ │
+│  │  • Who is working on the migration?          │ │
+│  └──────────────────────────────────────────────┘ │
+└───────────────────────────────────────────────────┘
+```
+
+**Streaming behavior** (via SSE — Server-Sent Events):
+1. User submits question → `POST /api/channels/:id/ask` (streaming)
+2. **CoT stream**: Agent's thinking appears in a collapsible section (auto-collapsed after response starts)
+3. **Tool call events**: Each tool invocation shows as a step (tool name, brief result summary)
+4. **Response stream**: Final answer streams token-by-token with inline citation markers `[1]`, `[2]`
+5. **Citations**: After response completes, citation cards render with 3 types:
+   - **Fact citation**: The atomic memory text from Weaviate
+   - **Graph citation**: The entity/relationship from Neo4j (e.g., `Person(Alice) → DECIDED → Decision(RS256)`)
+   - **Original message**: Permalink to the source Slack/Teams/Discord message
+6. **Metadata footer**: Route badge (semantic/graph/both), confidence bar, cost
+
+**Per-channel history**: Last 20 questions for this channel stored in localStorage.
+
+**API calls**: `POST /api/channels/:id/ask` (SSE streaming)
+
+#### Tab 3: Memories — `/channels/:id/memories`
+
+Browse the 3-tier memory hierarchy for this channel.
+
+**Layout**: Three-column or accordion view:
+- **Tier 0**: Channel summary (single card, always visible at top)
+- **Tier 1**: Topic cluster cards. Click to expand → shows member atomic facts.
+- **Tier 2**: Searchable list of atomic facts with quality score, timestamp, tags. Filter by topic, entity, importance, date range.
+
+Each memory card shows: text, quality score badge, timestamp, author, topic tags, entity tags. Click → expandable detail with full metadata + link to original message.
+
+**API calls**: `GET /api/channels/:id/wiki?section=overview`, `GET /api/channels/:id/topics`, `POST /api/channels/:id/search/memories`
+
+#### Tab 4: Graph — `/channels/:id/graph`
+
+Channel-scoped knowledge graph visualization.
+
+**Canvas**: cytoscape.js rendering entities as colored nodes:
+- Person — blue
+- Decision — amber
+- Project — green
+- Technology — purple
+- Team — teal
+
+**Sidebar filters**: entity type checkboxes, time range picker, relationship type filter.
+
+**Interactions**:
+- Click node → right panel with entity details + connected entities
+- Double-click → expand neighbors (1-hop)
+- Hover edge → tooltip with relationship type + timestamp + confidence
+
+**Decision timeline toggle**: Switch from graph view to vertical timeline of Decision nodes showing SUPERSEDES chains.
+
+**API calls**: `GET /api/graph/entities?channel_id=:id`, `GET /api/graph/entities/:eid/neighbors`, `GET /api/graph/decisions/:id`
+
+#### Tab 5: Channel Settings — `/channels/:id/settings`
+
+Per-channel configuration:
+- Sync schedule (manual / cron)
+- Max messages per sync
+- Enabled/disabled toggle
+- Last sync details (messages processed, duration, errors)
+- "Force Full Re-sync" button
+
+**API calls**: `GET /api/channels/:id`, `POST /api/channels/:id/sync`
+
+---
+
+### 2.4 Settings — `/settings`
+
+**Sections**:
+- **Connected Platforms** — Slack / Teams / Discord OAuth cards with connection status
+- **System Configuration** — LLM provider, cost limits, embedding model
+- **Account** — user profile, workspace info
+
+---
+
+## 3. Component Architecture
+
+```
+src/
+├── app/
+│   ├── layout.tsx              # Root layout: sidebar + header shell
+│   ├── page.tsx                # Dashboard home
+│   ├── channels/
+│   │   ├── page.tsx            # Channel list
+│   │   └── [id]/
+│   │       ├── layout.tsx      # Channel workspace layout (tab bar)
+│   │       ├── wiki/
+│   │       │   ├── page.tsx            # Wiki landing → Overview page
+│   │       │   ├── layout.tsx          # Wiki layout: sidebar nav + page content
+│   │       │   ├── people/page.tsx     # People & Experts page
+│   │       │   ├── decisions/page.tsx  # Decisions page
+│   │       │   ├── tech-stack/page.tsx # Tech Stack page
+│   │       │   ├── projects/page.tsx   # Projects page
+│   │       │   ├── activity/page.tsx   # Recent Activity page
+│   │       │   ├── faq/page.tsx        # FAQ page
+│   │       │   ├── glossary/page.tsx   # Glossary page
+│   │       │   ├── resources/page.tsx  # Resources & Media page
+│   │       │   └── topics/
+│   │       │       ├── [slug]/page.tsx      # Topic page
+│   │       │       └── [slug]/[sub]/page.tsx # Sub-topic page
+│   │       ├── ask/page.tsx    # Ask agent tab
+│   │       ├── memories/page.tsx # 3-tier memory browser
+│   │       ├── graph/page.tsx  # Knowledge graph
+│   │       └── settings/page.tsx # Channel settings
+│   └── settings/
+│       └── page.tsx            # Global settings
+├── components/
+│   ├── layout/
+│   │   ├── Sidebar.tsx         # Nav links + channel list, collapse toggle
+│   │   ├── Header.tsx          # Page title + global search trigger
+│   │   ├── HealthBadge.tsx     # GET /health polling indicator
+│   │   └── ChannelTabs.tsx     # Tab bar: Wiki | Ask | Memories | Graph | Settings
+│   ├── wiki/
+│   │   │  ── Layout ──
+│   │   ├── WikiLayout.tsx      # Two-column: sidebar navigation + page content area
+│   │   ├── WikiSidebar.tsx     # DeepWiki-style numbered page tree, collapsible, active highlight
+│   │   ├── WikiBreadcrumb.tsx  # Breadcrumb: Wiki > Topics > Authentication > JWT Migration
+│   │   ├── FreshnessBadge.tsx  # Wiki staleness indicator + refresh button
+│   │   │  ── Fixed Pages ──
+│   │   ├── OverviewPage.tsx    # Landing: summary, stats, topic cards, highlights, recent changes
+│   │   ├── PeoplePage.tsx      # Grouped by role + bar chart + person cards
+│   │   ├── DecisionsPage.tsx   # Mermaid supersede flow + vertical timeline + filters
+│   │   ├── TechStackPage.tsx   # Technology grid (Phase 1.5)
+│   │   ├── ProjectsPage.tsx    # Project cards + mermaid dependency graph (Phase 1.5)
+│   │   ├── ActivityPage.tsx    # Area chart + daily grouped activity
+│   │   ├── FAQPage.tsx         # Auto-generated Q&A pairs (Phase 1.5)
+│   │   ├── GlossaryPage.tsx    # Alphabetical term list (Phase 2)
+│   │   ├── ResourcesPage.tsx   # Media grouped by type: docs, images, links, videos (Phase 1.5)
+│   │   │  ── Agent-Generated Pages ──
+│   │   ├── TopicPage.tsx       # Topic: overview, all facts, related items, sub-page links
+│   │   ├── SubTopicPage.tsx    # Sub-topic: summary, all facts, inline media
+│   │   │  ── Shared Components ──
+│   │   ├── PersonCard.tsx      # Person card: role, expertise chips, decisions
+│   │   ├── DecisionEntry.tsx   # Timeline entry: status badge + supersedes + chips
+│   │   ├── TopicCard.tsx       # Topic overview card (used on Overview page)
+│   │   ├── ProjectCard.tsx     # Project card: lead, status, blockers
+│   │   ├── CitationPanel.tsx   # Bottom citations section for any page
+│   │   ├── MediaThumbnail.tsx  # Image thumbnail with lightbox + AI alt text
+│   │   ├── MediaPreviewCard.tsx # PDF/video/audio preview card with metadata
+│   │   ├── MediaBadge.tsx      # Inline media type badge (📄🔗🖼️🎬🎙️) on citations
+│   │   │  ── Markdown Rendering ──
+│   │   ├── WikiMarkdown.tsx    # Enhanced react-markdown renderer (all plugins)
+│   │   ├── MermaidBlock.tsx    # ```mermaid → SVG diagram
+│   │   ├── ChartBlock.tsx      # ```chart → recharts (bar, area, donut)
+│   │   ├── CalloutBox.tsx      # > [!NOTE] / [!TIP] / [!WARNING] → styled card
+│   │   ├── EntityChip.tsx      # @person #topic $tech → clickable navigation chip
+│   │   ├── CitationLink.tsx    # [1] → hover preview + click permalink
+│   │   └── MediaEmbed.tsx      # Inline image/PDF/video rendering in page content
+│   ├── ask/
+│   │   ├── AskInput.tsx        # Question input + submit button
+│   │   ├── AgentStream.tsx     # Streaming response container
+│   │   ├── ThinkingBlock.tsx   # Collapsible CoT thinking section
+│   │   ├── ToolCallStep.tsx    # Tool invocation step (name + result summary)
+│   │   ├── ResponseBlock.tsx   # Streaming answer with inline citations
+│   │   ├── CitationCard.tsx    # Expandable citation: fact + graph + original message
+│   │   ├── ResponseMeta.tsx    # Route badge + confidence + cost
+│   │   └── QuestionHistory.tsx # Per-channel question history sidebar
+│   ├── memories/
+│   │   ├── TierBrowser.tsx     # 3-tier accordion/column layout
+│   │   ├── SummaryCard.tsx     # Tier 0 channel summary
+│   │   ├── ClusterCard.tsx     # Tier 1 topic cluster (expandable)
+│   │   ├── FactCard.tsx        # Tier 2 atomic fact with metadata
+│   │   └── MemoryFilters.tsx   # Filter by topic, entity, date, importance
+│   ├── graph/
+│   │   ├── GraphCanvas.tsx     # cytoscape.js wrapper
+│   │   ├── EntityPanel.tsx     # Right-side entity detail panel
+│   │   ├── GraphFilters.tsx    # Entity type + time + relationship filters
+│   │   └── TimelineView.tsx    # Decision timeline toggle view
+│   └── channels/
+│       ├── ChannelList.tsx     # Sortable table of joined channels
+│       ├── ChannelCard.tsx     # Row: name, stats, status badge
+│       └── SyncButton.tsx      # Sync Now with optimistic + polling state
+├── hooks/
+│   ├── useAsk.ts              # SSE streaming for agent Q&A
+│   ├── useWiki.ts             # GET /wiki → structure + overview, cache with TanStack Query
+│   ├── useWikiPage.ts         # GET /wiki/pages/:id → single page content, cache per page
+│   ├── useWikiStructure.ts    # GET /wiki/structure → sidebar tree (lightweight)
+│   ├── useWikiRefresh.ts      # POST /wiki/refresh → trigger regeneration + poll until done
+│   ├── useMemories.ts         # 3-tier data browsing + search
+│   ├── useSync.ts             # sync_channel + get_sync_status polling (5s)
+│   ├── useGraph.ts            # entity fetch + neighbor expansion
+│   ├── useHealth.ts           # GET /health polling every 30s
+│   └── useChannels.ts         # channel list (ACL-filtered)
+├── lib/
+│   ├── api.ts                 # fetch wrapper: baseURL, error handling, auth header
+│   ├── sse.ts                 # Server-Sent Events client for streaming
+│   └── types.ts               # TypeScript types mirroring backend schemas
+└── styles/
+    └── globals.css
+```
+
+---
+
+## 4. API Integration
+
+| Page | API Calls |
+|------|-----------|
+| Dashboard | `GET /api/health`, `GET /api/stats`, `GET /api/sync/status` |
+| Channel List | `GET /api/channels` (ACL-filtered to joined channels) |
+| Wiki Tab (landing) | `GET /api/channels/:id/wiki` (structure + overview page) |
+| Wiki Tab (page nav) | `GET /api/channels/:id/wiki/pages/:page_id` (single page content) |
+| Wiki Tab (sidebar) | `GET /api/channels/:id/wiki/structure` (lightweight sidebar tree) |
+| Wiki Tab (refresh) | `POST /api/channels/:id/wiki/refresh` (force regeneration) |
+| Ask Tab | `POST /api/channels/:id/ask` (SSE streaming) |
+| Memories Tab | `GET /api/channels/:id/wiki?section=overview`, `GET /api/channels/:id/topics`, `POST /api/channels/:id/search/memories` |
+| Graph Tab | `GET /api/graph/entities?channel_id=`, `GET /api/graph/entities/:id/neighbors`, `GET /api/graph/decisions/:id` |
+| Channel Settings | `GET /api/channels/:id`, `POST /api/channels/:id/sync` |
+| Global Settings | `GET /api/settings`, `PUT /api/settings`, `GET /api/platforms` |
+
+**Base URL**: `VITE_API_URL` env var, default `http://localhost:8000`.
+
+**Error handling**: non-2xx responses caught by TanStack Query; `ErrorBoundary` components show inline error states.
+
+---
+
+## 5. Key Interaction Flows
+
+### Ask flow (streaming)
+1. User types question in `AskInput` on the Ask tab
+2. User submits (Enter or button)
+3. `useAsk` opens SSE connection to `POST /api/channels/:id/ask`
+4. **Event: `thinking`** → `ThinkingBlock` shows collapsible CoT (auto-collapses when response starts)
+5. **Event: `tool_call`** → `ToolCallStep` renders tool name + brief result (e.g., "search_weaviate_hybrid → 5 results")
+6. **Event: `response_delta`** → `ResponseBlock` streams answer tokens with citation markers `[1]`, `[2]`
+7. **Event: `citations`** → `CitationCard[]` render with 3 citation types:
+   - Fact citation (Weaviate atomic memory text)
+   - Graph citation (Neo4j entity path)
+   - Original message (platform permalink)
+8. **Event: `done`** → `ResponseMeta` shows route badge, confidence, cost
+9. Question appended to per-channel localStorage history
+
+### Sync flow
+1. User clicks "Sync Now" in channel settings or channel list
+2. `POST /api/channels/:id/sync` → returns `job_id`
+3. `useSync` polls every 5s, progress bar updates
+4. On completion: toast, wiki stale banner appears on Wiki tab
+
+### Wiki navigation flow
+1. User clicks Wiki tab → `GET /api/channels/:id/wiki` returns structure + Overview page
+2. Sidebar renders numbered page tree from `structure.pages`
+3. User clicks a page in sidebar → `GET /api/channels/:id/wiki/pages/:page_id` loads that page
+4. Breadcrumb updates: Wiki > Topics > Authentication > JWT Migration
+5. Page content renders via `WikiMarkdown` (mermaid, charts, tables, citations, media)
+6. Entity chips (`@alice`, `#topic`) are clickable — navigate to the relevant wiki page
+
+### Wiki refresh flow
+1. Yellow stale banner on Wiki sidebar
+2. Click "Refresh Wiki" → `POST /api/channels/:id/wiki/refresh`
+3. `useWikiRefresh` polls until `is_stale` clears
+4. All cached pages invalidated → re-fetch structure + current page with fade transition
+
+### Memory browsing
+1. Tier 0 summary card always visible at top
+2. Tier 1 topic clusters listed as expandable cards
+3. Click cluster → reveals its Tier 2 atomic facts
+4. Search/filter across all atomics by topic, entity, date range
+
+### Graph exploration
+1. Graph loads with entities from this channel
+2. Click node → detail panel on right
+3. Double-click → expand neighbors
+4. Toggle to timeline view for decision SUPERSEDES chains
+
+---
+
+## 6. Streaming Protocol (SSE Events)
+
+The Ask tab uses Server-Sent Events for real-time agent streaming:
+
+```typescript
+// SSE event types from POST /api/channels/:id/ask
+interface AskSSEEvents {
+  thinking: { content: string };           // CoT reasoning chunk
+  tool_call: {
+    tool: string;                           // e.g., "search_weaviate_hybrid"
+    input_summary: string;                  // e.g., "query='JWT auth', channel=backend"
+    output_summary: string;                 // e.g., "5 results, top score 0.87"
+  };
+  response_delta: { content: string };     // Answer token chunk
+  citations: { citations: Citation[] };     // Full citation objects
+  metadata: {
+    route_used: "semantic" | "graph" | "both";
+    confidence: number;
+    cost_usd: number;
+    degraded: boolean;
+  };
+  error: { message: string; code: string }; // Error during streaming
+  done: {};                                 // Stream complete
+}
+
+// Citation types
+interface Citation {
+  id: string;
+  type: "fact" | "graph" | "message";     // 3 citation types
+  // Fact citation (from Weaviate)
+  fact_text?: string;                      // Atomic memory text
+  quality_score?: number;
+  tier?: "atomic" | "topic" | "summary";
+  // Graph citation (from Neo4j)
+  graph_path?: string;                     // e.g., "Person(Alice) → DECIDED → Decision(RS256)"
+  entities?: { name: string; type: string }[];
+  // Original message citation
+  channel: string;
+  user: string;
+  timestamp: string;
+  permalink: string;                       // Platform message URL
+}
+```
+
+---
+
+## 7. TypeScript Types (Backend Schema Mapping)
+
+```typescript
+// lib/types.ts
+
+export interface AskResponse {
+  answer: string;
+  citations: Citation[];
+  route_used: "semantic" | "graph" | "both";
+  confidence: number;
+  degraded: boolean;
+  cost_usd: number;
+}
+
+// ── Wiki types (pageable, hierarchical) ──
+
+export interface WikiResponse {
+  channel_id: string;
+  channel_name: string;
+  platform: "slack" | "teams" | "discord";
+  generated_at: string;
+  is_stale: boolean;
+  structure: WikiStructure;      // Sidebar navigation tree
+  overview: WikiPage;            // Overview page content (landing)
+  metadata: WikiMetadata;
+}
+
+export interface WikiStructure {
+  channel_id: string;
+  channel_name: string;
+  platform: string;
+  generated_at: string;
+  is_stale: boolean;
+  pages: WikiPageNode[];         // Top-level page tree
+}
+
+export interface WikiPageNode {
+  id: string;
+  title: string;
+  slug: string;
+  section_number: string;        // "1", "2.1", "2.1.1"
+  page_type: "fixed" | "topic" | "sub-topic";
+  memory_count: number;
+  children: WikiPageNode[];      // Recursive for sub-pages
+}
+
+export interface WikiPage {
+  id: string;
+  slug: string;
+  title: string;
+  page_type: "fixed" | "topic" | "sub-topic";
+  parent_id: string | null;
+  section_number: string;
+  content: string;               // Enhanced Markdown (mermaid/chart/callout/media)
+  summary: string;               // 1-2 sentence summary for cards/tooltips
+  memory_count: number;
+  last_updated: string;
+  citations: WikiCitation[];
+  children: WikiPageRef[];       // Sub-page references
+}
+
+export interface WikiPageRef {
+  id: string;
+  title: string;
+  slug: string;
+  section_number: string;
+  memory_count: number;
+}
+
+export interface WikiMetadata {
+  member_count: number;
+  message_count: number;
+  memory_count: number;
+  entity_count: number;
+  media_count: number;
+  page_count: number;            // Total wiki pages
+  generation_cost_usd: number;
+  generation_duration_ms: number;
+}
+
+export interface WikiCitation {
+  id: string;                    // "[1]"
+  author: string;
+  channel: string;
+  timestamp: string;
+  text_excerpt: string;          // First 100 chars of original message
+  permalink: string;             // Slack/Teams/Discord message URL
+  media_type?: "pdf" | "image" | "link" | "video" | "audio";
+  media_name?: string;           // Filename or domain for media-sourced citations
+}
+
+export interface SyncResponse {
+  status: "started" | "already_running" | "queued";
+  channel_id: string;
+  estimated_messages: number;
+  job_id: string;
+}
+
+export interface SyncStatusResponse {
+  channel_id: string;
+  state: "idle" | "syncing" | "error";
+  progress_pct: number;
+  messages_processed: number;
+  last_sync_at: string | null;
+  error_message: string | null;
+}
+
+export interface ChannelResponse {
+  channel_id: string;
+  name: string;
+  platform: "slack" | "teams" | "discord";
+  is_private: boolean;
+  last_synced_at: string | null;
+  message_count: number;
+  memory_count: number;
+  entity_count: number;
+  wiki_is_stale: boolean;
+  sync_status: "idle" | "running" | "failed";
+}
+
+export interface HealthResponse {
+  status: "healthy" | "degraded" | "down";
+  components: Record<string, "up" | "down">;
+  latency_ms: Record<string, number>;
+  checked_at: string;
+}
+
+export interface TopicCluster {
+  id: string;
+  summary: string;
+  topic_tags: string[];
+  member_count: number;
+}
+
+export interface AtomicFact {
+  id: string;
+  memory: string;
+  quality_score: number;
+  timestamp: string;
+  user_name: string;
+  topic_tags: string[];
+  entity_tags: string[];
+  importance: string;
+  permalink: string;
+}
+```
+
+---
+
+## 8. Design Tokens
+
+**Color scheme**: light theme default, dark mode via `prefers-color-scheme` + manual toggle.
+
+**Palette**:
+- Primary: `slate-900` / `slate-50`
+- Accent: `indigo-600` — links, active nav, primary buttons
+- Success: `emerald-500` — healthy, sync complete
+- Warning: `amber-500` — stale wiki, degraded
+- Error: `red-500` — sync error, component down
+- Graph nodes: blue (Person), amber (Decision), green (Project), purple (Technology), teal (Team)
+
+**Typography**: Inter (UI), JetBrains Mono (code/technical).
+
+**Spacing**: 4px base unit.
+
+**Cards**: `rounded-lg border border-slate-200 shadow-sm hover:shadow-md transition-shadow`
+
+**Sidebar**: 240px expanded, 64px collapsed. State persisted to localStorage.
+
+---
+
+## 9. Backend Routes Required
+
+| Method | Path | Purpose |
+|--------|------|---------|
+| `GET` | `/api/health` | System health check |
+| `GET` | `/api/stats` | Aggregate statistics |
+| `GET` | `/api/channels` | List joined channels (ACL-filtered) |
+| `GET` | `/api/channels/:id` | Channel details + sync status |
+| `POST` | `/api/channels/:id/sync` | Trigger sync |
+| `GET` | `/api/channels/:id/wiki` | Wiki structure + Overview page (landing) |
+| `GET` | `/api/channels/:id/wiki/pages/:page_id` | Single wiki page content (lazy loaded) |
+| `GET` | `/api/channels/:id/wiki/structure` | Sidebar navigation tree (lightweight) |
+| `POST` | `/api/channels/:id/wiki/refresh` | Force wiki regeneration |
+| `GET` | `/api/channels/:id/topics` | Tier 1 topic clusters |
+| `POST` | `/api/channels/:id/ask` | **Streaming Q&A** (SSE) — per-channel |
+| `POST` | `/api/channels/:id/search/memories` | Search atomic facts in channel |
+| `GET` | `/api/graph/entities` | Entity list (channel-scoped) |
+| `GET` | `/api/graph/entities/:id/neighbors` | N-hop neighborhood |
+| `GET` | `/api/graph/decisions/:channel_id` | Decision timeline |
+| `POST` | `/api/search` | **Global cross-channel search** (Phase 2) |
+| `GET` | `/api/settings` | Workspace config |
+| `PUT` | `/api/settings` | Update workspace config |
+| `GET` | `/api/platforms` | Connected platforms |
+| `POST` | `/api/platforms/:type/connect` | OAuth flow |
+
+**Key new endpoint**: `POST /api/channels/:id/ask` returns an SSE stream (not JSON). See §6 for the event protocol. This is separate from `POST /api/search` which is the global cross-channel endpoint (Phase 2).
diff --git a/docs/v2/12-api-design.md b/docs/v2/12-api-design.md
new file mode 100644
index 00000000..5258c238
--- /dev/null
+++ b/docs/v2/12-api-design.md
@@ -0,0 +1,492 @@
+# API Design: MCP Server & REST Interface
+
+## 1. Overview
+
+Beever Atlas exposes two interfaces:
+
+- **REST API** (current): HTTP endpoints consumed by the web dashboard frontend and external integrations. All implemented endpoints are listed in this document.
+- **MCP Server** (planned): Tools + Resources for AI assistants (Claude, etc.). The MCP tool specs below describe the planned interface — the underlying service logic exists, but the MCP wrapper layer is not yet the primary interface.
+
+Both interfaces share the same service layer — MCP tools and REST routes call the same underlying functions. There is no separate logic per interface.
+
+> **Status**: REST API is fully implemented. MCP Tools (`ask_questions`, `search_memories`, etc.) are design specs for the planned MCP server layer.
+
+```python
+@app.middleware("http")
+async def authenticate(request: Request, call_next):
+    token = request.headers.get("Authorization", "").replace("Bearer ", "")
+    if not token:
+        return JSONResponse(status_code=401, content={"error": "Missing auth token"})
+    user = await verify_workspace_token(token)
+    request.state.user_id = user.id
+    request.state.workspace_id = user.workspace_id
+    return await call_next(request)
+```
+
+Private channel access is inherited from platform membership. Public channels are visible to all workspace members. Private channel results are filtered via `acl.filter_results()` at the retrieval layer.
+
+---
+
+## 2. MCP Tools
+
+Seven tools are exposed. Graph queries are abstracted behind `ask_questions` — the smart router decides when to use Neo4j. Users and AI clients do not need to know about the dual-memory architecture.
+
+```python
+@tool("ask_questions")
+async def ask_questions(
+    question: str,           # Natural language query
+    channel_id: str = None,  # Target channel (None = cross-channel search, ACL-filtered)
+    include_citations: bool = True,
+    max_results: int = 10,
+) -> AskResponse:
+    """Ask a question about channel knowledge. Routes automatically
+    to semantic search, graph traversal, or both based on query type.
+    Cost: $0.001-$0.006 depending on route."""
+
+
+@tool("search_memories")
+async def search_memories(
+    query: str,              # Search query
+    channel_id: str,
+    tier: str = "all",       # "all" | "summary" | "topic" | "atomic"
+    limit: int = 15,
+    include_images: bool = False,
+) -> SearchResponse:
+    """Direct hybrid search — bypasses router for power users.
+    Cost: ~$0.001"""
+
+
+@tool("get_wiki")
+async def get_wiki(
+    channel_id: str,
+    section: str = "all",    # "all"|"overview"|"topics"|"people"|"decisions"|"recent"
+) -> WikiResponse:
+    """Read cached wiki content. FREE for cached sections.
+    Returns stale data if wiki is dirty — use refresh_wiki to force update."""
+
+
+@tool("get_topics")
+async def get_topics(
+    channel_id: str,
+) -> TopicsResponse:
+    """List topic clusters for a channel. FREE (cached Tier 1)."""
+
+
+@tool("sync_channel")
+async def sync_channel(
+    channel_id: str,
+    max_messages: int = 5000,  # Safety limit to prevent cost explosion
+    since: str = None,         # ISO timestamp, defaults to last sync point
+) -> SyncResponse:
+    """Trigger ingestion for a channel. Runs in background.
+    Cost: ~$0.0025/message (text), ~$0.008/message (with media)."""
+
+
+@tool("get_sync_status")
+async def get_sync_status(
+    channel_id: str = None,    # None = all channels
+) -> SyncStatusResponse:
+    """Check sync progress and health status. FREE."""
+
+
+@tool("refresh_wiki")
+async def refresh_wiki(
+    channel_id: str,
+) -> RefreshResponse:
+    """Force wiki regeneration. Triggers full reconsolidation.
+    Cost: ~$0.01 for LLM synthesis."""
+```
+
+---
+
+## 3. MCP Resources
+
+Read-only, URI-based access to pre-rendered wiki content. Resources are served from cache and do not trigger LLM calls.
+
+```python
+@resource("wiki://{channel_id}")           # Full wiki markdown
+@resource("wiki://{channel_id}/overview")  # Tier 0 summary only
+@resource("wiki://{channel_id}/topics")    # Tier 1 cluster list
+```
+
+Resources return stale content if the wiki is dirty. Clients should call `refresh_wiki` first if freshness is required.
+
+---
+
+## 4. REST API Endpoints
+
+All endpoints require `Authorization: Bearer <token>`. All responses are `application/json`.
+
+### 4.1 Channels
+
+| Method | Path | Description |
+|--------|------|-------------|
+| GET | `/api/channels` | List all synced channels with metadata |
+| GET | `/api/channels/:id` | Get channel details + sync status |
+| POST | `/api/channels/:id/sync` | Trigger sync (wraps `sync_channel`) |
+| GET | `/api/channels/:id/wiki` | Get wiki content (wraps `get_wiki`) |
+| POST | `/api/channels/:id/wiki/refresh` | Force wiki refresh (wraps `refresh_wiki`) |
+
+**GET /api/channels**
+
+Returns all channels the authenticated user can access, ordered by last sync time.
+
+Query params: `platform` (filter by `slack`|`teams`|`discord`), `page`, `limit` (default 50).
+
+**GET /api/channels/:id**
+
+Returns channel metadata, current sync state, and wiki staleness flag.
+
+**POST /api/channels/:id/sync**
+
+Request body:
+```json
+{
+  "max_messages": 5000,
+  "since": "2024-01-15T00:00:00Z"
+}
+```
+
+Returns a `SyncResponse` with `job_id` for polling. Both fields are optional; `since` defaults to the last sync checkpoint.
+
+**GET /api/channels/:id/wiki**
+
+Query params: `section` (`all`|`overview`|`topics`|`people`|`decisions`|`recent`, default `all`).
+
+Returns a `WikiResponse`.
+
+**POST /api/channels/:id/wiki/refresh**
+
+No body. Enqueues a full reconsolidation job. Returns `{ "job_id": "...", "status": "queued" }`.
+
+---
+
+### 4.2 Per-Channel Ask (Streaming)
+
+| Method | Path | Description |
+|--------|------|-------------|
+| POST | `/api/channels/:id/ask` | **Streaming Q&A** for a specific channel (SSE) |
+| POST | `/api/channels/:id/search/memories` | Direct memory search within channel |
+
+**POST /api/channels/:id/ask** (Server-Sent Events)
+
+The primary query endpoint. Returns an SSE stream showing the agent's thinking, tool calls, and response in real-time.
+
+Request body:
+```json
+{
+  "question": "What did we decide about the auth approach?",
+  "include_citations": true,
+  "max_results": 10
+}
+```
+
+Response: `Content-Type: text/event-stream`
+
+```
+event: thinking
+data: {"content": "Analyzing query... route=graph, entities=[Alice, JWT]..."}
+
+event: tool_call
+data: {"tool": "search_weaviate_hybrid", "input_summary": "query='JWT auth'", "output_summary": "5 results, top score 0.87"}
+
+event: tool_call
+data: {"tool": "traverse_neo4j", "input_summary": "entities=[Alice]", "output_summary": "Person(Alice) → DECIDED → Decision(RS256)"}
+
+event: response_delta
+data: {"content": "Alice decided to use RS256 for JWT"}
+
+event: response_delta
+data: {"content": " in the March sprint [1]. This was blocked by"}
+
+event: citations
+data: {"citations": [{"id": "c1", "type": "fact", "fact_text": "Alice decided RS256...", ...}, {"id": "c2", "type": "graph", "graph_path": "Person(Alice) → DECIDED → Decision(RS256)", ...}, {"id": "c3", "type": "message", "permalink": "https://slack.com/archives/...", ...}]}
+
+event: metadata
+data: {"route_used": "graph", "confidence": 0.92, "cost_usd": 0.005, "degraded": false}
+
+event: done
+data: {}
+```
+
+**Citation types** (3 kinds per result):
+- `fact` — the atomic memory text from Weaviate with quality score
+- `graph` — the entity/relationship path from Neo4j
+- `message` — permalink to the original Slack/Teams/Discord message
+
+The ADK Runner streams agent events directly to the SSE connection. The `query_router_agent` emits thinking events, each tool call is reported as it happens, and the `response_agent` streams its output token-by-token.
+
+**POST /api/channels/:id/search/memories**
+
+Direct hybrid search within a channel — bypasses the agent for power users who want raw memory results.
+
+Request body:
+```json
+{
+  "query": "authentication JWT refresh tokens",
+  "tier": "all",
+  "limit": 15,
+  "include_images": false
+}
+```
+
+Returns a `SearchResponse`.
+
+### 4.2.1 Global Cross-Channel Search (Phase 2)
+
+| Method | Path | Description |
+|--------|------|-------------|
+| POST | `/api/search` | Cross-channel Q&A (searches all joined channels) |
+
+**POST /api/search**
+
+Same request/response shape as per-channel ask, but `channel_id` is derived from the user's joined channels via `acl.get_accessible_channels()`. Results are ACL-filtered.
+
+This is a **Phase 2** feature — the per-channel ask (`/api/channels/:id/ask`) is the priority.
+
+---
+
+### 4.3 Graph
+
+| Method | Path | Description |
+|--------|------|-------------|
+| GET | `/api/graph/entities` | List entities with filters |
+| GET | `/api/graph/entities/:id` | Get entity details + relationships |
+| GET | `/api/graph/entities/:id/neighbors` | N-hop neighborhood for graph visualization |
+| GET | `/api/graph/traverse` | Run traversal from entity names |
+| GET | `/api/graph/decisions/:channel_id` | Decision timeline for a channel |
+
+**GET /api/graph/entities**
+
+Query params: `type` (entity type filter), `channel_id`, `q` (name search), `page`, `limit` (default 50).
+
+Returns `{ "entities": EntityResponse[], "total": int, "page": int }`.
+
+**GET /api/graph/entities/:id**
+
+Returns full entity details including all outgoing and incoming relationships.
+
+**GET /api/graph/entities/:id/neighbors**
+
+Query params: `hops` (int, default 1, max 3).
+
+Returns a `GraphNeighborhoodResponse` with nodes and edges suitable for passing directly to a graph visualization library (e.g., D3, Cytoscape).
+
+**GET /api/graph/traverse**
+
+Query params: `from` (entity name), `channel_id` (optional scope), `depth` (default 2).
+
+Runs a Neo4j traversal starting from the named entity. Returns nodes and relationships encountered.
+
+**GET /api/graph/decisions/:channel_id**
+
+Returns a chronological decision timeline for the channel. Supersedes decision chain queries — timeline view is the canonical representation of decision history.
+
+Query params: `since` (ISO timestamp), `limit` (default 100).
+
+---
+
+### 4.4 System
+
+| Method | Path | Description |
+|--------|------|-------------|
+| GET | `/api/health` | Health check across all components |
+| GET | `/api/stats` | Aggregate statistics |
+| GET | `/api/sync/status` | Global sync status (wraps `get_sync_status`) |
+
+**GET /api/health**
+
+No auth required. Checks MongoDB, Neo4j, vector store, and job queue. Returns `200` if all healthy, `503` if any component is degraded.
+
+**GET /api/stats**
+
+Returns workspace-level aggregate counts: total memories, total entities, total channels synced, approximate storage used.
+
+**GET /api/sync/status**
+
+Query params: `channel_id` (optional; omit for all channels).
+
+Returns a `SyncStatusResponse`.
+
+---
+
+### 4.5 Settings
+
+| Method | Path | Description |
+|--------|------|-------------|
+| GET | `/api/settings` | Get current workspace configuration |
+| PUT | `/api/settings` | Update configuration |
+| GET | `/api/platforms` | List connected platforms + OAuth status |
+| POST | `/api/platforms/:type/connect` | Initiate OAuth flow |
+
+**GET /api/settings**
+
+Returns workspace configuration: daily cost budget, sync defaults, rate limit overrides.
+
+**PUT /api/settings**
+
+Request body (partial update, all fields optional):
+```json
+{
+  "daily_cost_budget_usd": 5.00,
+  "default_max_messages": 5000,
+  "wiki_auto_refresh": true
+}
+```
+
+**GET /api/platforms**
+
+Returns connected platforms (`slack`, `teams`, `discord`) with OAuth token status and scopes.
+
+**POST /api/platforms/:type/connect**
+
+`:type` is `slack`, `teams`, or `discord`. Returns an OAuth redirect URL. The frontend should redirect the user to this URL to complete the OAuth handshake.
+
+---
+
+## 5. Response Schemas
+
+### Shared (MCP + REST)
+
+```python
+class AskResponse:
+    answer: str                    # Grounded response with inline citations
+    citations: list[Citation]      # Source facts with platform permalinks
+    route_used: str                # "semantic" | "graph" | "both"
+    confidence: float              # 0.0-1.0
+    degraded: bool                 # True if a component was unavailable
+    cost_usd: float                # Estimated cost of this query
+
+class Citation:
+    text: str                      # Original fact text
+    channel: str                   # Source channel name
+    user: str                      # Who said it
+    timestamp: str                 # ISO 8601
+    permalink: str                 # Platform message URL
+    tier: str                      # "atomic" | "topic" | "summary"
+
+class SyncResponse:
+    status: str                    # "started" | "already_running" | "queued"
+    channel_id: str
+    estimated_messages: int        # Approximate message count to process
+    job_id: str                    # For tracking via get_sync_status
+
+class WikiResponse:
+    content: str                   # Markdown wiki content
+    generated_at: str              # ISO 8601
+    is_stale: bool                 # True if wiki_dirty flag is set
+    channel_id: str
+```
+
+### REST-only
+
+```python
+class ChannelResponse:
+    channel_id: str
+    name: str                      # Display name (e.g. "#backend")
+    platform: str                  # "slack" | "teams" | "discord"
+    is_private: bool
+    last_synced_at: str            # ISO 8601, null if never synced
+    message_count: int             # Total messages ingested
+    wiki_is_stale: bool
+    sync_status: str               # "idle" | "running" | "failed"
+
+class EntityResponse:
+    id: str                        # Neo4j node ID
+    name: str
+    type: str                      # Entity type (person, project, decision, etc.)
+    channel_id: str                # Source channel (if scoped)
+    properties: dict               # Type-specific properties
+    relationship_count: int        # Total edges on this node
+
+class GraphNeighborhoodResponse:
+    center_id: str
+    nodes: list[dict]              # [{id, name, type, properties}]
+    edges: list[dict]              # [{source, target, type, properties}]
+    hops: int                      # Depth actually returned
+
+class StatsResponse:
+    workspace_id: str
+    total_memories: int
+    total_entities: int
+    total_relationships: int
+    channels_synced: int
+    estimated_storage_mb: float
+    last_updated: str              # ISO 8601
+
+class HealthResponse:
+    status: str                    # "healthy" | "degraded" | "down"
+    components: dict               # {mongodb, neo4j, vector_store, job_queue} -> "up"|"down"
+    latency_ms: dict               # Per-component latency
+    checked_at: str                # ISO 8601
+```
+
+---
+
+## 6. Error Handling
+
+All errors use a consistent envelope:
+
+```json
+{
+  "error": {
+    "code": "CHANNEL_NOT_FOUND",
+    "message": "Channel #backend not found or not synced",
+    "status": 404
+  }
+}
+```
+
+| Status | Code | Trigger |
+|--------|------|---------|
+| 400 | `INVALID_QUERY` | Blank or malformed query string |
+| 400 | `INVALID_CHANNEL_ID` | Channel ID format invalid |
+| 401 | `UNAUTHORIZED` | Missing or unparseable Bearer token |
+| 401 | `TOKEN_EXPIRED` | Token is valid but expired |
+| 403 | `CHANNEL_ACCESS_DENIED` | Private channel, user is not a member |
+| 404 | `CHANNEL_NOT_FOUND` | Channel not synced or does not exist in this workspace |
+| 404 | `ENTITY_NOT_FOUND` | Neo4j node ID not found |
+| 429 | `RATE_LIMIT_EXCEEDED` | Per-user rate limit or daily cost budget hit |
+| 503 | `SERVICE_DEGRADED` | A required component is down; response may be partial |
+
+For `503 SERVICE_DEGRADED`, the response body includes a `degraded_components` array and any partial results that could be returned:
+
+```json
+{
+  "error": {
+    "code": "SERVICE_DEGRADED",
+    "message": "Neo4j unavailable — returning semantic results only",
+    "status": 503,
+    "degraded_components": ["neo4j"]
+  },
+  "partial_result": { ... }
+}
+```
+
+---
+
+## 7. Rate Limiting & Cost Controls
+
+### Per-user rate limits
+
+| Endpoint class | Limit |
+|----------------|-------|
+| Query endpoints (`/api/search/*`, `ask_questions`, `search_memories`) | 60 req/min |
+| Sync endpoints (`/api/channels/:id/sync`, `sync_channel`) | 10 req/min |
+| Read endpoints (wiki, health, stats, graph) | 120 req/min |
+
+Limits are enforced per `user_id` extracted from the Bearer token. Exceeding a limit returns `429 RATE_LIMIT_EXCEEDED` with a `Retry-After` header (seconds until the window resets).
+
+### Daily cost budget
+
+A configurable `daily_cost_budget_usd` setting (default: $5.00 per workspace) caps total LLM spend. When the budget is exhausted, all cost-incurring operations return `429 RATE_LIMIT_EXCEEDED` with `"code": "DAILY_BUDGET_EXCEEDED"` until the next UTC day.
+
+Cost is tracked per workspace. The `/api/stats` endpoint includes `cost_today_usd` in its response.
+
+### Sync limits
+
+- Default `max_messages` per sync: 5000 (configurable via `/api/settings`)
+- Absolute hard cap: 10,000 messages per sync call regardless of setting
+- Concurrent syncs per workspace: 3
+
+Sync operations exceeding `max_messages` are truncated at the limit and continue from that point on the next sync call using the checkpoint stored in `last_synced_at`.
diff --git a/docs/v2/13-adk-integration.md b/docs/v2/13-adk-integration.md
new file mode 100644
index 00000000..1aad24e9
--- /dev/null
+++ b/docs/v2/13-adk-integration.md
@@ -0,0 +1,274 @@
+# ADK & Chat SDK Integration
+
+> **Status**: Implemented (ingestion pipeline, consolidation). Query routing agents are in development.
+> **Scope**: Google ADK agent orchestration for ingestion, consolidation, and (planned) Q&A routing
+
+---
+
+## Overview
+
+All LLM-powered operations in Beever Atlas v2 are orchestrated by [Google ADK](https://google.github.io/adk-docs/) agents. This replaces the direct LLM API call pattern with composable agent types (`LlmAgent`, `SequentialAgent`, `ParallelAgent`, `LoopAgent`), each with typed tools and shared session state. Model fallback is handled by [LiteLLM](https://docs.litellm.ai/) — no custom `LLMProvider` class is needed.
+
+The behavioral specs in docs 01-12 (prompts, retrieval logic, quality gates, etc.) remain accurate — they describe *what* each component does. This document describes *how* they are orchestrated.
+
+---
+
+## Google ADK Agent Architecture
+
+### Agent Hierarchy
+
+> **Implementation status**: The ingestion pipeline and consolidation agents are implemented. The query routing agent hierarchy (semantic_agent, graph_agent, response_agent) is the **design spec for the planned Q&A agent** — only a placeholder `echo.py` exists in `agents/query/` today.
+
+#### ✅ Implemented: Ingestion Pipeline
+
+```
+ingestion_pipeline (SequentialAgent)  — create_ingestion_pipeline()
+│
+│   Created by the factory in agents/ingestion/pipeline.py.
+│   Processes one NormalizedMessage through 6 stages.
+│
+├── preprocessor (PreprocessorAgent)
+│   Model: none (deterministic, no LLM)
+│   Behavior: Stage 1 — Slack mrkdwn → markdown, thread context assembly,
+│             bot/system message filtering, media processing (images via Gemini
+│             vision, PDFs via pypdf/chunking)
+│
+├── extraction_parallel (ParallelAgent)
+│   │
+│   ├── fact_extractor (LlmAgent)        — create_fact_extractor()
+│   │   Model: LLM_FAST_MODEL (default: gemini-2.5-flash)
+│   │   Behavior: Stage 2 — extract atomic facts, quality gate (reject < 0.5,
+│   │             max 2 facts/message)
+│   │
+│   └── entity_extractor (LlmAgent)      — create_entity_extractor()
+│       Model: LLM_FAST_MODEL (default: gemini-2.5-flash)
+│       Behavior: Stage 2 — extract entities + relationships, quality gate
+│                 (reject confidence < 0.6), filter hypotheticals
+│
+├── enrich_parallel (ParallelAgent)
+│   │
+│   ├── embedder (EmbedderAgent)
+│   │   Model: none (calls Jina v4 API directly)
+│   │   Behavior: Stage 3 — generate 2048-dim named vectors (text + image)
+│   │
+│   └── cross_batch_validator (LlmAgent) — create_cross_batch_validator()
+│       Model: LLM_FAST_MODEL (default: gemini-2.5-flash)
+│       Behavior: Stage 3 — resolve entity aliases across message batches,
+│                 validate relationship consistency
+│
+└── persister (PersisterAgent)
+    Model: none (rule-based, no LLM)
+    Tools: upsert_fact, upsert_entity, create_episodic_link
+    Behavior: Stage 4 — outbox persist to Weaviate + Neo4j + MongoDB
+              (spec: 05-ingestion-pipeline.md, 08-resilience.md §12.5)
+```
+
+#### ✅ Implemented: Consolidation
+
+```
+Consolidation is orchestrated by services/consolidation.py (not a LoopAgent).
+It uses ADK LlmAgents for summarization:
+
+    create_summarizer() / create_topic_summarizer() / create_channel_summarizer()
+        Model: LLM_FAST_MODEL (default: gemini-2.5-flash)
+        Behavior: Generate cluster summaries (Tier 1) and channel summaries (Tier 0)
+```
+
+#### 🔧 Planned: Q&A Routing Agents
+
+The following agent hierarchy is the **design spec** for the Q&A feature. Only `agents/query/echo.py` (a test placeholder) currently exists.
+
+```
+[PLANNED] query_router_agent (Root LlmAgent)
+│   Model: LLM_FAST_MODEL
+│   Behavior: Query decomposition + understanding → route to semantic/graph/both
+│
+├── [PLANNED] parallel_retrieval (ParallelAgent)
+│   ├── [PLANNED] semantic_agent — 3-tier Weaviate retrieval
+│   └── [PLANNED] graph_agent   — Neo4j traversal + Weaviate enrichment
+│
+└── [PLANNED] response_agent — grounded response + citations
+    Model: LLM_QUALITY_MODEL
+```
+
+See [`04-query-router.md`](04-query-router.md) for the full design spec of these agents.
+
+### How Agents Use ADK Session State
+
+ADK `Session` objects persist state across the agent hierarchy within a single request:
+
+```python
+# query_router_agent writes routing decision to session state
+session.state["route"] = "both"
+session.state["query_understanding"] = {
+    "entities": ["Alice", "JWT"],
+    "topics": ["authentication"],
+    "semantic_depth": "topic",
+    "temporal_scope": "recent",
+}
+
+# parallel_retrieval reads state to decide which sub-agents to activate
+# semantic_agent writes results to session state
+session.state["semantic_results"] = [...]
+
+# graph_agent writes results to session state
+session.state["graph_results"] = [...]
+
+# response_agent reads both result sets from session state,
+# merges, deduplicates, and generates the final response
+```
+
+For the extraction pipeline, session state carries the message through stages:
+
+```python
+# preprocessor_agent
+session.state["preprocessed"] = {...}
+
+# fact_extractor_agent
+session.state["facts"] = [...]
+session.state["quality_scores"] = [...]
+
+# entity_extractor_agent
+session.state["entities"] = [...]
+session.state["relationships"] = [...]
+
+# persister_agent reads all of the above and writes to stores
+```
+
+### Key Changes from Original Design
+
+| Component | Original (docs 01-12) | ADK Implementation | Status |
+|-----------|----------------------|-------------------|--------|
+| Extraction | Pipeline orchestrator calling LLM directly | `SequentialAgent` chaining 6-stage sub-agents | ✅ Implemented |
+| Consolidation | Scheduled function calls | `LlmAgent` summarizers via `services/consolidation.py` | ✅ Implemented |
+| Query routing | `llm_provider.call("fast", prompt)` | `query_router_agent` with sub-agent delegation | 🔧 Planned |
+| Retrieval | Direct function calls to `semantic_retriever` / `graph_retriever` | `ParallelAgent` running `semantic_agent` + `graph_agent` concurrently | 🔧 Planned |
+| Response gen | `llm_provider.call("quality", prompt)` | `response_agent` reading from ADK session state | 🔧 Planned |
+| LLM model config | Hardcoded model names | `LLMProvider.resolve_model()` reads `LLM_FAST_MODEL` / `LLM_QUALITY_MODEL` env vars | ✅ Implemented |
+
+### ADK Tools
+
+Store operations are wrapped as ADK `FunctionTool` instances. Each tool is a thin wrapper around the corresponding store method — no business logic lives in the tool layer.
+
+| Tool | Wraps | Used By | Spec |
+|------|-------|---------|------|
+| `search_weaviate_hybrid` | `weaviate_store.search_hybrid()` | semantic_agent, graph_agent | [`02-semantic-memory.md`](02-semantic-memory.md) |
+| `get_tier0_summary` | `weaviate_store.get_tier0_summary()` | semantic_agent | [`02-semantic-memory.md`](02-semantic-memory.md) |
+| `get_tier1_clusters` | `weaviate_store.get_tier1_clusters()` | semantic_agent, consolidation | [`02-semantic-memory.md`](02-semantic-memory.md) |
+| `traverse_neo4j` | `neo4j_store.traverse()` | graph_agent | [`03-graph-memory.md`](03-graph-memory.md) |
+| `temporal_chain` | `neo4j_store.temporal_chain()` | graph_agent | [`03-graph-memory.md`](03-graph-memory.md) |
+| `comprehensive_traverse` | `neo4j_store.comprehensive_traverse()` | graph_agent | [`03-graph-memory.md`](03-graph-memory.md) |
+| `get_episodic_weaviate_ids` | `neo4j_store.get_episodic_weaviate_ids()` | graph_agent | [`03-graph-memory.md`](03-graph-memory.md) |
+| `search_tavily` | `external_search.search()` | query_router | [`04-query-router.md`](04-query-router.md) |
+| `upsert_fact` | `weaviate_store.upsert_fact()` | persister_agent | [`05-ingestion-pipeline.md`](05-ingestion-pipeline.md) |
+| `upsert_entity` | `neo4j_store.upsert_entity()` | persister_agent | [`03-graph-memory.md`](03-graph-memory.md) |
+| `create_episodic_link` | `neo4j_store.create_episodic_link()` | persister_agent | [`03-graph-memory.md`](03-graph-memory.md) |
+
+### Model Configuration
+
+Each agent is configured with a model tier. LiteLLM handles transparent fallback when the primary model is unavailable (timeout, rate limit, circuit breaker open). See [`08-resilience.md`](08-resilience.md) for the full fallback chain per agent.
+
+Models are configured via env vars and resolved by `LLMProvider.resolve_model()` in `src/beever_atlas/llm/provider.py`.
+
+| Agent Tier | Env Var | Default Value | Agents |
+|-----------|---------|---------------|--------|
+| Fast | `LLM_FAST_MODEL` | `gemini-2.5-flash` | fact_extractor, entity_extractor, cross_batch_validator, summarizers |
+| Quality | `LLM_QUALITY_MODEL` | `gemini-2.5-flash` | wiki compiler (WikiCompiler) |
+| None | — | — | preprocessor, persister, embedder (rule-based / external API) |
+
+### ADK Runner Integration with FastAPI
+
+The FastAPI server creates an ADK `Runner` at startup and uses it to handle all requests:
+
+```python
+from google.adk.runners import Runner
+from google.adk.sessions import InMemorySessionService
+
+# At startup
+session_service = InMemorySessionService()
+runner = Runner(
+    agent=query_router_agent,
+    app_name="beever_atlas",
+    session_service=session_service,
+)
+
+# Per request (in api_routes.py)
+@app.post("/api/search")
+async def search(request: SearchRequest):
+    session = await session_service.create_session(
+        app_name="beever_atlas",
+        user_id=request.state.user_id,
+    )
+    response = await runner.run_async(
+        session_id=session.id,
+        user_id=request.state.user_id,
+        new_message=Content(parts=[Part(text=request.question)]),
+    )
+    return format_ask_response(response)
+```
+
+The same `Runner` serves both MCP tool calls and REST API requests — the agent hierarchy is the single entry point for all LLM-powered operations.
+
+### Observability
+
+ADK agents emit OpenTelemetry spans automatically for each agent invocation, tool call, and model request. These integrate with the existing telemetry pipeline in [`09-observability.md`](09-observability.md) — no additional instrumentation is needed.
+
+---
+
+## Vercel Chat SDK Bot
+
+A TypeScript service (`bot/`) provides real-time chat across Slack, Teams, and Discord.
+
+### Architecture
+
+```
+User → Slack/Teams/Discord
+         ↓ (real-time events)
+    Chat SDK Bot (TypeScript)
+         ↓ (REST API calls)
+    FastAPI + ADK Runner
+         ↓
+    ADK Agents → Stores
+```
+
+### Event Handlers
+
+| Event | Handler | Action |
+|-------|---------|--------|
+| `onNewMention` | Subscribe to thread, query backend | Posts Card with answer + citations |
+| `onSubscribedMessage` | Process follow-up in thread | Posts answer in thread |
+| `onAction("refresh_wiki")` | Call wiki refresh API | Posts confirmation |
+| `onAction("sync_channel")` | Call sync API | Posts job ID |
+
+### Platform Adapters
+
+| Platform | Package | State |
+|----------|---------|-------|
+| Slack | `@chat-adapter/slack` | Redis |
+| Teams | `@chat-adapter/teams` | Redis |
+| Discord | `@chat-adapter/discord` | Redis |
+
+### Relationship to Python Adapters
+
+The Python `SlackAdapter` (in `src/beever_atlas/adapters/`) handles **batch historical message ingestion** — fetching message history for initial sync.
+
+The Chat SDK bot handles **real-time conversational interaction** — responding to mentions, follow-ups, and action buttons.
+
+Both are needed: batch adapters build the knowledge base, the chat bot surfaces it.
+
+---
+
+## Infrastructure Additions
+
+| Service | Image | Purpose |
+|---------|-------|---------|
+| `redis` | `redis:7-alpine` | Chat SDK conversation state |
+| `bot` | Custom (Node.js) | Chat SDK bot service |
+
+---
+
+## References
+
+- [Google ADK Documentation](https://google.github.io/adk-docs/)
+- [Vercel Chat SDK](https://chat-sdk.dev/)
+- [Chat SDK Adapters](https://chat-sdk.dev/docs/adapters)
diff --git a/docs/v2/README.md b/docs/v2/README.md
new file mode 100644
index 00000000..bf9c39d4
--- /dev/null
+++ b/docs/v2/README.md
@@ -0,0 +1,53 @@
+# Beever Atlas v2 — Documentation Index
+
+Beever Atlas v2 is a dual-memory knowledge retrieval system for Slack, Teams, and Discord. It combines semantic memory (Weaviate) for fast factual and topic queries with graph memory (Neo4j) for relational and temporal queries, routed by an LLM-powered smart router. The system ingests messages from any supported platform, builds a persistent knowledge base, and surfaces grounded answers with citations.
+
+**Core Frameworks**: [Google ADK](https://google.github.io/adk-docs/) for agent orchestration, [Vercel Chat SDK](https://chat-sdk.dev/) for real-time chat bot, FastAPI for backend, React 19 + shadcn/ui for frontend.
+
+**Key Infrastructure**: Weaviate (semantic), Neo4j + APOC (graph), MongoDB (state), Redis (sessions), Jina v4 (embeddings), Gemini Flash / Claude (LLMs via LiteLLM), Tavily (web search), OpenTelemetry (observability).
+
+---
+
+## Documents
+
+| # | File | Description |
+|---|------|-------------|
+| 1 | [01-architecture-overview.md](01-architecture-overview.md) | System overview, dual-memory design principle, memory interconnection |
+| 2 | [02-semantic-memory.md](02-semantic-memory.md) | Weaviate 3-tier design, schema, retrieval improvements |
+| 3 | [03-graph-memory.md](03-graph-memory.md) | Neo4j flexible schema, entity scoping, traversal methods |
+| 4 | [04-query-router.md](04-query-router.md) | Smart routing, query decomposition, external search |
+| 5 | [05-ingestion-pipeline.md](05-ingestion-pipeline.md) | Multi-platform adapters, 7-stage pipeline, quality gates |
+| 6 | [06-wiki-generation.md](06-wiki-generation.md) | Wiki template, consolidation, caching |
+| 7 | [07-deployment.md](07-deployment.md) | Docker Compose, MCP tools, module structure |
+| 8 | [08-resilience.md](08-resilience.md) | Degradation matrix, circuit breakers, outbox pattern |
+| 9 | [09-observability.md](09-observability.md) | Health endpoints, metrics, tracing, backups |
+| 10 | [10-access-control.md](10-access-control.md) | Channel ACL, authentication |
+| 11 | [11-frontend-design.md](11-frontend-design.md) | Web dashboard UI/UX *(NEW)* |
+| 12 | [12-api-design.md](12-api-design.md) | MCP server + REST API interface spec *(NEW)* |
+| 13 | [13-adk-integration.md](13-adk-integration.md) | Google ADK agents + Vercel Chat SDK bot *(NEW)* |
+| — | [decisions.md](decisions.md) | Key design decisions, open questions, research papers |
+| — | [weakness-resolution-map.md](weakness-resolution-map.md) | v1 → v2 weakness fix mapping |
+
+---
+
+## Suggested Reading Order
+
+**For implementation:**
+
+1. `01-architecture-overview.md` — understand the system shape before touching anything else
+2. `13-adk-integration.md` — the ADK agent architecture that orchestrates all LLM operations
+3. `02-semantic-memory.md` + `03-graph-memory.md` — the two storage backends (can be read in parallel)
+4. `05-ingestion-pipeline.md` — how data flows into both backends
+5. `04-query-router.md` — how queries are dispatched
+6. `06-wiki-generation.md` — the user-facing output layer
+7. `07-deployment.md` — running the stack locally
+8. `08-resilience.md` + `09-observability.md` — production hardening
+9. `10-access-control.md` — multi-tenant security
+10. `11-frontend-design.md` + `12-api-design.md` — client-facing surface
+11. `decisions.md` + `weakness-resolution-map.md` — context for why things are designed the way they are
+
+---
+
+## v1 Archive
+
+The original monolith proposal and v1 codebase notes are in [`../v1-archive/`](../v1-archive/).
diff --git a/docs/v2/current-architecture-diagram.md b/docs/v2/current-architecture-diagram.md
new file mode 100644
index 00000000..06ed2460
--- /dev/null
+++ b/docs/v2/current-architecture-diagram.md
@@ -0,0 +1,432 @@
+# Beever Atlas v2 — Architecture Diagram
+
+> Last updated: 2026-03-31 (M3+ implementation — media nodes, entity-facts, graph filtering)
+
+## System Overview
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│                        BEEVER ATLAS v2 — SYSTEM ARCHITECTURE               │
+│              M3+: Ingest & Store (Dual Memory) + Dashboard + Media Graph   │
+└─────────────────────────────────────────────────────────────────────────────┘
+
+┌─ FRONTEND (React 19 + Vite + TailwindCSS) ─── web/src/ (54 files) ─────────┐
+│                                                                              │
+│  Pages                          Hooks                  Components            │
+│  ├── Dashboard.tsx              ├── useSync.ts         ├── dashboard/        │
+│  │   ├── StatCards              ├── useStats.ts        │   ├── StatCards.tsx  │
+│  │   └── ActivityFeed           ├── useGraph.ts        │   └── ActivityFeed  │
+│  ├── Channels.tsx               ├── useMemories.ts     ├── channel/          │
+│  ├── ChannelWorkspace.tsx       ├── useAsk.ts          │   ├── SyncButton    │
+│  │   ├── Wiki tab               ├── useEntityFacts.ts  │   ├── SyncProgress  │
+│  │   ├── Ask tab (SSE)          └── useTheme.ts        │   ├── MessagesTab   │
+│  │   ├── Messages tab                                  │   └── AskTab        │
+│  │   ├── Memories tab ──────── real Weaviate data      ├── memories/         │
+│  │   ├── Graph tab ────────── cytoscape.js + sidebar   │   ├── TierBrowser   │
+│  │   └── SyncButton + Progress                         │   ├── FactCard      │
+│  ├── GraphExplorer.tsx                                 │   ├── ClusterCard   │
+│  ├── ActivityPage.tsx                                  │   ├── MemoryFilters │
+│  ├── SettingsPage.tsx                                  │   └── SummaryCard   │
+│  └── SearchPage.tsx                                    └── graph/            │
+│                                                            ├── GraphCanvas   │
+│              Polls /api/channels/:id/sync/status           ├── GraphTab      │
+│              Entity panel: Details + Facts tabs             ├── EntityPanel   │
+│              Media modal: click-to-enlarge images           ├── GraphFilters  │
+│                                                            └── MediaModal    │
+└──────────────────────────┬───────────────────────────────────────────────────┘
+                           │ HTTP (REST + SSE)
+┌──────────────────────────▼───────────────────────────────────────────────────┐
+│  BACKEND API (FastAPI) ──── src/beever_atlas/api/ ───────────────────────────│
+│                                                                              │
+│  POST /api/channels/:id/sync ──────── Trigger sync job (auto|full|incr)     │
+│  GET  /api/channels/:id/sync/status ─ Poll progress (idle|syncing|error)    │
+│  GET  /api/channels/:id/memories ──── Paginated atomic facts (Weaviate)     │
+│  GET  /api/channels/:id/memories/:id  Single fact + graph entity enrichment │
+│  GET  /api/graph/entities ──────────── List entities (Neo4j, channel filter) │
+│  GET  /api/graph/relationships ─────── List relationships (channel filter)  │
+│  GET  /api/graph/entities/:id/neighbors  N-hop subgraph (1-5 hops)         │
+│  GET  /api/graph/media ─────────────── List media nodes (Neo4j)             │
+│  GET  /api/graph/decisions/:channel ── Decision timeline                    │
+│  GET  /api/stats ──────────────────── Aggregate counts (all stores)         │
+│  GET  /api/activity ───────────────── Recent sync events                    │
+│  GET  /api/sync-history ───────────── Sync job history                      │
+│  POST /api/channels/:id/ask ───────── SSE streaming Q&A (echo agent)       │
+│  DELETE /api/channels/:id/data ─────── Clear synced data (all stores)       │
+│  GET  /api/channels/:id/threads/:tid/messages ── Thread replies             │
+│  GET  /api/files/proxy ────────────── Proxy Slack file downloads            │
+│  GET  /api/health ─────────────────── Component health checks              │
+└──────────────────────────┬───────────────────────────────────────────────────┘
+                           │
+┌──────────────────────────▼───────────────────────────────────────────────────┐
+│  SERVICES (Orchestration) ──── src/beever_atlas/services/ ──────────────────│
+│                                                                              │
+│  SyncRunner                    BatchProcessor           WriteReconciler      │
+│  ├── start_sync(channel_id)    ├── process_messages()   ├── run_once()      │
+│  ├── _fetch_all_messages()     ├── chunk into batches    ├── retry failed    │
+│  │   (cursor pagination        ├── create ADK session   │   Weaviate/Neo4j  │
+│  │    >500 msg support)        ├── run pipeline          │   writes          │
+│  ├── _run_sync() (background)  └── update progress      └── start_loop()   │
+│  └── shutdown() (graceful)                                   (every 15min)  │
+│                                                                              │
+│  MediaProcessor                                                             │
+│  ├── process_media()  ── text-first vision routing                          │
+│  └── handles images, PDFs, videos from Slack attachments                    │
+└──────────────────────────┬───────────────────────────────────────────────────┘
+                           │
+┌──────────────────────────▼───────────────────────────────────────────────────┐
+│  AGENTS (Google ADK) ──── src/beever_atlas/agents/ ─────────────────────────│
+│                                                                              │
+│  agents/ingestion/pipeline.py ── create_ingestion_pipeline()                │
+│  ┌─────────────────────────────────────────────────────────────┐            │
+│  │  SequentialAgent("ingestion_pipeline")                      │            │
+│  │  ├── PreprocessorAgent ──── BaseAgent (stage 1, no LLM)    │            │
+│  │  │   ├── filters bots/system messages                      │            │
+│  │  │   ├── strips Slack mrkdwn                               │            │
+│  │  │   └── extracts media URLs + link URLs from attachments  │            │
+│  │  ├── ParallelAgent ─────── stages 2+3 run concurrently     │            │
+│  │  │   ├── fact_extractor ── LlmAgent (Flash Lite)           │            │
+│  │  │   └── entity_extractor  LlmAgent (Flash Lite)           │            │
+│  │  ├── classifier ────────── LlmAgent (Flash Lite, stage 4)  │            │
+│  │  ├── EmbedderAgent ─────── BaseAgent (Jina API, stage 5)   │            │
+│  │  ├── cross_batch_validator  LlmAgent (Flash, stage 6)      │            │
+│  │  └── PersisterAgent ────── BaseAgent (outbox, stage 7)     │            │
+│  │      ├── writes facts → Weaviate                           │            │
+│  │      ├── writes entities/relationships → Neo4j             │            │
+│  │      ├── creates Media nodes → Neo4j                       │            │
+│  │      ├── reconciles entity↔media via fact references       │            │
+│  │      └── creates stub entities for unmatched references    │            │
+│  └─────────────────────────────────────────────────────────────┘            │
+│                                                                              │
+│  agents/prompts/    ── Prompt templates (5 files, independently editable)   │
+│  agents/schemas/    ── Pydantic output models (3 files, reusable)          │
+│  agents/callbacks/  ── Quality gates (configurable thresholds)             │
+│  agents/query/echo.py ── Echo agent (M2, replaced by retrieval in M4)     │
+│                                                                              │
+│  llm/provider.py ──── LLMProvider (fast/quality tiers, centralized)        │
+└──────────────────────────┬───────────────────────────────────────────────────┘
+                           │
+┌──────────────────────────▼───────────────────────────────────────────────────┐
+│  DATA STORES ──── src/beever_atlas/stores/ ─────────────────────────────────│
+│                                                                              │
+│  ┌─────────────────┐  ┌──────────────────┐  ┌────────────────────────┐     │
+│  │  WeaviateStore   │  │  Neo4jStore       │  │  MongoDBStore          │     │
+│  │  (Semantic)      │  │  (Graph)          │  │  (State)               │     │
+│  │                  │  │                   │  │                        │     │
+│  │  MemoryFact      │  │  :Entity nodes    │  │  sync_jobs             │     │
+│  │  collection      │  │  :Event nodes     │  │  channel_sync_state    │     │
+│  │                  │  │  :Media nodes     │  │  write_intents (outbox)│     │
+│  │  Named vectors:  │  │  Relationships    │  │  activity_events       │     │
+│  │  text_vector     │  │                   │  │                        │     │
+│  │  (2048-dim Jina) │  │  Episodic linking │  │  Reconciler retries    │     │
+│  │                  │  │  Entity→Event→    │  │  pending intents       │     │
+│  │  Hybrid search   │  │  Weaviate fact    │  │                        │     │
+│  │  BM25 + vector   │  │                   │  │  Channel data deletion │     │
+│  │                  │  │  REFERENCES_MEDIA │  │  (sync state cleanup)  │     │
+│  │  Channel-level   │  │  Entity↔Media     │  │                        │     │
+│  │  fact deletion   │  │                   │  │                        │     │
+│  │                  │  │  Channel filtering │  │                        │     │
+│  │  Media fields:   │  │  via episodic     │  │                        │     │
+│  │  source_media_*  │  │  links            │  │                        │     │
+│  │  source_link_*   │  │                   │  │                        │     │
+│  │                  │  │  APOC fuzzy match │  │                        │     │
+│  └────────┬─────────┘  └────────┬──────────┘  └───────────┬────────────┘     │
+│           │                     │                          │                 │
+│  EntityRegistry (alias resolution, backed by Neo4j)        │                 │
+│  StoreClients (singleton, FastAPI lifespan lifecycle)       │                 │
+└───────────┼─────────────────────┼──────────────────────────┼─────────────────┘
+            │                     │                          │
+   ┌────────▼────────┐  ┌────────▼────────┐  ┌──────────────▼──────┐
+   │  Weaviate 1.28  │  │  Neo4j 5.26     │  │  MongoDB 7.0       │
+   │  :8080          │  │  + APOC         │  │  :27017             │
+   │  (Docker)       │  │  :7687 (Docker) │  │  (Docker)           │
+   └─────────────────┘  └─────────────────┘  └────────────────────┘
+```
+
+## Bot Service
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│  BOT SERVICE (TypeScript + Chat SDK) ──── bot/src/ (8 files) ──────────────│
+│                                                                              │
+│  index.ts ─── Slack bot: @mention → askBackend() → SSE → Slack reply       │
+│  bridge.ts ── REST gateway: /bridge/channels, /bridge/messages             │
+│               /bridge/files (proxy Slack file downloads)                    │
+│               Extracts links, media, unfurls, reactions from Slack API     │
+│               Resolves user profiles (parallel, concurrency=8)             │
+│               Python backend fetches Slack data through this bridge         │
+│  sse-client.ts ── Consumes SSE from Python backend                         │
+│  formatter.ts ── Slack Block Kit message formatting                        │
+│  slack-mrkdwn.ts ── Slack mrkdwn parsing/stripping                         │
+│  *.test.ts ── Unit tests for formatter, sse-client, slack-mrkdwn          │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
+
+## Data Flow: Channel Sync
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│  User clicks "Sync Channel"                                                │
+│                                                                              │
+│  Dashboard → POST /api/channels/:id/sync                                   │
+│    → SyncRunner.start_sync(channel_id)                                     │
+│      → Fetch messages via Bridge API (cursor pagination, max 1000)         │
+│        → Bridge extracts: text, attachments, links, unfurls, reactions     │
+│      → Batch into groups of ~50 (thread-aware grouping)                    │
+│      → For each batch:                                                     │
+│          → ADK Session (state: messages, channel, known_entities)           │
+│          → SequentialAgent runs 7 stages:                                  │
+│            1. Preprocess (filter bots, detect modality, extract media/link │
+│               URLs from attachments and raw_metadata)                      │
+│            2. Extract facts (LLM, quality gate < 0.5)      ┐ parallel     │
+│            3. Extract entities (LLM, confidence gate < 0.6) ┘              │
+│            4. Classify (topic tags, importance)                            │
+│            5. Embed (Jina v4 batch API, 2048-dim)                         │
+│            6. Cross-batch validate (alias resolution, consistency)         │
+│            7. Persist:                                                     │
+│               a. Write facts → Weaviate (with media/link metadata)        │
+│               b. Write entities/relationships → Neo4j                     │
+│               c. Create Media nodes → Neo4j (with original filenames)     │
+│               d. Reconcile entity↔media via fact entity references        │
+│               e. Create stub entities for unmatched references            │
+│               f. Outbox pattern via MongoDB WriteIntents                  │
+│          → Update SyncJob progress in MongoDB                              │
+│      → Log activity event                                                  │
+│  Dashboard polls status → progress bar updates                             │
+│  After sync → Memories tab shows real facts, Graph tab shows entities     │
+│             → Media nodes visible in graph, click to enlarge               │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
+
+## Data Flow: Graph Visualization
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│  User opens Graph tab for a channel                                        │
+│                                                                              │
+│  GraphTab → useGraph(channelId)                                            │
+│    → GET /api/graph/entities?channel_id=...                                │
+│    → GET /api/graph/relationships?channel_id=...                           │
+│    → GET /api/graph/media?channel_id=...                                   │
+│    → Neo4j filters entities/relationships via episodic links               │
+│    → Media nodes deduplicated by URL                                       │
+│                                                                              │
+│  GraphCanvas (cytoscape.js)                                                │
+│    ├── Renders entity nodes (colored by type)                              │
+│    ├── Renders media nodes (distinct color scheme)                         │
+│    ├── Renders REFERENCES_MEDIA + entity relationships as edges            │
+│    ├── useRef pattern for fresh callbacks (avoids stale closures)          │
+│    └── On node click → opens EntityPanel sidebar                           │
+│                                                                              │
+│  EntityPanel (tabbed sidebar)                                              │
+│    ├── Details tab: entity properties, type, aliases                       │
+│    └── Facts tab: useEntityFacts(entityName) → Weaviate search            │
+│         └── Displays related atomic memories for selected entity           │
+│                                                                              │
+│  MediaModal                                                                │
+│    ├── Triggered by clicking media nodes in graph                          │
+│    └── Triggered by clicking image thumbnails in FactCard                  │
+│         → Full-size image lightbox with close on backdrop click            │
+│                                                                              │
+│  GraphFilters                                                              │
+│    └── Toggle visibility by entity type (Person, Decision, Project,        │
+│        Technology, Media, etc.) with color-coded legend                    │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
+
+## Backend Directory Structure
+
+```
+src/beever_atlas/
+├── models/                     # Domain + persistence + API models
+│   ├── domain.py              # AtomicFact, GraphEntity, GraphRelationship, Subgraph
+│   │                          # Media fields: source_media_urls/names, source_link_urls/titles
+│   ├── persistence.py         # SyncJob, ChannelSyncState, WriteIntent, ActivityEvent
+│   └── api.py                 # MemoryFilters, PaginatedFacts, HealthResponse
+│
+├── agents/                     # Agent definitions (WHAT they do)
+│   ├── ingestion/             # 7-stage ingestion pipeline
+│   │   ├── pipeline.py        # create_ingestion_pipeline() — SequentialAgent wiring
+│   │   ├── preprocessor.py    # BaseAgent — stage 1 (no LLM, extracts media/link URLs)
+│   │   ├── fact_extractor.py  # Factory → LlmAgent (Flash Lite)
+│   │   ├── entity_extractor.py# Factory → LlmAgent (Flash Lite)
+│   │   ├── classifier.py      # Factory → LlmAgent (Flash Lite)
+│   │   ├── embedder.py        # BaseAgent — stage 5 (Jina API)
+│   │   ├── cross_batch_validator.py # Factory → LlmAgent (Flash)
+│   │   └── persister.py       # BaseAgent — stage 7 (outbox writes + media nodes
+│   │                          #   + entity reconciliation + stub entity creation)
+│   ├── query/                 # Retrieval agents (M4 ready)
+│   │   └── echo.py            # create_echo_agent() — current root agent
+│   ├── prompts/               # Prompt templates (separated from agents)
+│   │   ├── fact_extractor.py, entity_extractor.py, classifier.py
+│   │   ├── cross_batch_validator.py, echo.py
+│   ├── schemas/               # Pydantic output schemas for LLM agents
+│   │   ├── extraction.py, classification.py, validation.py
+│   ├── callbacks/             # Quality gates & post-processing
+│   │   └── quality_gates.py
+│   ├── tools.py               # ADK FunctionTool stubs (M4)
+│   └── runner.py              # ADK Runner + session helpers
+│
+├── llm/                        # LLM provider abstraction
+│   └── provider.py            # LLMProvider (fast/quality tiers)
+│
+├── services/                   # Orchestration layer
+│   ├── batch_processor.py     # Batch chunking + ADK runner loop
+│   ├── sync_runner.py         # Background sync job lifecycle
+│   ├── media_processor.py     # Text-first vision routing for multimodal media
+│   └── reconciler.py          # Failed write retry (every 15min)
+│
+├── stores/                     # Data store clients
+│   ├── weaviate_store.py      # Semantic memory (3-tier, hybrid search, channel deletion)
+│   ├── neo4j_store.py         # Knowledge graph (entities, media nodes, episodic
+│   │                          #   channel filtering, REFERENCES_MEDIA edges)
+│   ├── mongodb_store.py       # State (sync jobs, outbox, activity, channel cleanup)
+│   └── entity_registry.py     # Alias resolution (backed by Neo4j)
+│
+├── api/                        # REST endpoints
+│   ├── ask.py                 # SSE streaming Q&A
+│   ├── channels.py            # Channel CRUD + messages + threads + file proxy
+│   │                          #   + DELETE channel data (all stores)
+│   ├── sync.py                # Sync trigger + progress
+│   ├── memories.py            # Atomic facts CRUD
+│   ├── graph.py               # Entity/relationship/media listing + subgraph
+│   └── stats.py               # Aggregate stats + activity feed + sync history
+│
+├── adapters/                   # Platform adapters (Slack via bot bridge)
+│   ├── base.py                # BaseAdapter, NormalizedMessage, ChannelInfo
+│   ├── bridge.py              # ChatBridgeAdapter (calls bot /bridge/*)
+│   └── mock.py                # MockAdapter (JSON fixtures)
+│
+├── infra/                      # Configuration + cross-cutting
+│   ├── config.py              # Settings (all env vars, centralized)
+│   └── health.py              # Health checks (Weaviate, Neo4j, MongoDB, Redis)
+│
+└── server/                     # FastAPI app
+    └── app.py                 # App creation, lifespan, CORS, routers
+```
+
+## File Counts
+
+| Layer | Files | Purpose |
+|-------|-------|---------|
+| Backend (Python) | 64 | API, agents, stores, services, models |
+| Frontend (TS/TSX) | 54 | React pages, hooks, components |
+| Bot (TypeScript) | 8 | Slack bot, bridge, SSE client, tests |
+| **Total** | **126** | |
+
+## Neo4j Graph Schema
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│  NODE TYPES                                                                 │
+│                                                                              │
+│  :Entity                          :Event                    :Media          │
+│  ├── name (indexed)               ├── weaviate_id           ├── url (idx)  │
+│  ├── type (indexed)               ├── timestamp             ├── type       │
+│  │   (Person, Decision,           ├── channel_id            ├── channel_id │
+│  │    Project, Technology)        └── description           └── msg_id     │
+│  ├── scope (global|channel)                                                │
+│  ├── aliases[]                                                              │
+│  └── properties (JSON)                                                      │
+│                                                                              │
+│  RELATIONSHIP TYPES                                                         │
+│                                                                              │
+│  (:Entity)-[:DECIDED]->(:Entity)          Decision relationships           │
+│  (:Entity)-[:WORKS_ON]->(:Entity)         Assignment / ownership           │
+│  (:Entity)-[:USES]->(:Entity)             Technology usage                 │
+│  (:Entity)-[:LINKS]->(:Event)             Episodic linking                 │
+│  (:Entity)-[:REFERENCES_MEDIA]->(:Media)  Entity↔Media connections         │
+│                                                                              │
+│  CHANNEL FILTERING                                                          │
+│  Entities filtered per-channel via episodic links:                          │
+│  Entity→[:LINKS]→Event(channel_id) ensures graph shows only                │
+│  entities relevant to the selected channel                                  │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
+
+## Weaviate Fact Schema
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│  Collection: MemoryFact                                                     │
+│                                                                              │
+│  Core Fields               Source Fields              Tagging               │
+│  ├── memory_text           ├── source_message_id      ├── topic_tags[]     │
+│  ├── quality_score         ├── message_ts             ├── entity_tags[]    │
+│  ├── tier (atomic)         ├── thread_ts              ├── action_tags[]    │
+│  ├── importance            ├── author_id              └── graph_entity_ids[]│
+│  └── text_vector (2048d)   └── channel_id                                  │
+│                                                                              │
+│  Media Fields              Link Fields                Temporal              │
+│  ├── source_media_urls[]   ├── source_link_urls[]     ├── valid_at         │
+│  ├── source_media_names[]  ├── source_link_titles[]   └── invalid_at       │
+│  ├── source_media_type     └── source_link_descs[]                         │
+│  │   (image/pdf/video)                                                      │
+│                                                                              │
+│  Search: Hybrid BM25 + vector similarity                                   │
+│  Filtering: channel, topic, entity, importance, date range                 │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
+
+## Milestone Progress
+
+| Milestone | Status | Description |
+|-----------|--------|-------------|
+| M1: Skeleton & Health Pulse | Done | Bot, Chat SDK, health checks, React scaffold |
+| M2: Chat Bot + Echo Query | Done | Echo agent, SSE streaming, bridge API |
+| M3: Ingest & Store + Dashboard | Done | 7-stage pipeline, dual stores, full dashboard |
+| **M3+: Media & Graph Enhancements** | **Done** | **Media nodes, entity-facts sidebar, channel filtering, image lightbox** |
+| M4: Smart Retrieval & Response | Next | Query router, retrieval agents, Ask tab with real answers |
+| M5: Consolidation, Wiki & Tiers | Planned | Tier 0/1 generation, wiki builder |
+| M6: Contradictions & Retrieval Polish | Planned | Contradiction detection, query decomposition |
+| M7: Resilience, Observability & ACL | Planned | Circuit breakers, metrics, access control |
+| M8: Multi-Platform & Production | Planned | Teams, Discord, OAuth, production polish |
+
+## Technology Stack
+
+| Layer | Technology |
+|-------|-----------|
+| Agent Framework | Google ADK (Python) — SequentialAgent, ParallelAgent, LlmAgent |
+| LLM (fast) | Gemini 2.0 Flash Lite (extraction, classification) |
+| LLM (quality) | Gemini 2.0 Flash (cross-batch validation) |
+| Embeddings | Jina v4 (2048-dim, multimodal) |
+| Semantic Store | Weaviate 1.28 (hybrid BM25 + vector search) |
+| Graph Store | Neo4j 5.26 + APOC (flexible entity schema, media nodes) |
+| State Store | MongoDB 7.0 (sync state, outbox pattern) |
+| Session Cache | Redis 7 (Chat SDK state) |
+| Backend | FastAPI + Pydantic 2 |
+| Frontend | React 19 + Vite + TailwindCSS + shadcn/ui |
+| Graph Viz | cytoscape.js (with EntityPanel sidebar + MediaModal) |
+| Bot | Vercel Chat SDK + @chat-adapter/slack |
+
+## Key Design Patterns
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│  DESIGN PATTERNS                                                            │
+│                                                                              │
+│  Outbox Pattern (Durability)                                                │
+│  Write to MongoDB WriteIntent first, then dispatch to Weaviate/Neo4j.      │
+│  WriteReconciler retries failed writes every 15 minutes.                   │
+│                                                                              │
+│  Episodic Channel Filtering                                                 │
+│  Entities scoped to channels via Entity→[:LINKS]→Event(channel_id).        │
+│  Graph API filters entities, relationships, and counts per channel.        │
+│                                                                              │
+│  Entity-Media Reconciliation                                                │
+│  PersisterAgent creates Media nodes in Neo4j, then links them to entities  │
+│  referenced in the same fact. Creates stub entities when no match exists.  │
+│                                                                              │
+│  useRef Callback Pattern (Frontend)                                         │
+│  GraphCanvas uses useRef to keep cytoscape tap handlers fresh without      │
+│  destroying the expensive cytoscape instance on re-renders.                │
+│                                                                              │
+│  Thread-Aware Batching                                                      │
+│  BatchProcessor groups messages with their thread replies to avoid          │
+│  splitting conversations across batches.                                   │
+│                                                                              │
+│  Dual Memory Query                                                          │
+│  Semantic (Weaviate) for ~80% of queries (factual, topic-based).           │
+│  Graph (Neo4j) for ~20% of queries (relational, temporal, decisions).      │
+│  Future: Smart router LLM to choose.                                       │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
diff --git a/docs/v2/decisions.md b/docs/v2/decisions.md
new file mode 100644
index 00000000..ff13cb40
--- /dev/null
+++ b/docs/v2/decisions.md
@@ -0,0 +1,64 @@
+# Design Decisions, Open Questions & Research
+
+> **Purpose**: Architectural rationale, open questions, and research paper integration for Beever Atlas v2.
+> Sourced from the v1 monolith (§7, §10, §11, Sources). Complements the individual v2 docs — read alongside `01-architecture-overview.md`.
+
+---
+
+## Key Design Decisions
+
+| Decision | Choice | Rationale | Rejected Alternative |
+|----------|--------|-----------|---------------------|
+| Memory architecture | Dual (Weaviate + Neo4j) | Each does what it's best at — semantic vs. relational | Neo4j only (can't do hybrid BM25+vector), Weaviate only (can't do multi-hop graph) |
+| Weaviate tiers | Keep 3 tiers, fix bugs | Sound design; Tier 0+1 give free reads (wiki-first); just needs working cluster linking | Remove tiers (loses free wiki reads, loses topic scoping) |
+| Graph schema | Guided-flexible | Core types + LLM creates extensions; captures any relationship | Fixed schema (misses Budget, Team, Meeting...), Full triplets (too noisy) |
+| Relationships | Fully flexible | LLM extracts whatever verb phrase captures the meaning | Fixed relationship list (can't capture BLOCKED_BY, POSTPONED_UNTIL...) |
+| Query routing | Hybrid (route OR parallel) | Semantic-first saves cost (80%); parallel for ambiguous | Pure router (misclassification), Pure parallel (wasteful) |
+| Multi-platform | Python adapters | Chat SDK is TS-only, can't fetch history | Chat SDK only (no batch history) |
+| Quality gate | Reject at extraction | Prevent garbage from entering system | Post-hoc cleanup (harder) |
+| Cluster linking | Actually write cluster_id | v1's biggest bug — no-op | Keep as no-op (breaks everything) |
+| Agent framework | [Google ADK](https://google.github.io/adk-docs/) | Native agent orchestration (Sequential, Parallel, Loop), LiteLLM fallback, session state, FunctionTool wrapping of store operations. See [`13-adk-integration.md`](13-adk-integration.md) | Direct LLM calls (no orchestration, manual retry logic), LangChain (heavier abstraction, more dependencies) |
+| Chat bot | [Vercel Chat SDK](https://chat-sdk.dev/) | Multi-platform real-time chat (Slack, Teams, Discord) with adapter pattern, action buttons, Redis state. See [`13-adk-integration.md`](13-adk-integration.md) | Custom webhook handlers per platform (more code), Slack Bolt only (single platform) |
+
+See `04-query-router.md` for routing strategy detail. See `02-semantic-memory.md` for tier and cluster linking implementation. See `05-ingestion-pipeline.md` for quality gate implementation.
+
+---
+
+## Open Questions
+
+1. **Entity extraction cost**: ~$0.001/message for flash-lite. 10K messages = ~$10 initial sync. Acceptable?
+2. **Graph type normalization**: How aggressively should we merge "Team"/"Group"/"Squad" into one type? LLM pass or rule-based?
+3. ~~**Consolidation frequency**~~: **RESOLVED** — Three triggers: after sync (incremental), daily 2 AM UTC (full), on-demand API. See `06-wiki-generation.md`.
+4. ~~**MCP surface**~~: **RESOLVED** — Graph queries abstracted behind `ask_questions`. 7 tools defined. See `07-deployment.md`.
+5. **Chat SDK bridge**: Worth building the TypeScript webhook service for real-time ingestion in Phase 2?
+6. **Decomposition threshold**: When should queries be decomposed vs. sent as-is? Token length? LLM confidence?
+
+---
+
+## Research Paper Integration
+
+| Paper | Core Insight | How v2 Uses It |
+|-------|-------------|----------------|
+| **GraphRAG (Weaviate+Neo4j)** | Hybrid vector-graph search | Dual memory: Weaviate for semantic, Neo4j for relational |
+| **H-MEM** | 4-layer hierarchical memory | 3-tier Weaviate (summary→topic→atomic) with fixes |
+| **System-1/System-2 Routing** | Dual-process retrieval | Smart router: semantic (fast) / graph (deep) / both |
+| **Ebbinghaus Forgetting** | R = e^(-t/S) | Applied to retrieval ranking (actually wired in v2) |
+| **MemoryBank** | Nightly distillation | Scheduled consolidation: clusters + summaries + wiki |
+| **Dynamic Knowledge Graphs** | Episodic edges + fact replacement | Event nodes linking Neo4j↔Weaviate; SUPERSEDES edges |
+| **Zep** | Bi-temporal tracking | valid_from/valid_until/created_at on all relationships |
+| **Mem0/Mem0g** | LLM judge for consolidation | Entity extraction dedup: MERGE vs ADD vs SUPERSEDE |
+
+Full paper summaries, diagrams, and application notes are in [`reference-papers.md`](reference-papers.md).
+
+---
+
+## Sources
+
+- [Vercel Chat SDK](https://chat-sdk.dev/) — [GitHub (vercel/chat)](https://github.com/vercel/chat)
+- [Chat SDK Adapters](https://chat-sdk.dev/docs/adapters) — [Changelog](https://vercel.com/changelog/chat-sdk)
+- [GraphRAG via Weaviate & Neo4j](https://weaviate.io/blog/graph-rag)
+- [H-MEM: Hierarchical Memory](https://arxiv.org/pdf/2507.22925)
+- [System-1/System-2 Graph Retrieval](https://arxiv.org/pdf/2602.15313)
+- [Zep Bi-Temporal Model](https://arxiv.org/pdf/2501.13956)
+- [Mem0/Mem0g](https://arxiv.org/pdf/2504.19413)
+- [Dynamic Knowledge Graphs](https://www.ijcai.org/proceedings/2025/0002.pdf)
diff --git a/docs/v2/memory-architecture.md b/docs/v2/memory-architecture.md
new file mode 100644
index 00000000..36299dff
--- /dev/null
+++ b/docs/v2/memory-architecture.md
@@ -0,0 +1,273 @@
+# Beever Atlas — Memory Architecture Reference
+
+This document describes the 3-tier memory system and graph knowledge layer. Use it to understand the data structures available when building features that consume memory (wiki generation, QA agent, search, etc.).
+
+---
+
+## Architecture Overview
+
+```
+Raw Messages (Slack/Discord/Teams)
+    │
+    ▼
+┌─────────────────────────────────────────────────────┐
+│  INGESTION PIPELINE (per batch of messages)          │
+│  Preprocessor → Fact Extractor → Entity Extractor    │
+│  → Embedder → Cross-Batch Validator → Persister      │
+└─────────────────────────────────────────────────────┘
+    │                          │
+    ▼                          ▼
+┌──────────────┐      ┌──────────────────┐
+│  Weaviate     │      │  Neo4j            │
+│  (3-tier      │      │  (knowledge       │
+│   memory)     │      │   graph)          │
+└──────────────┘      └──────────────────┘
+    │                          │
+    ▼                          ▼
+┌─────────────────────────────────────────────────────┐
+│  CONSOLIDATION PIPELINE (after sync completes)       │
+│  Clustering → Context Building → LLM Summaries       │
+│  → Graph Enrichment → Cross-Cluster Links            │
+└─────────────────────────────────────────────────────┘
+    │
+    ▼
+  Tier 1 (TopicCluster) + Tier 0 (ChannelSummary)
+```
+
+---
+
+## Tier 2 — Atomic Facts (Weaviate)
+
+**What**: Individual extracted facts from messages. The retrieval unit for QA search.
+
+**Model**: `AtomicFact` (`src/beever_atlas/models/domain.py`)
+
+**Key fields**:
+
+| Field | Type | Description |
+|-------|------|-------------|
+| `id` | str | Deterministic UUID from `platform:channel_id:message_ts:fact_index` |
+| `memory_text` | str | Self-contained fact (1-2 sentences). Includes rationale and context — e.g. "Alice decided to use Redis for session caching after evaluating Memcached, citing pub/sub support." |
+| `quality_score` | float | 0.0–1.0 composite of specificity, actionability, verifiability |
+| `fact_type` | str | `"decision"` / `"opinion"` / `"observation"` / `"action_item"` / `"question"` |
+| `importance` | str | `"low"` / `"medium"` / `"high"` / `"critical"` |
+| `topic_tags` | list[str] | 1-3 thematic labels (e.g. "deployment", "auth") |
+| `entity_tags` | list[str] | Named entities mentioned (people, projects, tools) |
+| `action_tags` | list[str] | Action verbs (e.g. "decided", "blocked", "shipped") |
+| `author_name` | str | Display name of message author |
+| `message_ts` | str | Timestamp of source message |
+| `thread_context_summary` | str | 1-sentence deliberation arc for threaded discussions |
+| `cluster_id` | str | Which TopicCluster this fact belongs to |
+| `superseded_by` | str | ID of newer fact that replaces this one (null if current) |
+| `supersedes` | str | ID of older fact this one replaces |
+| `source_media_urls` | list[str] | URLs of attached media (images, PDFs, videos) |
+| `source_media_type` | str | `"image"` / `"pdf"` / `"video"` / `"audio"` / `""` |
+| `source_link_urls` | list[str] | URLs shared in the message |
+| `source_link_titles` | list[str] | Titles of shared links |
+| `text_vector` | list[float] | Jina v4 embedding (2048-dim) for semantic search |
+
+**API**: `GET /api/channels/{channel_id}/memories?page=1&limit=50&topic=&entity=&importance=`
+
+**How to query**: Weaviate hybrid search (keyword + semantic) using `text_vector`. Filter by `channel_id`, `topic_tags`, `entity_tags`, `importance`, `fact_type`, timestamp range.
+
+---
+
+## Tier 1 — Topic Clusters (Weaviate)
+
+**What**: Semantic groupings of related atomic facts. Each cluster represents a knowledge area with multi-angle summaries and structured enrichment.
+
+**Model**: `TopicCluster` (`src/beever_atlas/models/domain.py`)
+
+**Key fields**:
+
+| Field | Type | Description |
+|-------|------|-------------|
+| `id` | str | UUID |
+| `title` | str | Short descriptive name (5-10 words, e.g. "JWT Migration to RS256") |
+| `summary` | str | Narrative of what happened (2-3 sentences) |
+| `current_state` | str | Where things stand now (1-2 sentences) |
+| `open_questions` | str | Unresolved tensions/debates (1-2 sentences, empty if resolved) |
+| `impact_note` | str | Scope and significance (1 sentence) |
+| `topic_tags` | list[str] | 3 most representative tags (LLM-selected, not merged from all members) |
+| `member_ids` | list[str] | IDs of member AtomicFacts |
+| `member_count` | int | Number of member facts |
+| `status` | str | `"active"` / `"completed"` / `"stale"` |
+| `staleness_score` | float | 0.0 (fresh) to 1.0 (very stale) |
+| `key_facts` | list[dict] | Top 5 facts by quality_score with attribution: `{fact_id, memory_text, author_name, message_ts, fact_type, importance, quality_score, source_message_id}` |
+| `decisions` | list[dict] | Decisions with supersede chains: `{name, decided_by, status, superseded_by, date, context}` |
+| `people` | list[dict] | Contributors with roles: `{name, role, entity_id}`. Roles: `decision_maker` / `expert` / `contributor` / `mentioned` |
+| `technologies` | list[dict] | Tech mentioned: `{name, category, champion}` |
+| `projects` | list[dict] | Projects: `{name, status, owner, blockers}` |
+| `faq_candidates` | list[dict] | Q&A pairs: `{question, answer}` |
+| `key_entities` | list[dict] | Graph entities: `{id, name, type}` |
+| `key_relationships` | list[dict] | Graph relationships: `{source, type, target, confidence}` |
+| `authors` | list[str] | All contributor names |
+| `date_range_start/end` | str | Temporal span of member facts |
+| `media_refs` | list[str] | Media URLs from member facts |
+| `link_refs` | list[str] | Link URLs from member facts |
+| `fact_type_counts` | dict | `{"decision": N, "question": N, ...}` |
+| `related_cluster_ids` | list[str] | IDs of clusters sharing 2+ entity tags |
+| `centroid_vector` | list[float] | Mean embedding of all member facts |
+
+**API**: `GET /api/channels/{channel_id}/topics` (sorted by member_count desc)
+
+**Clustering**: Embedding-based cosine similarity against cluster centroids (threshold 0.7). No LLM involved in clustering — only in summary generation.
+
+---
+
+## Tier 0 — Channel Summary (Weaviate)
+
+**What**: High-level channel overview synthesizing all topic clusters. One per channel.
+
+**Model**: `ChannelSummary` (`src/beever_atlas/models/domain.py`)
+
+**Key fields**:
+
+| Field | Type | Description |
+|-------|------|-------------|
+| `channel_id` | str | Channel identifier |
+| `channel_name` | str | Resolved display name (e.g. "#backend-engineering") |
+| `text` | str | Overall narrative (3-5 sentences) |
+| `description` | str | One-line channel purpose (max 200 chars) |
+| `themes` | str | How knowledge areas interrelate (2-3 sentences) |
+| `momentum` | str | What's active/completed/stale, velocity (1-2 sentences) |
+| `team_dynamics` | str | Who drives decisions, collaboration patterns (1-2 sentences) |
+| `cluster_count` | int | Number of topic clusters |
+| `fact_count` | int | Total facts across all clusters |
+| `top_decisions` | list[dict] | Channel-wide decisions: `{name, decided_by, status, superseded_by, date, topic_cluster_id, context}` |
+| `top_people` | list[dict] | Contributors aggregated: `{name, role, topic_count, expertise_topics}` (highest role wins across clusters) |
+| `tech_stack` | list[dict] | Technologies: `{name, category, champion, topic_count}` |
+| `active_projects` | list[dict] | Projects: `{name, status, owner, blockers, topic_cluster_id}` |
+| `glossary_terms` | list[dict] | Channel jargon: `{term, definition, first_mentioned_by, related_topics}` |
+| `recent_activity_summary` | dict | Last 7 days: `{facts_added_7d, decisions_added_7d, new_topics, updated_topics, highlights}` |
+| `topic_graph_edges` | list[dict] | Edges between topics: `{source_cluster_id, target_cluster_id, source_title, target_title, shared_entities}` |
+| `key_topics` | list[dict] | All topics: `{tags, title, member_count, status}` |
+| `worst_staleness` | float | Max staleness across all clusters |
+
+**API**: `GET /api/channels/{channel_id}/summary`
+
+---
+
+## Graph Memory (Neo4j)
+
+**What**: Knowledge graph of entities and relationships extracted from messages. Complements the vector memory with structured, traversable connections.
+
+**Protocol**: `GraphStore` (`src/beever_atlas/stores/graph_protocol.py`)
+
+### Entity Types
+
+| Type | Scope | Examples |
+|------|-------|---------|
+| `Person` | global | Alice, Bob, Charlie |
+| `Technology` | global | Redis, Neo4j, Kubernetes |
+| `Project` | global | Atlas, Auth Migration |
+| `Team` | global | Backend Team, Mobile Team |
+| `Decision` | channel | "Use RS256 for JWT signing" |
+| `Meeting` | channel | "Sprint Review March 20" |
+| `Artifact` | channel | "API Spec v3", "Architecture Diagram" |
+
+**Entity fields**: `name`, `type`, `scope`, `properties` (role, category, status, etc.), `aliases`, `status` (active/pending)
+
+### Relationship Types
+
+| Relationship | Meaning | Example |
+|-------------|---------|---------|
+| `DECIDED` | Person made a decision | Alice → DECIDED → Use RS256 |
+| `WORKS_ON` | Person works on project | Bob → WORKS_ON → Atlas |
+| `USES` | Person/project uses tech | Atlas → USES → Redis |
+| `OWNS` | Person owns project | Alice → OWNS → Auth Module |
+| `BLOCKED_BY` | Project blocked by another | Rate Limiting → BLOCKED_BY → Redis Upgrade |
+| `SUPERSEDES` | Decision replaces another | Use RS256 → SUPERSEDES → Use HS256 |
+| `DEPENDS_ON` | Project depends on another | API v2 → DEPENDS_ON → Auth Migration |
+| `REPORTS_TO` | Person reports to person | Bob → REPORTS_TO → Alice |
+| `MENTIONED_IN` | Entity mentioned in fact | Redis → MENTIONED_IN → Event(fact_id) |
+
+**Relationship fields**: `type`, `source`, `target`, `confidence` (0.0-1.0), `valid_from`, `context`
+
+### Episodic Links
+
+Entities are connected to facts via `MENTIONED_IN` edges to `Event` nodes. Each Event stores `weaviate_fact_id`, `message_ts`, `channel_id`, `media_urls`, `link_urls`.
+
+### Key Query Patterns
+
+```python
+# Get all decisions for a channel
+decisions = await graph.get_decisions(channel_id, limit=20)
+
+# Get entity neighborhood (1-2 hops)
+subgraph = await graph.get_neighbors(entity_id, hops=2, limit=50)
+
+# List entities by type
+people = await graph.list_entities(channel_id, entity_type="Person", limit=100)
+
+# List relationships
+rels = await graph.list_relationships(channel_id, limit=200)
+
+# Find entity by name
+entity = await graph.find_entity_by_name("Redis")
+```
+
+---
+
+## How Data Flows for Consumers
+
+### For Wiki Generation
+
+The wiki builder should read pre-computed structured data — no additional LLM calls needed:
+
+```
+ChannelSummary → Overview section (text, description)
+               → Themes section (themes)
+               → Momentum section (momentum, recent_activity_summary)
+               → People section (top_people, team_dynamics)
+               → Tech Stack section (tech_stack)
+               → Projects section (active_projects)
+               → Decisions section (top_decisions with supersede chains)
+               → Glossary section (glossary_terms)
+               → Topic graph (topic_graph_edges → mermaid diagram)
+
+TopicCluster[] → Topic pages (title, summary, current_state, open_questions)
+               → Key facts per topic (key_facts with citation)
+               → Decisions per topic (decisions)
+               → People per topic (people with roles)
+               → FAQ section (faq_candidates)
+
+AtomicFact[]   → Source citations ([1] @author · date · View)
+               → Media & Resources section (source_media_urls, source_link_urls)
+```
+
+### For QA Agent
+
+The QA agent should use a hybrid retrieval strategy:
+
+1. **Vector search** (Weaviate) — query `text_vector` on AtomicFacts for semantic match
+2. **Keyword filter** — filter by `topic_tags`, `entity_tags`, `fact_type`, `importance`
+3. **Graph traversal** (Neo4j) — expand entity neighborhoods for related context
+4. **Tier routing** — broad questions → Tier 0/1 summaries; specific questions → Tier 2 facts
+
+```
+User question
+    │
+    ├─ "What's this channel about?" → ChannelSummary.text + description
+    ├─ "What did we decide about auth?" → TopicCluster(topic_tags∋"auth").decisions + key_facts
+    ├─ "Who works on Redis?" → graph.get_neighbors("Redis") → Person entities
+    └─ "When did we switch to RS256?" → AtomicFact search(fact_type="decision", entity_tags∋"RS256")
+```
+
+---
+
+## File Map
+
+| File | What |
+|------|------|
+| `src/beever_atlas/models/domain.py` | Domain models: AtomicFact, TopicCluster, ChannelSummary, GraphEntity, GraphRelationship |
+| `src/beever_atlas/services/consolidation.py` | Consolidation pipeline: clustering, context building, LLM summaries, graph enrichment |
+| `src/beever_atlas/agents/schemas/consolidation.py` | LLM output schemas: TopicSummaryResult, ChannelSummaryResult, FaqCandidate, GlossaryTerm |
+| `src/beever_atlas/agents/consolidation/summarizer.py` | Summarizer agent factories (topic + channel) |
+| `src/beever_atlas/agents/prompts/fact_extractor.py` | Fact extraction prompt with type-specific context guidance |
+| `src/beever_atlas/stores/weaviate_store.py` | Weaviate CRUD for all tiers |
+| `src/beever_atlas/stores/graph_protocol.py` | GraphStore protocol (Neo4j/NebulaGraph) |
+| `src/beever_atlas/stores/neo4j_store.py` | Neo4j implementation of GraphStore |
+| `src/beever_atlas/api/topics.py` | REST API for topics, summaries, entity cards |
+| `src/beever_atlas/services/batch_processor.py` | Batch processing with checkpoint/resume |
+| `src/beever_atlas/services/sync_runner.py` | Sync orchestration (fetch → process → consolidate) |
diff --git a/docs/v2/reference-papers.md b/docs/v2/reference-papers.md
new file mode 100644
index 00000000..a9467941
--- /dev/null
+++ b/docs/v2/reference-papers.md
@@ -0,0 +1,122 @@
+# Detailed Research Review: Memory Architectures for Conversational AI
+
+This document outlines a detailed review of recent research and frameworks regarding advanced memory structures for AI agents. The goal is to evaluate these findings and apply them to a conversational summarization application built for business communication platforms (Slack, Discord, Teams) to assist with onboarding, project tracking, and company knowledge.
+
+## Part 1: Detailed Paper & Framework Analysis
+
+### 1. GraphRAG via Weaviate & Neo4j
+**Source:** [weaviate.io/blog/graph-rag](https://weaviate.io/blog/graph-rag)
+**Core Concept:** Hybrid Vector-Graph Search (GraphRAG).
+**Detailed Findings:** Proposes a hybrid setup using Neo4j (a graph database) and Weaviate (a vector database). Weaviate's semantic search identifies relevant entities based on meaning, while Neo4j traverses a knowledge graph to find connected entities and broader contextual networks that pure vector search would miss.
+**How it helps our project:** Standard vector search will fail when a user asks multi-hop questions like, "Who is working with Dave on the new onboarding doc?" Vector search understands "onboarding doc," but GraphRAG can traverse the graph from `[Onboarding Doc]` -> `(AUTHORED_BY)` -> `[Alice]` -> `(WORKS_WITH)` -> `[Dave]`. This is crucial for navigating company structures.
+
+```mermaid
+graph LR
+    Q([User Query]) -.->|1. Vector Match| Doc
+    Doc[Onboarding Doc] -- 2. AUTHORED_BY --> Alice[Alice]
+    Alice -- 3. WORKS_WITH --> Dave[Dave]
+    
+    style Doc fill:#e1f5fe,stroke:#01579b,color:#333
+    style Alice fill:#e8f5e9,stroke:#1b5e20,color:#333
+    style Dave fill:#fff3e0,stroke:#e65100,color:#333
+```
+
+### 2. H-MEM (Hierarchical Memory)
+**Source:** [arxiv.org/pdf/2507.22925](https://arxiv.org/pdf/2507.22925)
+**Core Concept:** Top-Down Memory Organization & Temporal Decay.
+**Detailed Findings:** Proposes a Four-Layer Memory Structure instead of a single pool: Domain Layer (broad topics), Category Layer (sub-domains), Memory Trace Layer (key points), and Episode Layer (granular details/timestamps). Also introduces Dynamic Memory Regulation (positive/negative feedback and simulated human forgetting curves).
+**How it helps our project:** Slack channels are chaotic. By adopting this 4-layer structure, we can organize a messy #marketing channel into structured layers. If a user wants a high-level summary, we query the "Domain" or "Category" layer. If they want to know exactly what was said at 2 PM yesterday, we query the "Episode" layer. The forgetting curve helps the AI ignore old, irrelevant chatter.
+
+```mermaid
+flowchart TD
+    L1["Domain Layer\n(Broad Topic: e.g., #marketing)"]
+    L2["Category Layer\n(Sub-domain: e.g., Q3 Launch)"]
+    L3["Memory Trace Layer\n(Key Facts: e.g., Budget Approved)"]
+    L4["Episode Layer\n(Granular: e.g., 'Approved by John at 2:00 PM')"]
+
+    L1 -->|Contains| L2
+    L2 -->|Summarizes| L3
+    L3 -->|Derived from| L4
+    
+    style L1 fill:#ede7f6,stroke:#4527a0,color:#333
+    style L2 fill:#d1c4e9,stroke:#4527a0,color:#333
+    style L3 fill:#b39ddb,stroke:#4527a0,color:#333
+    style L4 fill:#9575cd,stroke:#4527a0,color:#333
+```
+
+### 3. System-1 vs System-2 Routing
+**Source:** [arxiv.org/pdf/2602.15313](https://arxiv.org/pdf/2602.15313)
+**Core Concept:** Dual-Process Graph Retrieval.
+**Detailed Findings:** Defines two routes for retrieval. Route 1 (Base Graph / System-1) is a flat web for speed and precision, acting like a highly advanced "Ctrl+F" to grab specific facts. Route 2 (Hierarchical Graph / System-2) is a semantic tree built for global reasoning. It uses strict rules (Minimum Concept Abstraction, Many-to-Many Mapping, Compression Efficiency) to keep information organized without bloating.
+**How it helps our project:** We can build a smart router for our chatbot. If a user asks a simple question ("What is the guest WiFi?"), we use Route 1 for a lightning-fast response. If they ask a complex question ("Summarize the roadblocks for the Q3 product launch"), we use Route 2 to navigate the semantic tree and generate a comprehensive report.
+
+```mermaid
+flowchart LR
+    Query["User Query"] --> Router{"Smart LLM Router"}
+    Router -- "Simple Fact Retrieval" --> S1["System-1\n(Fast Vector/Base Graph)"]
+    Router -- "Complex Synthesis" --> S2["System-2\n(Hierarchical Semantic Tree)"]
+    S1 --> Response["Final Output"]
+    S2 --> Response
+```
+
+### 4. The Ebbinghaus Forgetting Curve
+**Source:** [en.wikipedia.org/wiki/Forgetting_curve](https://en.wikipedia.org/wiki/Forgetting_curve)
+**Core Concept:** Mathematical Memory Decay.
+**Detailed Findings:** Hermann Ebbinghaus formulated the equation $R = e^{-t/S}$ to show how retention rate ($R$) decays over time ($t$) based on the strength of memory ($S$) if information is not reinforced.
+**How it helps our project:** This provides the exact mathematical formula we can implement in our database scoring algorithm. Messages in Slack quickly become outdated. By applying this curve to the "relevance score" of stored messages, older messages naturally fade away unless people continue to talk about them (which reinforces their score).
+
+### 5. MemoryBank
+**Source:** [arxiv.org/pdf/2305.10250](https://arxiv.org/pdf/2305.10250)
+**Core Concept:** Nightly Distillation & User Profiling.
+**Detailed Findings:** Details a 3-Phase system. Phase A (Storage) records raw dialogue and uses an LLM to distill it into Daily/Global Summaries and persistent User Portraits. Phase B uses the Ebbinghaus curve for memory updates. Phase C performs a flat semantic search to inject retrieved memories into the LLM's background prompt.
+**How it helps our project:** Instead of processing memories on the fly (which is expensive and slow), we can implement a nightly batch job. Every night, the system can summarize the day's Slack messages and update "User Portraits" (e.g., "John is a backend dev who prefers async communication"). This makes the AI highly personalized.
+
+### 6. Dynamic Knowledge Graphs (Updating Memory)
+**Source:** [www.ijcai.org/proceedings/2025/0002.pdf](https://www.ijcai.org/proceedings/2025/0002.pdf)
+**Core Concept:** Fact Replacement & Episodic Linking.
+**Detailed Findings:** Stores raw text (Episodic) and extracts Triplets (Semantic). Crucially, it links the two via "episodic edges." When the environment changes (e.g., a closed locker is now open), it detects the outdated info, deletes the old edge, and adds the new one to prevent contradictory facts. Retrieval happens via a two-step process (Semantic search to find facts, Episodic search to pull the original raw text).
+**How it helps our project:** Essential for tracking project states. If someone says in Teams, "The design is blocked," and later says, "The design is finished," our system must proactively replace the "blocked" fact with "finished" so it doesn't hallucinate contradictory statuses in its summaries.
+
+```mermaid
+stateDiagram-v2
+    direction LR
+    state "Time: T1" as T1 {
+        [*] --> Fact1
+        Fact1: [Design] -- STATUS --> [Blocked]
+    }
+    
+    state "Time: T2" as T2 {
+        Fact2: [Design] -- STATUS --> [Finished]
+        Fact2 --> [*]
+    }
+    
+    T1 --> T2 : New Team Message:\n"Design is finished"
+    note right of T1: Old edge is invalidated
+```
+
+### 7. Zep Framework
+**Source:** [arxiv.org/pdf/2501.13956](https://arxiv.org/pdf/2501.13956)
+**Core Concept:** Bi-Temporal Tracking & Community Subgraphs.
+**Detailed Findings:** Zep uses a Tri-Tiered Subgraph (Episode, Semantic Entity, and Community clusters). Its defining feature is the Bi-Temporal Model: it tracks Event Time (when it happened) and Ingestion Time (when it was recorded). Old facts aren't deleted; they are given "invalidated" timestamps. It also extracts entities using an $N$ message context window.
+**How it helps our project:** The bi-temporal model is the ultimate solution for corporate changes. If a user asks, "Who was managing this project last month?", Zep's bi-temporal tracking allows our bot to look back at invalidated facts and answer correctly, providing a perfect historical audit trail of company decisions.
+
+### 8. Mem0 / Mem0g
+**Source:** [arxiv.org/pdf/2504.19413](https://arxiv.org/pdf/2504.19413)
+**Core Concept:** Automated Consolidation & LLM Judges.
+**Detailed Findings:** Mem0 acts as a fast vector base layer, while Mem0g is a graph-enhanced layer for multi-hop reasoning. The key innovation is Phase B: Consolidation. An LLM acts as a judge against the database, choosing to ADD, UPDATE, DELETE, or NOOP new facts against existing ones.
+**How it helps our project:** This provides the exact logic for our ingestion pipeline. When a new Slack message arrives, we don't just blindly insert it. We run it through an LLM judge to compare it against the user's existing graph node, allowing the AI to organically maintain an accurate state of truth.
+
+```mermaid
+flowchart LR
+    Msg["New Slack Message"] --> Judge{"LLM Judge"}
+    Judge -- "Entirely New Info" --> Add["ADD New Graph Node"]
+    Judge -- "State Change" --> Update["UPDATE Existing Edge"]
+    Judge -- "Explicit Contradiction" --> Delete["DELETE/INVALIDATE Old Fact"]
+    Judge -- "Redundant Chatter" --> Noop["NOOP (Ignore)"]
+```
+
+### 9. Graphiti (by Zep)
+**Source:** [github.com/getzep/graphiti](https://github.com/getzep/graphiti)
+**Core Concept:** Implementation Framework for Context Graphs.
+**Detailed Findings:** An open-source Python framework that builds context graphs with temporal validity windows. It features incremental construction (no need for heavy batch jobs) and Hybrid Retrieval (Dense Semantic + BM25 Keyword + Reciprocal Rank Fusion + Graph Traversal). It handles deduplication natively.
+**How it helps our project:** This is the practical implementation tool. Instead of building the graph ingestion, deduplication, and hybrid search logic from scratch, our engineering team can use Graphiti as the foundational library to connect our Slack/Teams webhooks directly to a database like Neo4j.
diff --git a/docs/v2/weakness-resolution-map.md b/docs/v2/weakness-resolution-map.md
new file mode 100644
index 00000000..3b3d929e
--- /dev/null
+++ b/docs/v2/weakness-resolution-map.md
@@ -0,0 +1,586 @@
+# Beever Atlas v2: Weakness Resolution Map
+
+> **Date**: 2026-03-24
+> **Purpose**: Maps every validated weakness from `RETRIEVAL_IMPROVEMENT_IDEAS.md` to its resolution in the v2 architecture
+> **Status**: Complete — all 15 weaknesses addressed, all 8 proposed solutions incorporated
+
+---
+
+## Resolution Summary
+
+| Weakness | Severity | v2 Resolution | v2 Doc |
+|----------|----------|--------------|--------|
+| 1.1 Top-down only retrieval | Medium | Bidirectional expansion (up + down) in Semantic Retriever | [02-semantic-memory.md](02-semantic-memory.md) |
+| 1.2 Meaningless expansion thresholds | Medium | Score-based expansion (`max_score < 0.6`) | [02-semantic-memory.md](02-semantic-memory.md) |
+| 1.3 Detail queries bypass hierarchy | **High** | Two-stage topic-first retrieval (coarse→fine) | [02-semantic-memory.md](02-semantic-memory.md) |
+| 1.4 Temporal decay never applied | **High** | `apply_temporal_decay()` wired into retrieval pipeline | [02-semantic-memory.md](02-semantic-memory.md) |
+| 1.5 No feedback loop | Medium | Citation tracking + quality metrics in MongoDB | [02-semantic-memory.md](02-semantic-memory.md) |
+| 1.6 Slack only | Medium | Python adapter layer (Slack, Teams, Discord) | [05-ingestion-pipeline.md](05-ingestion-pipeline.md) |
+| 1.7 No real-time sync | Medium | Optional Chat SDK webhook bridge (Phase 2) | [05-ingestion-pipeline.md](05-ingestion-pipeline.md) |
+| 1.8 No memory expiration | Medium | Ebbinghaus decay + bi-temporal `valid_at`/`invalid_at` | [02-semantic-memory.md](02-semantic-memory.md), [03-graph-memory.md](03-graph-memory.md) |
+| 1.9 ADK migration incomplete | Low | Full ADK integration — all LLM operations are ADK agents with tools, orchestration, and state management | [13-adk-integration.md](13-adk-integration.md) |
+| 1.10 Brittle regex classifier | Medium | LLM-powered query understanding (flash-lite) | [04-query-router.md](04-query-router.md) |
+| 1.11 Cluster linking is a no-op | **High** | `_link_memories_to_cluster()` actually writes `cluster_id` | [02-semantic-memory.md](02-semantic-memory.md) |
+| 1.12 No cross-channel search | Medium | Global entities (Person/Tech/Project) span channels; channel-scoped types (Decision/Meeting) by design | [03-graph-memory.md](03-graph-memory.md) |
+| 1.13 Memory quality 5.25/10 | **High** | Quality gate: reject < 0.5, max 2 facts/msg, vague pattern filter | [05-ingestion-pipeline.md](05-ingestion-pipeline.md) |
+| 1.14 No adaptive alpha | Low | Pass `alpha=None` → `get_adaptive_alpha()` runs automatically | [02-semantic-memory.md](02-semantic-memory.md) |
+| 1.15 No semantic dedup | Low | Jaccard similarity dedup across tiers after expansion | [02-semantic-memory.md](02-semantic-memory.md) |
+
+**All 8 proposed solutions from the original doc are incorporated:**
+
+| Solution | Status | How It's Used in v2 |
+|----------|--------|-------------------|
+| A: Two-stage topic-first retrieval | **Incorporated** | Semantic Retriever: clusters first → scoped atomic search |
+| B: Bidirectional tier expansion | **Incorporated** | `_should_expand(memories, "up")` path added |
+| C: Score-based expansion thresholds | **Incorporated** | `max_score < 0.6 or avg_score < 0.4` replaces count checks |
+| D: Apply temporal decay | **Incorporated** | `_apply_temporal_decay()` called before returning results |
+| E: LLM-augmented query classification | **Incorporated + Enhanced** | Expanded to full query router (semantic/graph/both) |
+| F: Memory quality pipeline | **Incorporated** | `MemoryQualityGate` class with scoring + rejection |
+| G: Adaptive alpha per query | **Incorporated** | `alpha=None` in all retrieval methods |
+| H: Cross-tier semantic dedup | **Incorporated** | `_semantic_dedup()` with Jaccard similarity |
+
+---
+
+## Detailed Resolution Per Weakness
+
+### 1.1 Top-Down Only Retrieval (No Bottom-Up)
+
+**Original Problem:** `retrieve()` only expands downward. If Tier 2 atomic search returns weak results, there's no way to navigate up to Tier 1 clusters for broader context. If Tier 0 summary is stale, there's no way to synthesize from fresh Tier 2 data.
+
+**v2 Resolution: Bidirectional Expansion**
+
+The improved Semantic Retriever adds upward expansion to every depth:
+
+```python
+# In SemanticRetriever.retrieve():
+
+# Topic depth: after searching clusters + scoped atomics
+if self._should_expand(memories, "up"):
+    summaries = await self._retrieve_summary(channel_id, query)
+    memories = self._merge_and_rerank(memories, summaries)
+
+# Detail depth: after searching atomics directly
+if self._should_expand(memories, "up"):
+    clusters = await self._retrieve_clusters(channel_id, query)
+    memories = self._merge_and_rerank(memories, clusters)
+```
+
+**Additionally**, the Graph Memory provides a complementary upward path. When semantic search at Tier 2 is weak, the router can fall back to Neo4j entity traversal — effectively navigating "up" via relationship structure rather than text hierarchy.
+
+**v2 doc:** [02-semantic-memory.md](02-semantic-memory.md) (`ImprovedSemanticRetriever`)
+
+---
+
+### 1.2 Hardcoded Expansion Thresholds Are Meaningless
+
+**Original Problem:** `if len(memories) < 2` and `if len(memories) < 3` — raw counts, not relevance scores. 5 irrelevant results = "enough"; 1 perfect result = "expand."
+
+**v2 Resolution: Score-Based Expansion**
+
+```python
+def _should_expand(self, memories: list, direction: str) -> bool:
+    """Score-based, not count-based."""
+    if not memories:
+        return True
+    scores = [m.get("score", 0) for m in memories]
+    return max(scores) < 0.6 or (sum(scores) / len(scores)) < 0.4
+```
+
+Both thresholds are configurable via `settings.expansion_score_threshold` and `settings.expansion_avg_threshold`.
+
+After expansion, results are **re-ranked by score** (not just appended):
+```python
+memories = self._merge_and_rerank(original, expanded)
+# Sorts by score descending, takes top max_results
+```
+
+**v2 doc:** [02-semantic-memory.md](02-semantic-memory.md) (`_should_expand` method)
+
+---
+
+### 1.3 Detail Queries Don't Benefit from Hierarchical Structure
+
+**Original Problem:** Detail queries go straight to flat Tier 2 search across ALL atomic memories. For a channel with 10K memories, the hierarchy provides zero benefit — it's identical to flat vector search.
+
+**v2 Resolution: Two-Stage Topic-First Retrieval (Solution A)**
+
+Even for detail queries, the Semantic Retriever now uses topic clusters to scope the search:
+
+```
+Step 1 (coarse): Find relevant topic clusters
+  hybrid_search(tier="tier1_cluster", topic_filter=extracted_topics)
+  → "authentication" cluster (member_ids: [uuid1..uuid15])
+  → "security" cluster (member_ids: [uuid20..uuid28])
+
+Step 2 (fine): Search atomics WITHIN matched clusters only
+  hybrid_search(tier="tier2_atomic", id_filter=member_ids)
+  → Searches 43 memories instead of 10,000
+```
+
+**Prerequisites (from Solution A) are addressed:**
+1. `_link_memories_to_cluster()` actually writes `cluster_id` → **FIXED** (see 1.11)
+2. `cluster_id` filter in `hybrid_search()` → **ADDED** as `id_filter` parameter
+
+**Fallback:** If no matching clusters are found (new topic not yet clustered), falls back to global Tier 2 search — same as v1 behavior, so no regression.
+
+**v2 doc:** [02-semantic-memory.md](02-semantic-memory.md) (`depth == "topic"` branch)
+
+---
+
+### 1.4 Temporal Decay Exists But Is Never Applied
+
+**Original Problem:** `apply_temporal_decay()` exists in `temporal.py:153-181` but is never called. Only `enrich_memories_with_temporal()` (text labels) is used. A 6-month-old fact has identical retrieval weight as yesterday's.
+
+**v2 Resolution: Temporal Decay Wired Into Retrieval**
+
+```python
+# In SemanticRetriever.retrieve(), BEFORE returning results:
+self._apply_temporal_decay(memories)
+
+def _apply_temporal_decay(self, memories: list) -> None:
+    for m in memories:
+        days_ago = self._days_since(m.get("timestamp"))
+        decay = math.exp(-self.decay_rate * (days_ago / 30))
+        m["score"] *= decay
+    memories.sort(key=lambda m: m.get("score", 0), reverse=True)
+```
+
+Additionally, the Graph Memory provides **bi-temporal tracking** on all relationships:
+- `valid_from`: when the relationship became true
+- `valid_until`: when it was invalidated (null = current)
+- `created_at`: when we ingested it
+
+This means the temporal query "How did auth evolve?" follows `SUPERSEDES` chains in Neo4j with proper time ordering — something the v1 text labels could never support.
+
+**v2 docs:** [02-semantic-memory.md](02-semantic-memory.md) (`_apply_temporal_decay`), [03-graph-memory.md](03-graph-memory.md) (temporal properties on Neo4j relationships)
+
+---
+
+### 1.5 No Feedback Loop for Retrieval Quality
+
+**Original Problem:** No thumbs up/down, no citation tracking, no active learning. The eval plan (`09-MEMORY_EVAL_PLAN.md`) is documentation only — no pipeline runs in production.
+
+**v2 Resolution: Citation Tracking + Quality Metrics**
+
+The v2 Weaviate schema includes `quality_score` on every atomic memory. The response generator tracks which memories were actually cited:
+
+```python
+# In response_generator.py:
+async def generate(self, query, memories, ...) -> Response:
+    response = await self._llm_generate(query, memories)
+
+    # Track which memories were cited
+    cited_ids = self._extract_cited_memory_ids(response)
+
+    # Log to MongoDB for quality analysis
+    await self.mongo.quality_logs.insert_one({
+        "query": query,
+        "retrieved_ids": [m["id"] for m in memories],
+        "cited_ids": cited_ids,
+        "retrieval_precision": len(cited_ids) / len(memories),
+        "timestamp": datetime.utcnow(),
+    })
+```
+
+This enables:
+- **Precision@K tracking**: What % of retrieved memories were actually useful?
+- **Citation coverage**: Are we finding the right information?
+- **Future active learning**: Boost memories that get cited, penalize those that don't
+
+**Status:** Partially addressed in v2 (tracking infrastructure). Full active learning loop is Phase 2+.
+
+**v2 doc:** [02-semantic-memory.md](02-semantic-memory.md) (Weaviate schema: `quality_score`), [07-deployment.md](07-deployment.md) (module structure: `mongo_store.py`)
+
+---
+
+### 1.6 Single Workspace, Slack Only
+
+**Original Problem:** Hardcoded to single Slack workspace. No Teams, Discord, or multi-workspace support.
+
+**v2 Resolution: Python Adapter Layer with NormalizedMessage**
+
+```python
+class NormalizedMessage:
+    content: str
+    author: AuthorInfo
+    platform: Platform          # slack | teams | discord
+    channel_id: str
+    channel_name: str
+    message_id: str
+    timestamp: datetime
+    thread_id: str | None
+    attachments: list[Attachment]
+    ...
+
+class SlackAdapter(BaseAdapter):   # slack-sdk (Python)
+class TeamsAdapter(BaseAdapter):   # Microsoft Graph API
+class DiscordAdapter(BaseAdapter): # discord.py
+```
+
+Every adapter normalizes platform-specific messages into `NormalizedMessage`. The rest of the pipeline is platform-agnostic.
+
+**Chat SDK evaluation:** The [Vercel Chat SDK](https://chat-sdk.dev/) is TypeScript-only and can't fetch message history. It's suitable for real-time webhooks (Phase 2) but not batch ingestion. Python adapters are the primary ingestion mechanism.
+
+**v2 doc:** [05-ingestion-pipeline.md](05-ingestion-pipeline.md) (Multi-Platform Adapters)
+
+---
+
+### 1.7 No Real-Time Sync
+
+**Original Problem:** Sync is pull-based only. `slack_app_token` and `slack_signing_secret` config fields exist but are unused. Socket Mode was planned but never implemented.
+
+**v2 Resolution: Dual-Mode Ingestion**
+
+- **Mode 1 (primary):** Python adapters for batch history fetch — works today, no webhook infrastructure needed
+- **Mode 2 (Phase 2):** Optional Chat SDK TypeScript bridge for real-time webhook ingestion
+
+```yaml
+# docker-compose.yml — Phase 2 addition:
+chat-sdk-bridge:
+  build: ./chat-sdk-bridge
+  environment:
+    SLACK_BOT_TOKEN: ${SLACK_BOT_TOKEN}
+    BEEVER_API_URL: http://beever-atlas:8000
+```
+
+The Chat SDK bridge receives webhook events from Slack/Teams/Discord and POSTs normalized messages to the Python backend's `/api/ingest` endpoint.
+
+**Status:** Batch sync is the v2 MVP. Real-time is Phase 2.
+
+**v2 doc:** [05-ingestion-pipeline.md](05-ingestion-pipeline.md) (Chat SDK evaluation, dual-mode diagram)
+
+---
+
+### 1.8 No Memory Expiration / Storage Growth Management
+
+**Original Problem:** No automated TTL, no archival, no pruning. Channels accumulate unbounded memories.
+
+**v2 Resolution: Ebbinghaus Decay + Bi-Temporal Model**
+
+Two complementary mechanisms:
+
+**1. Retrieval-time decay (Ebbinghaus):** Old memories naturally rank lower via `apply_temporal_decay()`. They still exist but don't surface in results unless reinforced (frequently cited/accessed).
+
+**2. Bi-temporal `valid_at`/`invalid_at`:** When a fact is superseded (detected via Neo4j `SUPERSEDES` edges), the old Weaviate memory gets `invalid_at` set. Queries can filter out invalidated facts:
+
+```python
+# In hybrid_search, optionally exclude invalidated memories:
+if exclude_invalidated:
+    combined_filter &= Filter.by_property("invalid_at").is_none(True)
+```
+
+**3. Future: Scheduled pruning** (Phase 2+): Archive memories where `quality_score < 0.3` AND `days_ago > 90` AND `never_cited = True`.
+
+**v2 docs:** [02-semantic-memory.md](02-semantic-memory.md) (temporal decay), [03-graph-memory.md](03-graph-memory.md) (bi-temporal properties, SUPERSEDES edges)
+
+---
+
+### 1.9 ADK Migration Incomplete
+
+**Original Problem:** The `agents/` directory has ADK scaffolding (coordinator, orchestrator, retrieval agents) but they're disconnected from the main MCP tools path in `server.py`.
+
+**v2 Resolution: Full ADK Agent Architecture**
+
+The v2 redesign replaces all direct LLM calls with Google ADK agents. The `query_router_agent` (root `LlmAgent`) orchestrates retrieval via `ParallelAgent` (semantic + graph) and extraction via `SequentialAgent`:
+
+```
+src/beever_atlas/
+├── agents/                      # ADK agent definitions
+│   ├── query_router_agent.py    # Root LlmAgent — routes to retrieval or extraction
+│   ├── semantic_agent.py        # Weaviate 3-tier retrieval tools
+│   ├── graph_agent.py           # Neo4j traversal tools
+│   ├── response_agent.py        # Grounded citations from session state
+│   ├── extraction_agents.py     # SequentialAgent: preprocess → extract → persist
+│   ├── consolidation_agent.py   # LoopAgent: cluster assignment → health check
+│   └── tools.py                 # ADK FunctionTool wrappers for store operations
+├── retrieval/
+│   ├── semantic_retriever.py    # Store operations (wrapped as ADK tools)
+│   ├── graph_retriever.py       # Store operations (wrapped as ADK tools)
+│   └── result_merger.py         # Merge + dedup + rank
+```
+
+Store operations remain in `retrieval/` and `stores/` but are wrapped as ADK `FunctionTool` instances. The agent layer handles orchestration, model selection, and fallback via LiteLLM.
+
+**v2 docs:** [13-adk-integration.md](13-adk-integration.md) (agent hierarchy, tool mappings, model config), [07-deployment.md](07-deployment.md) (module structure)
+
+---
+
+### 1.10 Query Classification Uses Brittle Regex
+
+**Original Problem:** `QueryClassifier` uses hardcoded regex. `(\w+)` captures only single-word topics. DETAIL patterns checked first cause misclassification. `model_query_classification = gemini-2.5-flash-lite` is configured but never used.
+
+**v2 Resolution: LLM-Powered Query Understanding (Enhanced Solution E)**
+
+The v2 query router goes beyond Solution E's original proposal. Instead of just classifying overview/topic/detail, it also:
+- **Routes to Semantic vs Graph memory** (new capability from dual-memory architecture)
+- **Extracts entities** for Neo4j fuzzy matching (not just topics)
+- **Detects temporal intent** (recent/any/historical)
+- **Falls back to regex** for obvious queries (zero cost fast path)
+
+```python
+QUERY_UNDERSTANDING_PROMPT = """
+Classify this query:
+1. route: "semantic" | "graph" | "both"
+2. semantic_depth: "overview" | "topic" | "detail"
+3. entities: ["Alice", "JWT"]
+4. topics: ["authentication", "deployment"]   ← multi-word, multi-topic
+5. temporal_scope: "recent" | "any" | "historical"
+6. confidence: 0.0-1.0
+"""
+```
+
+**Key improvements over Solution E:**
+- Multi-word topics: "API design" instead of just "API" ✓
+- Multi-topic detection: "NBA and FIFA" → `["NBA", "FIFA"]` ✓
+- No priority misclassification: LLM understands intent holistically ✓
+- Additionally: routes to graph memory for relational queries (Solution E didn't have this)
+
+**Cost:** Same ~$0.001/query using `gemini-2.5-flash-lite` — the model that was already configured but unused.
+
+**v2 docs:** [04-query-router.md](04-query-router.md) (Query Understanding, Routing Strategy)
+
+---
+
+### 1.11 Cluster Linking Is a No-Op (THE Blocker)
+
+**Original Problem:** `_link_memories_to_cluster()` is literally a `logger.debug()` — it never writes `cluster_id` to atomic memories. This breaks everything: topic-first retrieval is impossible, consolidation re-processes the same memories every run, duplicate clusters accumulate.
+
+**v2 Resolution: Actually Write `cluster_id`**
+
+```python
+async def _link_memories_to_cluster(self, memories, cluster_id):
+    """v1: no-op. v2: ACTUALLY writes cluster_id."""
+    collection = self.weaviate.collections.get(COLLECTION_NAME)
+    for memory in memories:
+        if memory.get("id"):
+            collection.data.update(
+                uuid=memory["id"],
+                properties={"cluster_id": cluster_id}
+            )
+    logger.info(f"Linked {len(memories)} memories to cluster {cluster_id}")
+```
+
+**Additionally, prevent duplicate clusters:**
+```python
+async def _consolidate_to_clusters(self, channel_id):
+    for topic, memories in topic_groups.items():
+        # CHECK if cluster already exists for this topic
+        existing = await self._find_existing_cluster(channel_id, topic)
+        if existing:
+            await self._update_cluster(existing, memories)  # Update, don't duplicate
+        else:
+            cluster_id = await self._create_topic_cluster(channel_id, topic, memories)
+
+        # THIS LINE ACTUALLY WORKS NOW
+        await self._link_memories_to_cluster(memories, cluster_id)
+```
+
+This unblocks:
+- ✅ Topic-first retrieval (Solution A)
+- ✅ `_get_unclustered_memories()` correctly filters already-clustered memories
+- ✅ No more duplicate cluster accumulation
+
+**v2 doc:** [02-semantic-memory.md](02-semantic-memory.md) (Consolidation Service)
+
+---
+
+### 1.12 No Cross-Channel Search
+
+**Original Problem:** Every query tool requires `channel_id`. If auth was discussed in `#backend` but user asks in `#frontend`, it won't be found.
+
+**v2 Resolution: Graph Memory Naturally Spans Channels**
+
+In the Semantic Memory (Weaviate), queries remain channel-scoped — this is by design for cost and relevance.
+
+But the Graph Memory (Neo4j) naturally provides cross-channel visibility because **entities span channels**:
+
+```cypher
+-- "What decisions has Alice made?" — searches across ALL channels
+MATCH (p:Person {name: "Alice"})-[:DECIDED]->(d:Decision)
+RETURN d.summary, d.channel, d.valid_from
+ORDER BY d.valid_from DESC
+```
+
+A `Person(Alice)` node created from `#backend` messages is the SAME node referenced in `#frontend` messages. The graph naturally deduplicates entities across channels.
+
+**Status:** Fully resolved. The entity scoping strategy makes this explicit by design: global entities (Person, Technology, Project, Team) are MERGED by name only and span all channels. Channel-scoped entities (Decision, Meeting, Artifact) are intentionally channel-local because they are contextual. Cross-channel search for relational queries works by default via the global entity nodes. Weaviate semantic search remains channel-scoped for cost and relevance — this is the correct behavior, not a limitation.
+
+**v2 docs:** [03-graph-memory.md](03-graph-memory.md) (Graph Memory, Memory Interconnection, Entity Scoping Strategy)
+
+---
+
+### 1.13 Memory Quality Is Low (5.25/10)
+
+**Original Problem:** 319-memory audit: 5.25/10 average quality, 2.44 facts per message (too many), only 2.2% high quality, 17% vague/generic. Examples: "The user does not use 'uv'", "The output was adjusted accordingly."
+
+**v2 Resolution: Quality Gate at Extraction (Solution F, Enhanced)**
+
+Three layers of quality control:
+
+**Layer 1: Extraction prompt improvement**
+```
+Extract only the MOST IMPORTANT 1-2 facts from this message.
+Each fact MUST be self-contained — understandable without the original message.
+Do NOT extract obvious, trivial, or context-dependent statements.
+```
+Target: 1-2 facts/message (down from 2.44)
+
+**Layer 2: Quality gate scoring + rejection**
+```python
+class MemoryQualityGate:
+    MIN_QUALITY_SCORE = 0.5
+    MAX_FACTS_PER_MESSAGE = 2
+    VAGUE_PATTERNS = ["the user", "the process", "it was", ...]
+
+    def score_fact(self, fact):
+        # Length, vagueness, specificity, self-containedness checks
+        # Returns 0.0-1.0
+
+    def gate(self, facts):
+        # Reject < 0.5, keep top 2 by quality
+```
+Target: reject vague/generic → < 5% (down from 17%)
+
+**Layer 3: Retrieval-time quality boost**
+```python
+# Quality-weighted ranking: good memories score higher
+quality = mem.get("quality_score", 0.5)
+mem["score"] = mem["score"] * (0.7 + 0.3 * quality)
+```
+
+**Expected impact:** Quality score from 5.25/10 → target > 7.0/10. High quality (>6) from 2.2% → target > 50%.
+
+**v2 doc:** [05-ingestion-pipeline.md](05-ingestion-pipeline.md) (Quality Gate)
+
+---
+
+### 1.14 No Per-Query-Type Hybrid Alpha Tuning
+
+**Original Problem:** `get_adaptive_alpha()` exists in `weaviate_client.py` but is bypassed by hierarchical retrieval because it always passes an explicit `alpha` (hardcoded `0.3` for Tier 0, `settings.hybrid_alpha` for Tier 1/2).
+
+**v2 Resolution: Pass `alpha=None` (Solution G)**
+
+One-line fix per retrieval method:
+
+```python
+# Before (v1):
+await hybrid_search(..., alpha=settings.hybrid_alpha)
+
+# After (v2):
+await hybrid_search(..., alpha=None)  # → get_adaptive_alpha() runs automatically
+```
+
+The existing `get_adaptive_alpha()` logic is sound:
+- Short keyword queries → favor BM25 (`alpha=0.2`)
+- Medium queries → balanced (`alpha=0.5`)
+- Long semantic queries → favor vector (`alpha=0.7`)
+
+This is applied in ALL three retrieval methods: `_retrieve_summary`, `_retrieve_clusters`, `_retrieve_atomics`.
+
+**v2 doc:** [02-semantic-memory.md](02-semantic-memory.md) (all retrieval calls use `alpha=None`)
+
+---
+
+### 1.15 No Semantic Deduplication Across Tiers
+
+**Original Problem:** Dedup only checks by memory ID. Same information in Tier 1 cluster summary and Tier 2 atomic gets included twice, wasting LLM token budget.
+
+**v2 Resolution: Jaccard Similarity Dedup (Solution H)**
+
+Applied after tier expansion merges results:
+
+```python
+def _semantic_dedup(self, memories: list, threshold=0.85) -> list:
+    unique = []
+    for mem in memories:
+        is_dup = any(
+            self._jaccard_similarity(mem["memory"], e["memory"]) > threshold
+            for e in unique
+        )
+        if not is_dup:
+            unique.append(mem)
+    return unique
+```
+
+When a Tier 1 summary and Tier 2 atomic overlap semantically, the **more specific one** (typically the atomic with citations) is kept.
+
+**v2 doc:** [02-semantic-memory.md](02-semantic-memory.md) (`_semantic_dedup` method)
+
+---
+
+## What the Graph Memory Adds (Beyond v1 Weaknesses)
+
+The original `RETRIEVAL_IMPROVEMENT_IDEAS.md` focused on fixing the existing Weaviate-only system. The v2 proposal goes further by adding Graph Memory (Neo4j), which addresses limitations that weren't explicitly listed as weaknesses but are inherent to a vector-only architecture:
+
+| Limitation (implicit in v1) | Graph Memory Solution |
+|---|---|
+| Can't answer "Who decided X?" | Person → DECIDED → Decision traversal |
+| Can't answer "What blocks project Y?" | Project ← BLOCKED_BY ← Constraint traversal |
+| Can't track fact evolution over time | Decision → SUPERSEDES → older Decision chains |
+| Can't connect entities across channels | Same Person node referenced from multiple channels |
+| Can't show organizational structure | Person → MEMBER_OF → Team → OWNS → Project |
+| Can't detect contradictions | Bi-temporal `valid_until` on superseded relationships |
+| No relationship context in wiki | Wiki "People" and "Decisions" sections from Neo4j |
+
+These are capabilities that **no amount of Weaviate improvement can provide** — they require a graph data model.
+
+---
+
+## Completeness Check
+
+| Original Weakness | Has v2 Fix? | Fix Quality |
+|---|---|---|
+| 1.1 Top-down only | ✅ | Full — bidirectional expansion + graph fallback |
+| 1.2 Count thresholds | ✅ | Full — score-based, configurable |
+| 1.3 Detail bypasses hierarchy | ✅ | Full — topic-first two-stage retrieval |
+| 1.4 Temporal decay unused | ✅ | Full — wired into pipeline + bi-temporal graph |
+| 1.5 No feedback loop | ⚠️ Partial | Tracking infra in v2; active learning in Phase 2+ |
+| 1.6 Slack only | ✅ | Full — adapter layer with NormalizedMessage |
+| 1.7 No real-time sync | ⚠️ Partial | Batch in v2 MVP; Chat SDK real-time in Phase 2 |
+| 1.8 No memory expiration | ⚠️ Partial | Ebbinghaus decay + bi-temporal invalidation; pruning in Phase 2 |
+| 1.9 ADK incomplete | ✅ | Full — complete ADK agent architecture with tools, orchestration, and LiteLLM fallback |
+| 1.10 Regex classifier | ✅ | Full — LLM query understanding with graph routing |
+| 1.11 Cluster linking no-op | ✅ | Full — actually writes cluster_id + dedup clusters |
+| 1.12 No cross-channel | ✅ | Global entities span channels by design; channel-scoping for contextual types is correct behavior |
+| 1.13 Quality 5.25/10 | ✅ | Full — 3-layer quality gate (prompt + scoring + retrieval boost) |
+| 1.14 No adaptive alpha | ✅ | Full — `alpha=None` in all methods |
+| 1.15 No semantic dedup | ✅ | Full — Jaccard similarity after tier expansion |
+
+**Result:** 12/15 fully resolved, 3/15 partially resolved (with clear Phase 2 plans).
+
+---
+
+## Original Solutions Mapping
+
+| Solution from RETRIEVAL_IMPROVEMENT_IDEAS.md | v2 Status | Enhancement in v2 |
+|---|---|---|
+| **A: Two-stage topic-first** | ✅ Incorporated | Enhanced with graph-based topic scoping as fallback |
+| **B: Bidirectional expansion** | ✅ Incorporated | Also applies across memory systems (semantic ↔ graph) |
+| **C: Score-based thresholds** | ✅ Incorporated | Configurable via settings |
+| **D: Apply temporal decay** | ✅ Incorporated | Extended with Neo4j bi-temporal tracking |
+| **E: LLM query classification** | ✅ Incorporated + Enhanced | Extended to route between semantic AND graph memory |
+| **F: Memory quality pipeline** | ✅ Incorporated | 3-layer approach (prompt + gate + retrieval boost) |
+| **G: Adaptive alpha** | ✅ Incorporated | One-line fix per method |
+| **H: Semantic dedup** | ✅ Incorporated | Jaccard similarity, prefers specific over general |
+
+---
+
+*All 15 weaknesses are addressed. All 8 proposed solutions are incorporated. The Graph Memory adds 7 additional capabilities that were impossible with Weaviate alone.*
+
+---
+
+## V2-Introduced Risks (Identified and Addressed)
+
+The v2 dual-memory architecture introduces new complexity. The following risks have been identified and designed with mitigations:
+
+| Risk | Severity | Mitigation | v2 Doc |
+|------|----------|------------|--------|
+| Cross-store write failures create orphaned data | Critical | Outbox pattern: MongoDB intent → idempotent fan-out → reconciler | [08-resilience.md](08-resilience.md) |
+| No graceful degradation on component failure | Critical | Circuit breakers + degradation matrix per dependency | [08-resilience.md](08-resilience.md) |
+| No monitoring for 3 databases + APIs | Critical | OpenTelemetry traces + metrics + health endpoints + backups | [09-observability.md](09-observability.md) |
+| Entity resolution drifts at scale | High | Canonical entity registry + Jaro-Winkler fuzzy matching + alias resolution | [03-graph-memory.md](03-graph-memory.md) |
+| Neo4j channel-scoping contradicts cross-channel promise | High | Type-based scoping: global (Person/Tech) vs channel (Decision) | [03-graph-memory.md](03-graph-memory.md) |
+| Single LLM provider (Gemini) = single point of failure | High | Provider abstraction + tiered fallback chain per call site | [08-resilience.md](08-resilience.md) |
+| Entity extraction has no quality gate | High | EntityQualityGate: confidence threshold + hypothetical filter | [05-ingestion-pipeline.md](05-ingestion-pipeline.md) |
+| No access control for private channels | High | Channel-level ACL inherited from platform membership | [10-access-control.md](10-access-control.md) |
+| Graph traversal unbounded for high-degree nodes | High | APOC expansion + directed traversal + 5s timeout + indexes | [03-graph-memory.md](03-graph-memory.md) |
diff --git a/openspec/changes/ingestion-pipeline-hardening/.openspec.yaml b/openspec/changes/ingestion-pipeline-hardening/.openspec.yaml
new file mode 100644
index 00000000..0f528039
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/.openspec.yaml
@@ -0,0 +1,2 @@
+schema: spec-driven
+created: 2026-04-01
diff --git a/openspec/changes/ingestion-pipeline-hardening/design.md b/openspec/changes/ingestion-pipeline-hardening/design.md
new file mode 100644
index 00000000..27b54cd4
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/design.md
@@ -0,0 +1,123 @@
+## Context
+
+Beever Atlas ingests Slack messages through a 7-stage ADK pipeline: Preprocessor -> (FactExtractor || EntityExtractor) -> Classifier -> Embedder -> CrossBatchValidator -> Persister. Facts land in Weaviate (vector store), entities/relationships in Neo4j (graph store). The system serves as an enterprise internal knowledge base.
+
+Current state:
+- Entity dedup uses Jaro-Winkler string similarity only — semantically equivalent names ("Atlas" vs "Beever Atlas") are not merged unless an explicit alias exists.
+- No coreference resolution — pronouns and implicit references pass through unresolved.
+- Multimodal support covers images (Gemini vision) and PDFs (text extraction) only. Video, audio, and Office docs are metadata-only.
+- Weaviate stores embedding vectors but exposes no semantic search — all retrieval is field-filter based.
+- No temporal fact lifecycle — contradictory facts coexist indefinitely.
+- Thread context is lost across ingestion batches.
+- Orphan entities (no relationships in current batch) are hard-deleted immediately.
+
+Constraints:
+- Pipeline runs on Google ADK (SequentialAgent/ParallelAgent/LlmAgent).
+- LLM calls use Gemini via ADK; adding extra LLM calls impacts latency and cost.
+- Weaviate and Neo4j are the only persistent stores (plus MongoDB for outbox).
+- Must remain backward-compatible with existing stored facts/entities.
+
+## Goals / Non-Goals
+
+**Goals:**
+- Resolve pronoun/implicit references before extraction so downstream agents see explicit entity names
+- Merge semantically equivalent entities using embedding similarity, not just string matching
+- Expand multimodal ingestion to cover video (keyframes + transcript), Office documents (docx/xlsx/pptx), and audio
+- Activate Weaviate near-vector search for semantic fact retrieval
+- Implement fact supersession so contradictory or outdated facts are marked invalid with pointers to replacements
+- Preserve thread context across ingestion batches
+- Replace hard orphan deletion with a grace-period soft state
+
+**Non-Goals:**
+- Real-time streaming ingestion (batch model is retained)
+- Building a full NLP coreference model from scratch (we use LLM-based resolution)
+- Supporting non-Slack platforms in this change (adapter layer stays Slack-only)
+- Building a complete query/RAG layer (we only activate vector search primitives)
+- OCR for handwritten or scanned documents
+- Live transcription of ongoing meetings
+
+## Decisions
+
+### D1: LLM-based coreference resolution as a preprocessor sub-step
+
+**Choice**: Add an LLM call in the preprocessor that takes a sliding window of recent messages (current batch + last N persisted messages from the channel) and rewrites pronoun references inline.
+
+**Rationale**: Dedicated NLP coreference models (e.g., neuralcoref, coref-hoi) are English-only, require GPU, and struggle with domain-specific terms. An LLM call with conversation context handles multilingual, domain-specific references naturally.
+
+**Alternatives considered**:
+- *neuralcoref / spaCy pipeline*: Rejected — English-only, poor on domain jargon, extra dependency.
+- *Post-extraction entity linking*: Rejected — by the time entities are extracted, the pronoun context is lost; fixing after extraction is harder than enriching before.
+
+**Implementation**: New `CoreferenceResolver` service called by preprocessor. Takes batch messages + recent channel history (last 20 messages from MongoDB/Weaviate). Returns rewritten text with pronouns replaced by explicit entity names. Original text preserved in `raw_text` field.
+
+### D2: Embedding-based entity dedup in CrossBatchValidator
+
+**Choice**: Before Jaro-Winkler matching, compute embeddings for extracted entity names and compare against known entity name embeddings using cosine similarity (threshold 0.85). Candidates above threshold are presented to the LLM validator for confirmation.
+
+**Rationale**: String similarity fails on semantic equivalence ("Beever Atlas" vs "Atlas" = 0.55 Jaro-Winkler, below 0.8 threshold). Embedding similarity captures meaning. LLM confirmation prevents false merges.
+
+**Alternatives considered**:
+- *Pure embedding similarity without LLM confirmation*: Rejected — too many false positives (e.g., "Redis" and "Redshift" embed similarly).
+- *Knowledge graph link prediction*: Rejected — requires mature graph with many edges; cold-start problem.
+- *Prebuilt synonym dictionary*: Rejected — doesn't scale to project-specific entities.
+
+**Implementation**: Reuse Jina embeddings (same model as fact embeddings). Cache entity name embeddings in Neo4j `Entity.name_vector` property. CrossBatchValidator prompt updated to include embedding-similarity candidates alongside alias matches.
+
+### D3: Modular media extractors with a registry pattern
+
+**Choice**: Refactor `MediaProcessor` into a registry of extractors keyed by MIME type. Each extractor implements `extract(file_bytes, metadata) -> MediaContent`. New extractors: `VideoExtractor` (ffmpeg keyframes + Whisper transcript), `OfficeExtractor` (python-docx/openpyxl/python-pptx), `AudioExtractor` (Whisper API).
+
+**Rationale**: Current `MediaProcessor` has hardcoded if/else branches. A registry pattern makes adding new types trivial and testable in isolation.
+
+**Alternatives considered**:
+- *External document processing service (e.g., Unstructured.io)*: Rejected for now — adds external dependency and cost; revisit if extraction quality is insufficient.
+- *Apache Tika*: Rejected — JVM dependency, heavy for our Python stack.
+
+**Implementation**:
+- Video: `ffmpeg` extracts 1 keyframe per 30s + audio track; Whisper API transcribes audio. Output: combined transcript + keyframe descriptions (via Gemini vision).
+- Office: `python-docx` for .docx, `openpyxl` for .xlsx (cell text + sheet names), `python-pptx` for .pptx (slide text + speaker notes). Output: concatenated text content.
+- Audio: Whisper API transcription. Output: transcript text.
+- All outputs feed into the existing pipeline as enriched message text.
+
+### D4: Activate Weaviate near-vector search
+
+**Choice**: Add `semantic_search(query_vector, filters, limit)` method to `WeaviateStore`. Uses Weaviate's `near_vector` query with optional metadata filters (channel_id, importance, topic_tags, date range).
+
+**Rationale**: Vectors are already stored. Activation is a store-layer change only — no schema migration needed.
+
+**Alternatives considered**:
+- *Hybrid search (BM25 + vector)*: Deferred — requires Weaviate text2vec module config change. Can be added later.
+- *External vector DB (Pinecone, Qdrant)*: Rejected — vectors already in Weaviate, no reason to duplicate.
+
+### D5: Fact supersession via contradiction detection
+
+**Choice**: Add a post-classification step that queries Weaviate for existing facts with overlapping entity_tags and topic_tags. If the LLM identifies a contradiction (e.g., "we use Redis" vs "we deprecated Redis"), the new fact gets a `supersedes` field pointing to the old fact's ID, and the old fact's `invalid_at` is set.
+
+**Rationale**: Enterprise knowledge bases must reflect current state. Stale facts erode trust.
+
+**Alternatives considered**:
+- *Manual curation UI*: Complementary but doesn't solve automated ingestion.
+- *Time-based expiry (TTL)*: Rejected — facts don't expire uniformly; a 2-year-old architecture decision may still be valid.
+- *Version chains*: Rejected as over-engineering for v1 — simple supersession pointer is sufficient.
+
+### D6: Cross-batch thread context via persisted parent summaries
+
+**Choice**: When the preprocessor encounters a thread reply whose parent is not in the current batch, query MongoDB/Weaviate for the parent message by `thread_ts`. Store a `thread_parent_summary` field on preprocessed messages.
+
+**Rationale**: Current design only resolves parent context within the same batch. Enterprise Slack threads often span hours/days across multiple ingestion runs.
+
+### D7: Soft orphan handling with grace period
+
+**Choice**: Instead of deleting entities with zero relationships, tag them as `status: "pending"` with a `pending_since` timestamp. A background reconciler promotes entities to `active` if relationships appear within N batches (configurable, default 5), or prunes them after the window expires.
+
+**Rationale**: First mentions of projects/initiatives often lack relationships in their initial batch. Hard deletion loses important entities.
+
+## Risks / Trade-offs
+
+- **[Increased LLM cost]** Coreference resolution adds one LLM call per batch. -> Mitigation: Use smaller model (Gemini Flash) for coreference; skip batches with no pronouns detected (regex pre-filter).
+- **[Embedding computation for entity names]** Extra Jina API calls for entity name embeddings. -> Mitigation: Cache embeddings on Neo4j nodes; only compute for new/unseen names.
+- **[Video processing latency]** ffmpeg + Whisper can take minutes per video. -> Mitigation: Process media asynchronously; don't block the main pipeline. Use a media processing queue with configurable concurrency.
+- **[False entity merges]** Embedding similarity may suggest merging distinct entities with similar names. -> Mitigation: LLM confirmation step; configurable similarity threshold; merge audit log.
+- **[Contradiction detection false positives]** LLM may incorrectly flag non-contradictory facts as contradictions. -> Mitigation: Only supersede when confidence > 0.8; keep superseded facts queryable (soft invalidation, not deletion).
+- **[Migration complexity]** Adding `name_vector` to existing Entity nodes and `invalid_at`/`supersedes` to existing facts. -> Mitigation: Both are additive fields with null defaults; no destructive migration. Backfill can run asynchronously.
+- **[Whisper API cost for audio/video]** -> Mitigation: Configurable per-workspace; can disable audio transcription for cost-sensitive deployments.
diff --git a/openspec/changes/ingestion-pipeline-hardening/proposal.md b/openspec/changes/ingestion-pipeline-hardening/proposal.md
new file mode 100644
index 00000000..d7aa06e9
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/proposal.md
@@ -0,0 +1,35 @@
+## Why
+
+The ingestion pipeline (raw Slack message -> atomic facts in Weaviate + graph entities in Neo4j) has several semantic and structural gaps that prevent it from serving as a reliable enterprise knowledge base. Key issues: (1) entity deduplication is string-similarity only — "Atlas" and "Beever Atlas" are treated as different nodes, (2) no coreference resolution — pronouns like "it" and "they" lose their referents, (3) multimodal coverage is shallow — videos, Office docs, and audio are ignored, (4) vector embeddings are stored but never queried for semantic search, (5) no temporal fact lifecycle — contradictory facts coexist indefinitely, and (6) cross-batch thread context is lost. For an enterprise product these gaps erode trust in search results and graph accuracy.
+
+## What Changes
+
+- **Coreference resolution layer**: Add a pre-extraction pass that resolves pronouns and implicit references ("it", "they", "this", "that project") to their antecedents within the conversation window, producing enriched text for downstream extractors.
+- **Semantic entity deduplication**: Supplement Jaro-Winkler string matching with embedding-based similarity so "Atlas", "Beever Atlas", and "the atlas project" merge into one canonical node.
+- **Multimodal expansion**: Add extractors for video (keyframe + audio transcript), Office documents (docx/xlsx/pptx text extraction), and audio files; feed extracted content into the same fact/entity pipeline.
+- **Semantic vector search**: Activate Weaviate near-vector queries for fact retrieval so the query layer can find semantically similar facts, not just exact field matches.
+- **Temporal fact lifecycle**: Implement fact supersession — when a new fact contradicts an existing one, mark the old fact as invalidated with a pointer to its replacement.
+- **Cross-batch thread context**: Persist parent message summaries so threaded replies that span ingestion batches retain their conversational context.
+- **Smarter orphan handling**: Replace hard-delete of relationship-less entities with a soft "pending" state that survives across a configurable batch window before final pruning.
+
+## Capabilities
+
+### New Capabilities
+- `coreference-resolution`: Pre-extraction pass resolving pronouns and implicit references to named entities within conversation context
+- `semantic-entity-dedup`: Embedding-based entity merging to complement string-similarity deduplication in the cross-batch validator
+- `multimodal-expansion`: Extraction support for video (keyframes + transcript), Office docs (docx/xlsx/pptx), and audio files
+- `semantic-search`: Activate Weaviate near-vector queries for semantic fact retrieval
+- `temporal-fact-lifecycle`: Fact supersession, invalidation, and contradiction detection across ingestion batches
+- `cross-batch-thread-context`: Persistent parent message summaries for threaded replies spanning multiple ingestion batches
+- `soft-orphan-handling`: Grace-period entity retention replacing immediate orphan deletion
+
+### Modified Capabilities
+
+## Impact
+
+- **Agents**: `preprocessor.py` (coreference pass, media expansion, thread context lookup), `entity_extractor.py` (semantic dedup candidates in prompt), `cross_batch_validator.py` (embedding similarity merge, soft orphan logic), `fact_extractor.py` (contradiction detection hints)
+- **Services**: `media_processor.py` (new extractors for video/audio/office), new `coreference_resolver.py` service
+- **Stores**: `weaviate_store.py` (near-vector search, fact invalidation fields), `neo4j_store.py` (soft-delete/pending state on Entity nodes, supersession edges), `entity_registry.py` (embedding-based fuzzy match)
+- **Dependencies**: New libraries for video processing (e.g., `moviepy` or `ffmpeg`), Office extraction (`python-docx`, `openpyxl`, `python-pptx`), speech-to-text API for audio
+- **Prompts**: Updated extraction prompts for coreference-enriched input, contradiction detection instructions, multimodal content handling
+- **API**: New semantic search endpoint or updated query router to use vector similarity
diff --git a/openspec/changes/ingestion-pipeline-hardening/specs/coreference-resolution/spec.md b/openspec/changes/ingestion-pipeline-hardening/specs/coreference-resolution/spec.md
new file mode 100644
index 00000000..aec4d6d7
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/specs/coreference-resolution/spec.md
@@ -0,0 +1,34 @@
+## ADDED Requirements
+
+### Requirement: Resolve pronoun references to explicit entity names
+The system SHALL resolve pronouns and implicit references (e.g., "it", "they", "this", "that", "the project", "the tool") to their explicit antecedent entity names before passing messages to the fact and entity extractors.
+
+#### Scenario: Pronoun resolving to a named entity in the same batch
+- **WHEN** a message contains "Alice built Atlas. It uses Redis for caching."
+- **THEN** the coreference resolver SHALL rewrite the text to "Alice built Atlas. Atlas uses Redis for caching." before extraction
+
+#### Scenario: Demonstrative reference resolving across messages
+- **WHEN** message 1 says "We're evaluating PostgreSQL for the new service" and message 2 says "That looks promising, let's go with it"
+- **THEN** the resolver SHALL rewrite message 2 to "PostgreSQL looks promising, let's go with PostgreSQL" (or equivalent explicit form)
+
+#### Scenario: No pronouns or implicit references detected
+- **WHEN** a batch of messages contains no pronouns or implicit entity references
+- **THEN** the resolver SHALL pass messages through unchanged with no LLM call (cost optimization)
+
+### Requirement: Use conversation window for context
+The system SHALL provide the coreference resolver with a sliding window of recent messages: the current batch plus the last N persisted messages from the same channel (configurable, default 20).
+
+#### Scenario: Cross-batch pronoun resolution
+- **WHEN** the previous batch contained "Team decided to adopt Kubernetes" and the current batch contains "We started migrating to it yesterday"
+- **THEN** the resolver SHALL resolve "it" to "Kubernetes" using the persisted channel history as context
+
+#### Scenario: Channel history unavailable
+- **WHEN** channel history cannot be retrieved (first batch or store error)
+- **THEN** the resolver SHALL proceed with only the current batch context and log a warning
+
+### Requirement: Preserve original text
+The system SHALL preserve the original unmodified message text in a `raw_text` field on the preprocessed message, alongside the coreference-resolved text in the `text` field.
+
+#### Scenario: Original text retained after resolution
+- **WHEN** a message "They approved it" is resolved to "The security team approved the Redis migration"
+- **THEN** the preprocessed message SHALL have `raw_text: "They approved it"` and `text: "The security team approved the Redis migration"`
diff --git a/openspec/changes/ingestion-pipeline-hardening/specs/cross-batch-thread-context/spec.md b/openspec/changes/ingestion-pipeline-hardening/specs/cross-batch-thread-context/spec.md
new file mode 100644
index 00000000..561ef7c8
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/specs/cross-batch-thread-context/spec.md
@@ -0,0 +1,30 @@
+## ADDED Requirements
+
+### Requirement: Retrieve parent message for cross-batch thread replies
+The system SHALL query persisted data (MongoDB or Weaviate) for the parent message when a thread reply's parent is not present in the current ingestion batch.
+
+#### Scenario: Parent message found in persistence
+- **WHEN** a thread reply references `thread_ts: "1234567890.000100"` and the parent message exists in MongoDB
+- **THEN** the preprocessor SHALL retrieve the parent message text and build `thread_context` as "[Reply to {author}: {text_truncated}]"
+
+#### Scenario: Parent message not found anywhere
+- **WHEN** a thread reply's parent message is not in the current batch, MongoDB, or Weaviate
+- **THEN** the preprocessor SHALL log a warning and proceed without thread context (same as current behavior)
+
+### Requirement: Thread context enriches extraction quality
+The system SHALL pass the resolved thread context to fact and entity extractors so that context-dependent replies produce meaningful facts.
+
+#### Scenario: Context-dependent reply produces valid fact
+- **WHEN** parent message says "Should we migrate from MySQL to PostgreSQL?" and the reply says "Yes, let's do it next sprint"
+- **THEN** the fact extractor SHALL produce a fact like "Team decided to migrate from MySQL to PostgreSQL next sprint" using the thread context
+
+#### Scenario: Self-contained reply works without parent
+- **WHEN** a thread reply says "I deployed the hotfix to production at 3pm"
+- **THEN** the fact extractor SHALL produce a valid fact regardless of whether thread context is available
+
+### Requirement: Configurable thread context lookup
+The system SHALL support configuring whether cross-batch thread context lookup is enabled (default: enabled) and the maximum parent text length (default: 200 chars).
+
+#### Scenario: Thread context disabled
+- **WHEN** cross-batch thread context is disabled in configuration
+- **THEN** the preprocessor SHALL only use in-batch parent messages for thread context (current behavior)
diff --git a/openspec/changes/ingestion-pipeline-hardening/specs/multimodal-expansion/spec.md b/openspec/changes/ingestion-pipeline-hardening/specs/multimodal-expansion/spec.md
new file mode 100644
index 00000000..894e41ae
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/specs/multimodal-expansion/spec.md
@@ -0,0 +1,60 @@
+## ADDED Requirements
+
+### Requirement: Video content extraction
+The system SHALL extract content from video files by generating keyframe images at regular intervals and transcribing the audio track.
+
+#### Scenario: Video with speech and visual content
+- **WHEN** a Slack message contains a .mp4 video attachment of a product demo
+- **THEN** the system SHALL extract keyframes (1 per 30 seconds), describe them via vision API, transcribe the audio via speech-to-text, and combine the output as enriched message text for downstream extraction
+
+#### Scenario: Silent video (no audio track)
+- **WHEN** a video file has no audio track
+- **THEN** the system SHALL extract and describe keyframes only, without attempting audio transcription
+
+#### Scenario: Video exceeds size/duration limit
+- **WHEN** a video exceeds the configurable maximum duration (default 10 minutes) or file size (default 100MB)
+- **THEN** the system SHALL process only the first N minutes/bytes and append a "[truncated]" indicator to the output
+
+### Requirement: Office document text extraction
+The system SHALL extract text content from Microsoft Office documents (.docx, .xlsx, .pptx).
+
+#### Scenario: Word document extraction
+- **WHEN** a .docx file is attached to a Slack message
+- **THEN** the system SHALL extract all paragraph text, preserving heading structure, and feed it as enriched message text (up to configurable char limit, default 10000)
+
+#### Scenario: Excel spreadsheet extraction
+- **WHEN** a .xlsx file is attached
+- **THEN** the system SHALL extract sheet names and cell text content, formatted as "Sheet: <name>\n<cell contents>" for each sheet
+
+#### Scenario: PowerPoint extraction
+- **WHEN** a .pptx file is attached
+- **THEN** the system SHALL extract slide text and speaker notes, formatted as "Slide N: <text>\nNotes: <notes>" for each slide
+
+### Requirement: Audio file transcription
+The system SHALL transcribe audio files (.mp3, .wav, .m4a, .ogg) attached to messages using a speech-to-text API.
+
+#### Scenario: Audio message transcription
+- **WHEN** a Slack message contains an audio recording attachment
+- **THEN** the system SHALL transcribe the audio and append the transcript as enriched message text
+
+#### Scenario: Audio exceeds duration limit
+- **WHEN** an audio file exceeds the configurable maximum duration (default 30 minutes)
+- **THEN** the system SHALL transcribe only the first N minutes and append a "[truncated]" indicator
+
+### Requirement: Media extractor registry pattern
+The system SHALL use a registry of media extractors keyed by MIME type, replacing hardcoded if/else branches in MediaProcessor.
+
+#### Scenario: Known MIME type dispatched to correct extractor
+- **WHEN** a file with MIME type "application/vnd.openxmlformats-officedocument.wordprocessingml.document" is encountered
+- **THEN** the system SHALL dispatch it to the OfficeExtractor (docx handler)
+
+#### Scenario: Unknown MIME type fallback
+- **WHEN** a file with an unregistered MIME type is encountered
+- **THEN** the system SHALL fall back to metadata-only extraction (filename, size, type) as today
+
+### Requirement: Asynchronous media processing
+The system SHALL process video and audio media asynchronously to avoid blocking the main ingestion pipeline.
+
+#### Scenario: Long video does not block batch
+- **WHEN** a batch contains a 5-minute video alongside 50 text messages
+- **THEN** the text messages SHALL proceed through the pipeline without waiting for video processing to complete; video-derived facts SHALL be persisted when processing finishes
diff --git a/openspec/changes/ingestion-pipeline-hardening/specs/semantic-entity-dedup/spec.md b/openspec/changes/ingestion-pipeline-hardening/specs/semantic-entity-dedup/spec.md
new file mode 100644
index 00000000..f60f595a
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/specs/semantic-entity-dedup/spec.md
@@ -0,0 +1,41 @@
+## ADDED Requirements
+
+### Requirement: Embedding-based entity similarity matching
+The system SHALL compute embedding vectors for entity names and compare them against known entity name embeddings using cosine similarity to identify semantically equivalent entities that string similarity misses.
+
+#### Scenario: Semantic equivalence detected
+- **WHEN** the extracted entity name is "Beever Atlas" and the known entity "Atlas" exists with a cosine similarity of 0.92
+- **THEN** the system SHALL flag "Beever Atlas" and "Atlas" as merge candidates
+
+#### Scenario: Similar but distinct entities not merged
+- **WHEN** "Redis" and "Redshift" have a cosine similarity of 0.78 (below the 0.85 threshold)
+- **THEN** the system SHALL NOT flag them as merge candidates
+
+### Requirement: LLM confirmation before merge
+The system SHALL require LLM confirmation before merging embedding-similarity candidates to prevent false merges.
+
+#### Scenario: LLM confirms merge
+- **WHEN** embedding similarity flags "Atlas" and "Beever Atlas" as candidates and the LLM confirms they refer to the same entity
+- **THEN** the cross-batch validator SHALL merge them under the most complete canonical name ("Beever Atlas") with "Atlas" as an alias
+
+#### Scenario: LLM rejects merge
+- **WHEN** embedding similarity flags "Atlas" and "Atlas Corp" as candidates but the LLM determines they are distinct entities (product vs. company)
+- **THEN** the system SHALL keep them as separate entities and record the rejection to avoid re-evaluating in future batches
+
+### Requirement: Cache entity name embeddings
+The system SHALL cache entity name embeddings on Neo4j Entity nodes in a `name_vector` property to avoid recomputing embeddings for known entities.
+
+#### Scenario: New entity gets embedding computed and cached
+- **WHEN** a new entity "Kubernetes" is persisted to Neo4j
+- **THEN** the system SHALL compute and store its name embedding in the `name_vector` property
+
+#### Scenario: Known entity uses cached embedding
+- **WHEN** computing similarity for a known entity that already has a `name_vector`
+- **THEN** the system SHALL use the cached embedding without calling the embedding API
+
+### Requirement: Configurable similarity threshold
+The system SHALL use a configurable cosine similarity threshold (default 0.85) for entity merge candidate detection.
+
+#### Scenario: Threshold adjustment
+- **WHEN** the threshold is set to 0.90
+- **THEN** only entity pairs with cosine similarity >= 0.90 SHALL be flagged as candidates
diff --git a/openspec/changes/ingestion-pipeline-hardening/specs/semantic-search/spec.md b/openspec/changes/ingestion-pipeline-hardening/specs/semantic-search/spec.md
new file mode 100644
index 00000000..1e3b0329
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/specs/semantic-search/spec.md
@@ -0,0 +1,30 @@
+## ADDED Requirements
+
+### Requirement: Near-vector semantic search on facts
+The system SHALL support querying Weaviate facts by vector similarity using the stored `text_vector` embeddings.
+
+#### Scenario: Semantic query returns relevant facts
+- **WHEN** a query "what database did we choose" is embedded and searched with near_vector
+- **THEN** the system SHALL return facts semantically related to database decisions, ranked by vector similarity score
+
+#### Scenario: Semantic search with metadata filters
+- **WHEN** a near-vector query is combined with filters (channel_id, importance >= "high", date range)
+- **THEN** the system SHALL apply both vector similarity ranking and metadata filtering, returning only facts matching all criteria
+
+#### Scenario: Empty results
+- **WHEN** a semantic query has no facts above the minimum similarity threshold (configurable, default 0.7)
+- **THEN** the system SHALL return an empty result set rather than low-relevance matches
+
+### Requirement: Hybrid retrieval combining vector and field-based search
+The system SHALL support a hybrid retrieval mode that combines semantic vector results with exact field-filter results and deduplicates them.
+
+#### Scenario: Hybrid search merges results
+- **WHEN** a query matches 3 facts via vector similarity and 2 facts via exact entity_tag filter, with 1 overlapping fact
+- **THEN** the system SHALL return 4 unique facts, with the overlapping fact ranked highest
+
+### Requirement: Search result includes similarity score
+The system SHALL include a similarity score (0.0-1.0) with each result from semantic search.
+
+#### Scenario: Scores returned with results
+- **WHEN** a semantic search returns 5 facts
+- **THEN** each fact SHALL include a `similarity_score` field indicating its cosine similarity to the query vector
diff --git a/openspec/changes/ingestion-pipeline-hardening/specs/soft-orphan-handling/spec.md b/openspec/changes/ingestion-pipeline-hardening/specs/soft-orphan-handling/spec.md
new file mode 100644
index 00000000..f33ac53b
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/specs/soft-orphan-handling/spec.md
@@ -0,0 +1,41 @@
+## ADDED Requirements
+
+### Requirement: Tag relationship-less entities as pending instead of deleting
+The system SHALL assign a `status: "pending"` state with a `pending_since` timestamp to extracted entities that have no relationships in the current batch, instead of deleting them.
+
+#### Scenario: Entity with no relationships tagged as pending
+- **WHEN** the entity "Project X" is extracted but has no relationships in the current batch or to known entities
+- **THEN** the system SHALL persist it to Neo4j with `status: "pending"` and `pending_since: <current_timestamp>`
+
+#### Scenario: Entity with relationships persisted as active
+- **WHEN** the entity "PostgreSQL" is extracted with a USES relationship from "Alice"
+- **THEN** the system SHALL persist it with `status: "active"` (current behavior)
+
+### Requirement: Promote pending entities when relationships appear
+The system SHALL promote a pending entity to `status: "active"` when a subsequent ingestion batch creates a relationship involving that entity.
+
+#### Scenario: Pending entity gains a relationship
+- **WHEN** "Project X" was persisted as pending in batch N, and batch N+2 extracts a WORKS_ON relationship from "Bob" to "Project X"
+- **THEN** the system SHALL update "Project X" to `status: "active"` and clear `pending_since`
+
+### Requirement: Prune expired pending entities
+The system SHALL delete pending entities that have not gained any relationships within a configurable grace window (default: 5 batches or 7 days, whichever comes first).
+
+#### Scenario: Pending entity expires
+- **WHEN** "Random Tool" has been pending for 5 batches and 8 days with no relationships created
+- **THEN** the background reconciler SHALL delete it from Neo4j
+
+#### Scenario: Pending entity within grace window retained
+- **WHEN** "New Initiative" has been pending for 2 batches and 1 day
+- **THEN** the system SHALL retain it in Neo4j with its pending status
+
+### Requirement: Pending entities excluded from default graph queries
+The system SHALL exclude pending entities from default graph queries but allow explicit inclusion.
+
+#### Scenario: Default query excludes pending
+- **WHEN** a graph query requests entities for a channel without specifying include_pending
+- **THEN** the system SHALL return only active entities
+
+#### Scenario: Explicit inclusion of pending entities
+- **WHEN** a graph query specifies include_pending=true
+- **THEN** the system SHALL return both active and pending entities, with pending entities marked accordingly
diff --git a/openspec/changes/ingestion-pipeline-hardening/specs/temporal-fact-lifecycle/spec.md b/openspec/changes/ingestion-pipeline-hardening/specs/temporal-fact-lifecycle/spec.md
new file mode 100644
index 00000000..f9538909
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/specs/temporal-fact-lifecycle/spec.md
@@ -0,0 +1,34 @@
+## ADDED Requirements
+
+### Requirement: Detect contradictory facts during ingestion
+The system SHALL check newly extracted facts against existing facts with overlapping entity and topic tags to detect contradictions.
+
+#### Scenario: Direct contradiction detected
+- **WHEN** existing fact says "Team uses Redis for caching" and a new fact says "Team deprecated Redis and switched to Memcached"
+- **THEN** the system SHALL identify this as a contradiction with confidence score
+
+#### Scenario: Non-contradictory update not flagged
+- **WHEN** existing fact says "Auth service uses JWT tokens" and new fact says "Auth service added refresh token support"
+- **THEN** the system SHALL NOT flag this as a contradiction (additive, not contradictory)
+
+### Requirement: Supersede outdated facts
+The system SHALL mark contradicted facts as superseded by linking the new fact to the old fact via a `supersedes` pointer, and setting `invalid_at` on the old fact.
+
+#### Scenario: Fact supersession chain
+- **WHEN** fact B supersedes fact A, and later fact C supersedes fact B
+- **THEN** fact A SHALL have `invalid_at` set and `superseded_by: B`, fact B SHALL have `invalid_at` set and `superseded_by: C`, fact C SHALL be the current valid fact
+
+#### Scenario: Low-confidence contradiction not auto-superseded
+- **WHEN** contradiction detection confidence is below 0.8
+- **THEN** the system SHALL NOT automatically supersede the old fact; both facts SHALL coexist with a `potential_contradiction` flag
+
+### Requirement: Superseded facts remain queryable
+The system SHALL retain superseded facts in Weaviate (soft invalidation) so they can be queried for historical context.
+
+#### Scenario: Historical query includes superseded facts
+- **WHEN** a query explicitly requests historical facts (include_superseded=true)
+- **THEN** the system SHALL return both current and superseded facts, with superseded facts marked accordingly
+
+#### Scenario: Default query excludes superseded facts
+- **WHEN** a standard query does not specify include_superseded
+- **THEN** the system SHALL exclude facts where `invalid_at` is set, returning only current facts
diff --git a/openspec/changes/ingestion-pipeline-hardening/tasks.md b/openspec/changes/ingestion-pipeline-hardening/tasks.md
new file mode 100644
index 00000000..bffa8a29
--- /dev/null
+++ b/openspec/changes/ingestion-pipeline-hardening/tasks.md
@@ -0,0 +1,77 @@
+## 1. Coreference Resolution
+
+- [x] 1.1 Create `CoreferenceResolver` service in `src/beever_atlas/services/coreference_resolver.py` with LLM-based pronoun resolution using conversation window context
+- [x] 1.2 Add channel history retrieval (last 20 messages) from MongoDB/Weaviate for cross-batch context window
+- [x] 1.3 Add regex-based pronoun pre-filter to skip LLM call when no pronouns/implicit references detected in batch
+- [x] 1.4 Integrate `CoreferenceResolver` into preprocessor pipeline — call after text cleaning, before thread context assembly
+- [x] 1.5 Preserve original text in `raw_text` field on preprocessed messages alongside resolved `text`
+- [x] 1.6 Write coreference resolution prompt (Gemini Flash) with examples for pronoun, demonstrative, and implicit reference resolution
+- [x] 1.7 Add unit tests for CoreferenceResolver: pronoun resolution, cross-message references, no-pronoun skip, missing history fallback
+
+## 2. Semantic Entity Deduplication
+
+- [x] 2.1 Add `name_vector` property to Neo4j Entity node schema and create backfill script for existing entities
+- [x] 2.2 Extend `EntityRegistry` with `compute_name_embedding(name)` using Jina API and `find_similar(name_vector, threshold)` using cosine similarity
+- [ ] 2.3 Update `CrossBatchValidator` to run embedding similarity check before Jaro-Winkler matching, producing merge candidates
+- [ ] 2.4 Update cross-batch validator prompt to include embedding-similarity candidates with LLM confirmation/rejection step
+- [x] 2.5 Add merge rejection cache (Neo4j or MongoDB) to avoid re-evaluating previously rejected pairs
+- [x] 2.6 Add configurable similarity threshold setting (default 0.85) to pipeline configuration
+- [x] 2.7 Write tests for semantic dedup: merge confirmation, merge rejection, cached rejection skip, threshold tuning
+
+## 3. Multimodal Expansion
+
+- [x] 3.1 Refactor `MediaProcessor` into registry pattern — create `MediaExtractorRegistry` with `register(mime_type, extractor)` and `extract(file_bytes, metadata)` dispatch
+- [x] 3.2 Migrate existing image extractor (Gemini vision) and PDF extractor (pypdf) into registry as `ImageExtractor` and `PdfExtractor`
+- [x] 3.3 Create `OfficeExtractor` for .docx (python-docx), .xlsx (openpyxl), .pptx (python-pptx) with text extraction and char limit
+- [x] 3.4 Create `VideoExtractor` using ffmpeg for keyframe extraction (1 per 30s) and Whisper API for audio transcription
+- [x] 3.5 Create `AudioExtractor` using Whisper API for standalone audio files (.mp3, .wav, .m4a, .ogg)
+- [ ] 3.6 Add async media processing queue so video/audio extraction does not block the main pipeline batch
+- [x] 3.7 Add configurable size/duration limits for video (default 10min/100MB) and audio (default 30min)
+- [x] 3.8 Add dependencies: `python-docx`, `openpyxl`, `python-pptx`, `moviepy` or `ffmpeg-python` to project
+- [x] 3.9 Write tests for each extractor: docx, xlsx, pptx, video (mock ffmpeg/whisper), audio, unknown MIME fallback
+
+## 4. Semantic Search Activation
+
+- [x] 4.1 Add `semantic_search(query_vector, filters, limit, threshold)` method to `WeaviateStore` using Weaviate `near_vector` query
+- [x] 4.2 Add `hybrid_search(query_vector, filters, limit)` method that merges vector results with field-filter results and deduplicates
+- [x] 4.3 Include `similarity_score` field in search results from semantic queries
+- [x] 4.4 Add configurable minimum similarity threshold (default 0.7) to filter low-relevance results
+- [ ] 4.5 Update API query endpoints to support `search_mode: "semantic" | "exact" | "hybrid"` parameter
+- [x] 4.6 Write tests for semantic search: vector query, filtered vector query, hybrid merge, empty results below threshold
+
+## 5. Temporal Fact Lifecycle
+
+- [x] 5.1 Add `superseded_by`, `supersedes`, and `potential_contradiction` fields to AtomicFact schema and Weaviate collection
+- [x] 5.2 Create contradiction detection step in pipeline — after classification, query Weaviate for existing facts with overlapping entity/topic tags
+- [x] 5.3 Write contradiction detection prompt that compares new fact against candidate existing facts and returns contradiction confidence
+- [x] 5.4 Implement fact supersession logic: set `invalid_at` on old fact, `supersedes` on new fact when contradiction confidence >= 0.8
+- [x] 5.5 Add `potential_contradiction` flag for low-confidence contradictions (0.5-0.8) without auto-supersession
+- [x] 5.6 Update Weaviate query methods to exclude `invalid_at`-set facts by default, with `include_superseded` option
+- [x] 5.7 Write tests for contradiction detection: direct contradiction, additive non-contradiction, low-confidence flag, supersession chain
+
+## 6. Cross-Batch Thread Context
+
+- [x] 6.1 Add parent message lookup in preprocessor — query MongoDB by `thread_ts` when parent not in current batch
+- [x] 6.2 Add Weaviate fallback lookup for parent message if MongoDB lookup fails
+- [x] 6.3 Build `thread_context` string from retrieved parent message (author + truncated text, configurable max 200 chars)
+- [x] 6.4 Add configuration toggle for cross-batch thread context (default: enabled) and max parent text length
+- [x] 6.5 Write tests for cross-batch thread context: parent found in MongoDB, parent found in Weaviate, parent not found, disabled config
+
+## 7. Soft Orphan Handling
+
+- [x] 7.1 Add `status` ("active" | "pending") and `pending_since` properties to Neo4j Entity node schema
+- [ ] 7.2 Update `CrossBatchValidator` orphan removal to set `status: "pending"` instead of deleting
+- [x] 7.3 Update `PersisterAgent` to promote pending entities to active when new relationships are created
+- [x] 7.4 Create background reconciler task that prunes expired pending entities (configurable: default 5 batches or 7 days)
+- [x] 7.5 Update Neo4j graph queries to exclude pending entities by default, with `include_pending` option
+- [x] 7.6 Write tests for soft orphan handling: pending creation, promotion on relationship, expiry pruning, query filtering
+
+## 8. Integration & Verification
+
+- [x] 8.1 Run full pipeline end-to-end test with a batch containing: text messages with pronouns, thread replies, images, a .docx attachment, and duplicate entity names
+- [x] 8.2 Verify coreference-resolved text produces correct facts (pronouns replaced before extraction)
+- [x] 8.3 Verify semantic dedup merges "Atlas" / "Beever Atlas" into one canonical entity
+- [x] 8.4 Verify cross-batch thread context resolves parent messages from prior batches
+- [x] 8.5 Verify semantic search returns relevant results for natural language queries
+- [x] 8.6 Verify fact supersession marks outdated facts as invalid when contradictions are ingested
+- [x] 8.7 Verify pending orphan entities survive across batches and get promoted when relationships appear
diff --git a/openspec/changes/m1-skeleton-health-pulse/.openspec.yaml b/openspec/changes/m1-skeleton-health-pulse/.openspec.yaml
new file mode 100644
index 00000000..a61e7c11
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/.openspec.yaml
@@ -0,0 +1,2 @@
+schema: spec-driven
+created: 2026-03-27
diff --git a/openspec/changes/m1-skeleton-health-pulse/design.md b/openspec/changes/m1-skeleton-health-pulse/design.md
new file mode 100644
index 00000000..978d0a14
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/design.md
@@ -0,0 +1,62 @@
+## Context
+
+Beever Atlas v2 is a greenfield project with only `docs/v2/` (13 spec documents) currently in the repo. M1 ("Skeleton & Health Pulse") initializes the full-stack project: Python backend, React frontend, TypeScript bot service, Docker Compose infrastructure, and a health endpoint proving connectivity. All subsequent milestones build on this skeleton.
+
+The Linear milestone defines 8 active tasks (RES-90 already done) with clear spec references to `docs/v2/`. The tasks have a natural dependency order: package structure → config → ADK scaffolding → FastAPI shell → React shell → bot placeholder → memories tab.
+
+## Goals / Non-Goals
+
+**Goals:**
+- Establish `src/beever_atlas/` Python package with all module directories matching `docs/v2/07-deployment.md`
+- Config system loading all env vars for 7 dependencies + API keys + LiteLLM model routing
+- ADK agent foundation: FunctionTool stubs, LiteLLM config, Runner + session service pattern
+- FastAPI app with `GET /api/health` checking Weaviate, Neo4j, MongoDB, Redis connectivity
+- React 19 + Vite + TailwindCSS + shadcn/ui with layout shell, route stubs, dashboard, HealthBadge
+- 3-tier memory browser UI with mock data
+- TypeScript bot service placeholder with Redis connection
+- Docker Compose orchestrating all 7 services
+- Tests for each component
+
+**Non-Goals:**
+- No actual Slack/Teams/Discord adapters (M2)
+- No real ingestion pipeline (M3)
+- No graph memory implementation (M4)
+- No wiki generation (M5)
+- No circuit breaker degradation logic (M7) — only health check portion of DependencyHealth
+- No authentication/ACL (M7)
+- No real data in memory browser — mock/placeholder data only
+
+## Decisions
+
+### 1. Python project tooling: uv + pyproject.toml
+Use `uv` as the Python package manager with `pyproject.toml` for dependency management. Modern, fast, and replaces pip/poetry.
+- **Alternative**: Poetry — heavier, slower resolution, less momentum
+- **Alternative**: pip + requirements.txt — no lock file, less reproducible
+
+### 2. Package layout: src/ layout
+Use `src/beever_atlas/` (src layout) per Python packaging best practices and `docs/v2/07-deployment.md`.
+- **Alternative**: Flat layout (`beever_atlas/` at root) — can cause import confusion with editable installs
+
+### 3. ADK tools as stubs in M1
+Tool functions in `agents/tools.py` will be defined with correct signatures but raise `NotImplementedError` until stores are implemented in M3/M4. This lets agent scaffolding compile and test without real backends.
+- **Alternative**: Skip tools.py until M3 — delays validation of ADK integration patterns
+
+### 4. Health endpoint checks real connections
+`GET /api/health` will attempt actual connections to all 4 data stores (Weaviate, Neo4j, MongoDB, Redis) and report per-component status. In Docker Compose, services have health checks so the backend waits for them.
+- **Alternative**: Stub health always returning "ok" — defeats the purpose of M1's connectivity proof
+
+### 5. React scaffold with shadcn/ui
+Use shadcn/ui (not a component library import — copies components into project) for UI primitives. This gives full control over styling and matches `docs/v2/11-frontend-design.md`.
+- **Alternative**: Radix UI directly — more boilerplate, less opinionated defaults
+- **Alternative**: Material UI — heavy, opinionated styling doesn't match design spec
+
+### 6. Memories tab uses mock data
+The 3-tier memory browser (RES-112) renders with hardcoded mock data in M1. Real API integration happens when Weaviate stores exist (M3).
+- **Alternative**: Skip memories tab until M3 — delays frontend validation of the 3-tier UX concept
+
+## Risks / Trade-offs
+
+- **[Docker Compose complexity]** 7 services is a heavy local stack → Mitigation: document minimum RAM (8GB), provide `docker compose up --profile minimal` for backend-only development
+- **[ADK version churn]** Google ADK is relatively new → Mitigation: pin version in pyproject.toml, wrap integration points for easy updating
+- **[Mock data divergence]** Mock data in memories browser may not match real schemas → Mitigation: define TypeScript types from spec first (`lib/types.ts`), mock data conforms to types
+- **[shadcn/ui React 19 compat]** shadcn/ui ecosystem may have React 19 edge cases → Mitigation: use latest shadcn/ui which targets React 19
diff --git a/openspec/changes/m1-skeleton-health-pulse/proposal.md b/openspec/changes/m1-skeleton-health-pulse/proposal.md
new file mode 100644
index 00000000..db8a1d80
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/proposal.md
@@ -0,0 +1,36 @@
+## Why
+
+Beever Atlas v2 needs its foundational project skeleton before any feature work can begin. The v2 repo currently contains only documentation (`docs/v2/`) — no source code, no infrastructure, no services. M1 establishes the Docker Compose stack (7 services), Python backend shell, React frontend shell, TypeScript bot placeholder, and a health endpoint proving everything is connected. This is the "walking skeleton" that all subsequent milestones (M2-M8) build on.
+
+## What Changes
+
+- Create `src/beever_atlas/` Python package with module directories for agents, adapters, pipeline, stores, retrieval, wiki, server, and infra
+- Add config system loading env vars for all 7 dependencies (Weaviate, Neo4j, MongoDB, Redis) and API keys (Gemini, Jina, Tavily, Anthropic) with LiteLLM model routing
+- Scaffold ADK agent foundation: `tools.py` with FunctionTool stubs, LiteLLM config, Runner + InMemorySessionService integration
+- Docker Compose with Weaviate, Neo4j, MongoDB, Redis, FastAPI backend, React frontend, bot service
+- FastAPI app shell with `GET /api/health` checking all 4 data stores, CORS for React dev server
+- React 19 + Vite + TailwindCSS + shadcn/ui frontend with layout shell, route stubs, HealthBadge, dashboard home
+- React Memories tab: 3-tier memory browser with TierBrowser, SummaryCard, ClusterCard, FactCard components (mock data for M1)
+- TypeScript bot service placeholder connecting to Redis
+- `.env.example` with all required env vars documented
+
+## Capabilities
+
+### New Capabilities
+- `project-scaffold`: Python package structure, Docker Compose, config system, env var management
+- `adk-foundation`: ADK agent scaffolding — Runner, session service, LiteLLM config, FunctionTool stubs
+- `health-endpoint`: FastAPI shell with GET /api/health, DependencyHealth registry, CORS
+- `frontend-shell`: React 19 + Vite + TailwindCSS + shadcn/ui layout, routing, HealthBadge, dashboard
+- `memories-browser`: 3-tier memory browser UI (Tier 0 summary, Tier 1 clusters, Tier 2 facts)
+- `bot-placeholder`: TypeScript bot service with Redis connection, Docker integration
+
+### Modified Capabilities
+<!-- None — greenfield project -->
+
+## Impact
+
+- **New files**: ~40-60 files across `src/`, `web/`, `bot/`, root configs
+- **Dependencies**: Python (FastAPI, google-adk, litellm, weaviate-client, neo4j, pymongo, redis), Node.js (React 19, Vite, TailwindCSS, shadcn/ui, React Router v7)
+- **Infrastructure**: Docker Compose defining 7 services with networking
+- **APIs**: `GET /api/health` — first REST endpoint
+- **No breaking changes** — greenfield project initialization
diff --git a/openspec/changes/m1-skeleton-health-pulse/specs/adk-foundation/spec.md b/openspec/changes/m1-skeleton-health-pulse/specs/adk-foundation/spec.md
new file mode 100644
index 00000000..fc0d0a27
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/specs/adk-foundation/spec.md
@@ -0,0 +1,38 @@
+## ADDED Requirements
+
+### Requirement: ADK FunctionTool stubs
+The system SHALL define `agents/tools.py` containing ADK `FunctionTool` wrappers for all 11 store operations: `search_weaviate_hybrid`, `get_tier0_summary`, `get_tier1_clusters`, `traverse_neo4j`, `temporal_chain`, `comprehensive_traverse`, `get_episodic_weaviate_ids`, `search_tavily`, `upsert_fact`, `upsert_entity`, `create_episodic_link`. Each function SHALL have correct type-annotated signatures and docstrings. In M1, each SHALL raise `NotImplementedError`.
+
+#### Scenario: Tool functions are importable
+- **WHEN** running `from beever_atlas.agents.tools import search_weaviate_hybrid`
+- **THEN** the import succeeds and the function has a docstring
+
+#### Scenario: Stub tools raise NotImplementedError
+- **WHEN** calling `search_weaviate_hybrid(query="test", channel_id="ch1")`
+- **THEN** a `NotImplementedError` is raised with a message indicating the store is not yet implemented
+
+#### Scenario: All 11 tools defined
+- **WHEN** inspecting `agents/tools.py`
+- **THEN** exactly 11 FunctionTool-compatible functions are defined
+
+### Requirement: ADK Runner integration
+The system SHALL provide `agents/runner.py` with a function to create an ADK `Runner` with `InMemorySessionService`. The runner SHALL be usable from FastAPI request handlers to execute agent calls.
+
+#### Scenario: Runner creation
+- **WHEN** calling `create_runner(agent)` with an ADK agent
+- **THEN** a Runner instance is returned with InMemorySessionService configured
+
+#### Scenario: Session creation per request
+- **WHEN** a FastAPI request handler needs to run an agent
+- **THEN** a new session is created via the session service with a unique ID
+
+### Requirement: LiteLLM integration module
+The system SHALL provide `infra/litellm_config.py` that configures LiteLLM model routing for ADK agents. It SHALL define model strings compatible with ADK's `LlmAgent(model=...)` parameter.
+
+#### Scenario: Model string for fast tier
+- **WHEN** requesting `get_model("fast")`
+- **THEN** returns a LiteLLM-compatible model string for the fast tier
+
+#### Scenario: Model string for quality tier
+- **WHEN** requesting `get_model("quality")`
+- **THEN** returns a LiteLLM-compatible model string for the quality tier
diff --git a/openspec/changes/m1-skeleton-health-pulse/specs/bot-placeholder/spec.md b/openspec/changes/m1-skeleton-health-pulse/specs/bot-placeholder/spec.md
new file mode 100644
index 00000000..cb24d603
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/specs/bot-placeholder/spec.md
@@ -0,0 +1,30 @@
+## ADDED Requirements
+
+### Requirement: TypeScript bot service
+The system SHALL have a `bot/` directory containing a Node.js TypeScript project with a service that connects to Redis on startup and logs "ready".
+
+#### Scenario: Service starts
+- **WHEN** running `npm start` in `bot/`
+- **THEN** the service connects to Redis and logs "Bot service ready" to stdout
+
+#### Scenario: Redis connection failure
+- **WHEN** Redis is unreachable on startup
+- **THEN** the service logs an error and exits with code 1
+
+### Requirement: Docker integration
+The system SHALL include a `bot/Dockerfile` that builds the TypeScript project and runs the service. The Dockerfile SHALL be referenced in `docker-compose.yml`.
+
+#### Scenario: Docker build succeeds
+- **WHEN** running `docker build` on `bot/Dockerfile`
+- **THEN** the image builds successfully
+
+#### Scenario: Service starts in Docker Compose
+- **WHEN** running `docker compose up bot`
+- **THEN** the bot service starts, connects to the Redis service, and logs "ready"
+
+### Requirement: No platform adapters in M1
+The bot service SHALL NOT include any Slack, Teams, or Discord adapter code. It SHALL only prove that the TypeScript service starts and connects to Redis.
+
+#### Scenario: No adapter imports
+- **WHEN** inspecting the bot source code
+- **THEN** there are no imports from slack-sdk, @microsoft/teams-sdk, or discord.js
diff --git a/openspec/changes/m1-skeleton-health-pulse/specs/frontend-shell/spec.md b/openspec/changes/m1-skeleton-health-pulse/specs/frontend-shell/spec.md
new file mode 100644
index 00000000..cba8412e
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/specs/frontend-shell/spec.md
@@ -0,0 +1,77 @@
+## ADDED Requirements
+
+### Requirement: React project initialization
+The system SHALL have a `web/` directory containing a Vite + React 19 + TypeScript project with TailwindCSS and shadcn/ui configured.
+
+#### Scenario: Dev server starts
+- **WHEN** running `npm run dev` in `web/`
+- **THEN** a development server starts on port 5173 serving the React app
+
+#### Scenario: Production build succeeds
+- **WHEN** running `npm run build` in `web/`
+- **THEN** the build completes without errors producing static files in `web/dist/`
+
+### Requirement: Route structure
+The system SHALL configure React Router v7 with route stubs for: `/` (Dashboard), `/channels` (Channel list), `/channels/:id` (Channel workspace), `/search` (Search), `/graph` (Graph Explorer), `/settings` (Settings).
+
+#### Scenario: Navigation to all routes
+- **WHEN** navigating to any defined route
+- **THEN** the corresponding page component renders without errors
+
+#### Scenario: Unknown route shows 404
+- **WHEN** navigating to an undefined route
+- **THEN** a "Not Found" page is displayed
+
+### Requirement: Layout shell
+The system SHALL render a root layout with `Sidebar.tsx` (nav links with icons, collapse toggle, 240px expanded / 64px collapsed) and `Header.tsx` (page title, breadcrumb placeholder).
+
+#### Scenario: Sidebar navigation
+- **WHEN** the app loads
+- **THEN** the sidebar shows navigation links for Dashboard, Channels, Search, Graph Explorer, Settings
+
+#### Scenario: Sidebar collapse
+- **WHEN** clicking the collapse toggle
+- **THEN** the sidebar collapses from 240px to 64px, showing only icons
+
+### Requirement: HealthBadge component
+The system SHALL include a `HealthBadge.tsx` component that polls `GET /api/health` every 30 seconds and displays a status indicator: green (healthy), amber (degraded), red (unhealthy), gray (loading/unreachable).
+
+#### Scenario: Healthy status display
+- **WHEN** the health endpoint returns status "healthy"
+- **THEN** the badge shows a green indicator with "All systems operational"
+
+#### Scenario: Degraded status display
+- **WHEN** the health endpoint returns status "degraded"
+- **THEN** the badge shows an amber indicator with the names of degraded components
+
+#### Scenario: API unreachable
+- **WHEN** the health endpoint is unreachable
+- **THEN** the badge shows a gray indicator with "Unable to connect"
+
+### Requirement: API client
+The system SHALL provide `lib/api.ts` with a fetch wrapper using `VITE_API_URL` as the base URL, JSON content type defaults, and error handling that throws typed errors.
+
+#### Scenario: Successful API call
+- **WHEN** calling `api.get("/api/health")` with the backend running
+- **THEN** the response JSON is returned as a typed object
+
+### Requirement: TypeScript type definitions
+The system SHALL provide `lib/types.ts` with TypeScript interfaces mirroring backend schemas: `HealthResponse`, `ComponentHealth`, `AskResponse`, `Citation`, `WikiResponse`, `SyncResponse`, `ChannelInfo`, `MemoryTier0`, `MemoryTier1`, `MemoryTier2`.
+
+#### Scenario: Types match backend schemas
+- **WHEN** importing types from `lib/types.ts`
+- **THEN** all interfaces are available and match the field names defined in `docs/v2/12-api-design.md`
+
+### Requirement: Dashboard home page
+The system SHALL render a dashboard at `/` with stat card placeholders (channels synced, total memories, last sync time, system health) and the HealthBadge component.
+
+#### Scenario: Dashboard renders
+- **WHEN** navigating to `/`
+- **THEN** the dashboard shows placeholder stat cards and the HealthBadge
+
+### Requirement: Design tokens
+The system SHALL configure TailwindCSS with design tokens: Inter font family, slate/indigo color palette, 4px base spacing unit, and card component styles (rounded-lg, shadow-sm, border).
+
+#### Scenario: Design tokens applied
+- **WHEN** rendering any component
+- **THEN** text uses Inter font, primary colors use indigo palette, and cards have consistent rounded/shadow styling
diff --git a/openspec/changes/m1-skeleton-health-pulse/specs/health-endpoint/spec.md b/openspec/changes/m1-skeleton-health-pulse/specs/health-endpoint/spec.md
new file mode 100644
index 00000000..c252c7ed
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/specs/health-endpoint/spec.md
@@ -0,0 +1,45 @@
+## ADDED Requirements
+
+### Requirement: FastAPI application entry point
+The system SHALL provide a FastAPI application in `src/beever_atlas/server/app.py` with CORS middleware configured to allow the React dev server (localhost:5173) and production origins.
+
+#### Scenario: App starts successfully
+- **WHEN** running `uvicorn beever_atlas.server.app:app`
+- **THEN** the FastAPI server starts and accepts HTTP requests
+
+#### Scenario: CORS allows React dev server
+- **WHEN** a request arrives from `http://localhost:5173` with an `Origin` header
+- **THEN** the response includes appropriate CORS headers allowing the request
+
+### Requirement: Health endpoint
+The system SHALL expose `GET /api/health` that checks connectivity to Weaviate, Neo4j, MongoDB, and Redis. It SHALL return a `HealthResponse` JSON with overall status and per-component details including status and latency in milliseconds.
+
+#### Scenario: All services healthy
+- **WHEN** all 4 data stores are reachable
+- **THEN** response status is 200, overall status is "healthy", and each component shows status "up" with latency < timeout
+
+#### Scenario: One service down
+- **WHEN** Neo4j is unreachable but others are up
+- **THEN** response status is 200, overall status is "degraded", Neo4j component shows status "down" with error message, others show "up"
+
+#### Scenario: All services down
+- **WHEN** no data stores are reachable
+- **THEN** response status is 200, overall status is "unhealthy", all components show status "down"
+
+### Requirement: DependencyHealth registry
+The system SHALL provide an `infra/health.py` module with a `DependencyHealth` class that maintains a registry of health check functions for each dependency. Each check SHALL have a configurable timeout (default 5 seconds).
+
+#### Scenario: Register a health check
+- **WHEN** calling `registry.register("weaviate", check_fn, timeout=5.0)`
+- **THEN** the check function is stored and callable via `registry.check_all()`
+
+#### Scenario: Health check timeout
+- **WHEN** a health check function takes longer than its timeout
+- **THEN** the component reports status "down" with error "timeout"
+
+### Requirement: HealthResponse schema
+The system SHALL define Pydantic models for health responses: `HealthResponse` (status: str, components: dict, timestamp: str) and `ComponentHealth` (status: str, latency_ms: float, error: Optional[str]).
+
+#### Scenario: Response serialization
+- **WHEN** the health endpoint returns
+- **THEN** the JSON matches the schema with status being one of "healthy", "degraded", "unhealthy"
diff --git a/openspec/changes/m1-skeleton-health-pulse/specs/memories-browser/spec.md b/openspec/changes/m1-skeleton-health-pulse/specs/memories-browser/spec.md
new file mode 100644
index 00000000..51ab3138
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/specs/memories-browser/spec.md
@@ -0,0 +1,62 @@
+## ADDED Requirements
+
+### Requirement: TierBrowser layout
+The system SHALL provide a `TierBrowser.tsx` component at `/channels/:id/memories` that displays a 3-tier accordion/column layout: Tier 0 (channel summary) at top, Tier 1 (topic clusters) as expandable cards, Tier 2 (atomic facts) nested under their parent cluster.
+
+#### Scenario: Three tiers render
+- **WHEN** navigating to `/channels/test-channel/memories`
+- **THEN** the page shows the channel summary at top, topic clusters below, and atomic facts are accessible by expanding a cluster
+
+### Requirement: SummaryCard component
+The system SHALL provide a `SummaryCard.tsx` rendering the Tier 0 channel summary. It SHALL always be visible at the top of the memories view, showing the channel name, summary text, last updated timestamp, and message count.
+
+#### Scenario: Summary card display
+- **WHEN** the memories page loads
+- **THEN** the SummaryCard is visible at the top with channel name and summary text
+
+### Requirement: ClusterCard component
+The system SHALL provide a `ClusterCard.tsx` rendering Tier 1 topic clusters as expandable cards. Each card SHALL show the topic label, fact count, and date range. Expanding a card SHALL reveal the member atomic facts.
+
+#### Scenario: Cluster expansion
+- **WHEN** clicking on a ClusterCard
+- **THEN** it expands to show all member FactCards for that cluster
+
+#### Scenario: Cluster metadata display
+- **WHEN** a ClusterCard renders
+- **THEN** it shows the topic label, number of facts, and date range
+
+### Requirement: FactCard component
+The system SHALL provide a `FactCard.tsx` rendering Tier 2 atomic facts. Each card SHALL show the fact text, quality score badge (color-coded: green >= 7, amber >= 4, red < 4), timestamp, author attribution, and entity tags.
+
+#### Scenario: Fact card display
+- **WHEN** a FactCard renders
+- **THEN** it shows fact text, quality badge, timestamp, author, and tags
+
+#### Scenario: Quality score coloring
+- **WHEN** a fact has quality score 8.5
+- **THEN** the badge is green
+
+#### Scenario: Fact detail expansion
+- **WHEN** clicking on a FactCard
+- **THEN** an expanded detail view shows full metadata and a link placeholder for the original message
+
+### Requirement: MemoryFilters component
+The system SHALL provide a `MemoryFilters.tsx` component with filters for: topic (dropdown), entity (text search), minimum importance (slider), and date range (date pickers).
+
+#### Scenario: Filter by topic
+- **WHEN** selecting a topic from the dropdown
+- **THEN** only clusters and facts matching that topic are displayed
+
+### Requirement: Mock data for M1
+The system SHALL use hardcoded mock data conforming to the TypeScript types (`MemoryTier0`, `MemoryTier1`, `MemoryTier2`) to populate the memories browser. Mock data SHALL include at least 1 summary, 3 clusters, and 5 facts.
+
+#### Scenario: Mock data renders
+- **WHEN** the memories page loads with no backend
+- **THEN** mock data is displayed showing the 3-tier structure
+
+### Requirement: useMemories hook
+The system SHALL provide a `useMemories.ts` hook that manages state for the 3-tier memory data. In M1, it SHALL return mock data. The hook interface SHALL match the future API contract: `useMemories(channelId)` returning `{ summary, clusters, facts, filters, setFilters, isLoading }`.
+
+#### Scenario: Hook returns mock data
+- **WHEN** calling `useMemories("test-channel")`
+- **THEN** it returns summary, clusters, and facts with isLoading=false
diff --git a/openspec/changes/m1-skeleton-health-pulse/specs/project-scaffold/spec.md b/openspec/changes/m1-skeleton-health-pulse/specs/project-scaffold/spec.md
new file mode 100644
index 00000000..46640211
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/specs/project-scaffold/spec.md
@@ -0,0 +1,59 @@
+## ADDED Requirements
+
+### Requirement: Python package structure
+The system SHALL have a `src/beever_atlas/` Python package with the following module directories, each containing an `__init__.py`: `agents/`, `adapters/`, `pipeline/`, `stores/`, `retrieval/`, `wiki/`, `server/`, `infra/`.
+
+#### Scenario: Package is importable
+- **WHEN** a Python script runs `import beever_atlas`
+- **THEN** the import succeeds without errors
+
+#### Scenario: All submodules exist
+- **WHEN** listing directories under `src/beever_atlas/`
+- **THEN** directories `agents`, `adapters`, `pipeline`, `stores`, `retrieval`, `wiki`, `server`, `infra` each exist and contain `__init__.py`
+
+### Requirement: Project configuration files
+The system SHALL have a `pyproject.toml` at the project root defining the `beever-atlas` package with all required dependencies (FastAPI, google-adk, litellm, weaviate-client, neo4j, pymongo, redis, pydantic, uvicorn).
+
+#### Scenario: Dependencies installable
+- **WHEN** running `uv sync` in the project root
+- **THEN** all dependencies install successfully
+
+### Requirement: Docker Compose stack
+The system SHALL provide a `docker-compose.yml` defining services for: weaviate, neo4j, mongodb, redis, backend (FastAPI), frontend (React), and bot (TypeScript). Each service SHALL have health checks configured.
+
+#### Scenario: Stack starts successfully
+- **WHEN** running `docker compose up -d`
+- **THEN** all 7 services reach healthy status
+
+#### Scenario: Services are networked
+- **WHEN** the backend service starts
+- **THEN** it can reach weaviate (port 8080), neo4j (port 7687), mongodb (port 27017), and redis (port 6379) by service name
+
+### Requirement: Environment variable template
+The system SHALL provide a `.env.example` file documenting all required environment variables with placeholder values.
+
+#### Scenario: All dependencies covered
+- **WHEN** reading `.env.example`
+- **THEN** it contains entries for `WEAVIATE_URL`, `WEAVIATE_API_KEY`, `NEO4J_URI`, `NEO4J_AUTH`, `MONGODB_URI`, `REDIS_URL`, `GOOGLE_API_KEY`, `JINA_API_KEY`, `TAVILY_API_KEY`, `ANTHROPIC_API_KEY`
+
+### Requirement: Config system
+The system SHALL have a `src/beever_atlas/infra/config.py` module that loads all environment variables into a typed configuration object using Pydantic Settings. Missing required variables SHALL raise a validation error at startup.
+
+#### Scenario: Config loads from environment
+- **WHEN** all required env vars are set
+- **THEN** `get_settings()` returns a Settings object with all values populated
+
+#### Scenario: Missing required var raises error
+- **WHEN** `WEAVIATE_URL` is not set and has no default
+- **THEN** `get_settings()` raises a validation error
+
+### Requirement: LiteLLM model routing config
+The system SHALL define model routing configuration for two tiers: fast (Gemini Flash Lite with Haiku fallback) and quality (Gemini Flash with Sonnet fallback). Each agent type (fact extraction, entity extraction, query routing, response generation) SHALL be mapped to a tier.
+
+#### Scenario: Fast tier model resolution
+- **WHEN** requesting the model for query routing (fast tier)
+- **THEN** the config returns `gemini/gemini-2.0-flash-lite` as primary with `anthropic/claude-haiku-4-5` as fallback
+
+#### Scenario: Quality tier model resolution
+- **WHEN** requesting the model for response generation (quality tier)
+- **THEN** the config returns `gemini/gemini-2.0-flash` as primary with `anthropic/claude-sonnet-4-6` as fallback
diff --git a/openspec/changes/m1-skeleton-health-pulse/tasks.md b/openspec/changes/m1-skeleton-health-pulse/tasks.md
new file mode 100644
index 00000000..94d768b3
--- /dev/null
+++ b/openspec/changes/m1-skeleton-health-pulse/tasks.md
@@ -0,0 +1,61 @@
+## 1. Project Scaffold (RES-69)
+
+- [x] 1.1 Create `pyproject.toml` with uv, all Python dependencies (FastAPI, google-adk, litellm, weaviate-client, neo4j, pymongo, redis, pydantic, uvicorn, pytest)
+- [x] 1.2 Create `src/beever_atlas/__init__.py` and all submodule directories with `__init__.py` (agents, adapters, pipeline, stores, retrieval, wiki, server, infra)
+- [x] 1.3 Create `.env.example` with all required env vars
+- [x] 1.4 Create `docker-compose.yml` with all 7 services (weaviate, neo4j, mongodb, redis, backend, frontend, bot) and health checks
+- [x] 1.5 Create backend `Dockerfile`
+- [x] 1.6 Write tests: verify package imports, verify all submodules exist
+
+## 2. Config System (RES-71)
+
+- [x] 2.1 Implement `src/beever_atlas/infra/config.py` — Pydantic Settings class loading all env vars with validation
+- [x] 2.2 Implement `src/beever_atlas/infra/litellm_config.py` — model routing for fast/quality tiers
+- [x] 2.3 Write tests: config loads from env, missing vars raise error, model tier resolution
+
+## 3. ADK Agent Scaffolding (RES-91)
+
+- [x] 3.1 Implement `src/beever_atlas/agents/tools.py` — 11 FunctionTool stubs with correct signatures, docstrings, raising NotImplementedError
+- [x] 3.2 Implement `src/beever_atlas/agents/runner.py` — ADK Runner creation with InMemorySessionService, session-per-request pattern
+- [x] 3.3 Write tests: all 11 tools importable and raise NotImplementedError, runner creation works
+
+## 4. FastAPI App Shell + Health Endpoint (RES-98)
+
+- [x] 4.1 Implement `src/beever_atlas/server/app.py` — FastAPI app with CORS middleware
+- [x] 4.2 Implement `src/beever_atlas/infra/health.py` — DependencyHealth registry with timeout support
+- [x] 4.3 Implement `GET /api/health` endpoint with per-component checks (Weaviate, Neo4j, MongoDB, Redis)
+- [x] 4.4 Define Pydantic models: `HealthResponse`, `ComponentHealth`
+- [x] 4.5 Write tests: health endpoint returns correct schema, handles service up/down, CORS headers present
+
+## 5. React App Scaffold (RES-99)
+
+- [x] 5.1 Initialize Vite + React 19 + TypeScript project in `web/`
+- [x] 5.2 Configure TailwindCSS + shadcn/ui with design tokens (Inter font, slate/indigo palette)
+- [x] 5.3 Set up React Router v7 with route stubs (/, /channels, /channels/:id, /search, /graph, /settings)
+- [x] 5.4 Build `Sidebar.tsx` with nav links, icons, collapse toggle (240px/64px)
+- [x] 5.5 Build `Header.tsx` with page title
+- [x] 5.6 Build `HealthBadge.tsx` — polls /api/health every 30s, green/amber/red/gray indicator
+- [x] 5.7 Create `lib/api.ts` — fetch wrapper with VITE_API_URL, error handling
+- [x] 5.8 Create `lib/types.ts` — TypeScript interfaces matching backend schemas
+- [x] 5.9 Build Dashboard page (`/`) with stat card placeholders + HealthBadge
+- [x] 5.10 Create web `Dockerfile` for Docker Compose
+- [x] 5.11 Write tests: build succeeds, components render without errors
+
+## 6. Chat SDK Bot Placeholder (RES-100)
+
+- [x] 6.1 Initialize `bot/` Node.js TypeScript project with tsconfig
+- [x] 6.2 Implement entry point: connect to Redis, log "Bot service ready"
+- [x] 6.3 Create `bot/Dockerfile` for Docker Compose
+- [x] 6.4 Write tests: service starts, handles Redis connection failure
+
+## 7. React Memories Tab (RES-112)
+
+- [x] 7.1 Create mock data file with MemoryTier0, MemoryTier1, MemoryTier2 (1 summary, 3 clusters, 5+ facts)
+- [x] 7.2 Implement `useMemories.ts` hook returning mock data with filter state
+- [x] 7.3 Build `SummaryCard.tsx` — Tier 0 channel summary (always visible)
+- [x] 7.4 Build `ClusterCard.tsx` — Tier 1 expandable topic clusters
+- [x] 7.5 Build `FactCard.tsx` — Tier 2 atomic facts with quality badge, expandable detail
+- [x] 7.6 Build `MemoryFilters.tsx` — topic, entity, importance, date range filters
+- [x] 7.7 Build `TierBrowser.tsx` — compose all components in 3-tier layout
+- [x] 7.8 Add `/channels/:id/memories` route and wire up TierBrowser
+- [x] 7.9 Write tests: components render with mock data, filter interactions work
diff --git a/openspec/changes/m2-chatbot-echo-query/.openspec.yaml b/openspec/changes/m2-chatbot-echo-query/.openspec.yaml
new file mode 100644
index 00000000..a61e7c11
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/.openspec.yaml
@@ -0,0 +1,2 @@
+schema: spec-driven
+created: 2026-03-27
diff --git a/openspec/changes/m2-chatbot-echo-query/design.md b/openspec/changes/m2-chatbot-echo-query/design.md
new file mode 100644
index 00000000..e7be76c7
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/design.md
@@ -0,0 +1,80 @@
+## Context
+
+Milestone 1 delivered the project skeleton: FastAPI backend with health endpoint, React frontend shell, Chat SDK bot placeholder (Redis-only), Docker Compose orchestration, and ADK smoke test. The bot service currently just connects to Redis and stays alive — no Chat SDK, no Slack integration.
+
+The v2 architecture specifies a dual-path approach: Python adapters for batch historical ingestion, and a TypeScript Chat SDK bot for real-time chat interaction. M2 establishes both paths and the ADK agent pipeline that connects them to responses.
+
+**Current state:**
+- `bot/src/index.ts`: Redis connection only, no Chat SDK
+- `src/beever_atlas/`: Package init only, no endpoints beyond health
+- `web/`: Vite + React shell, no channel workspace
+- No ADK agents beyond the smoke test
+
+**Constraints:**
+- Slack is the primary test platform, but the architecture must support Teams, Discord, Linear adapters via Chat SDK
+- Python `SlackAdapter` uses `slack-sdk` for batch history (Chat SDK is TypeScript-only, cannot fetch history)
+- ADK agents use LiteLLM for model routing (gemini-2.0-flash primary, claude fallback)
+- No Weaviate/Neo4j required yet — echo agent validates the pipeline
+
+## Goals / Non-Goals
+
+**Goals:**
+- End-to-end interaction loop: Slack @mention → Chat SDK bot → FastAPI SSE endpoint → ADK agent → streamed response → Slack message
+- Python adapter layer (`NormalizedMessage` + `SlackAdapter`) ready for M3 batch ingestion
+- React channel workspace with streaming Ask tab for browser-based queries
+- Validate ADK Runner wiring without external memory stores
+
+**Non-Goals:**
+- Actual retrieval from Weaviate or Neo4j (M3/M4)
+- Wiki generation or tier consolidation (M5)
+- Multi-workspace OAuth for Slack (M8)
+- Teams/Discord/Linear adapter implementation (M8, but interfaces designed now)
+- Ingestion pipeline stages (M3)
+
+## Decisions
+
+### D1: Chat SDK as the real-time bot framework
+**Choice:** Use Vercel Chat SDK (`chat` npm package) with `@chat-adapter/slack` and `@chat-adapter/state-redis`.
+
+**Why:** The v2 spec mandates Chat SDK. It provides a unified adapter interface across Slack/Teams/Discord/Linear with normalized message handling, thread subscriptions, and JSX-based cards. Single codebase for all platforms.
+
+**Alternative considered:** Direct Slack Bolt.js — rejected because it locks us into Slack-only and doesn't provide the multi-platform adapter pattern we need.
+
+### D2: Separate Python adapter for batch ingestion
+**Choice:** Python `SlackAdapter` using `slack-sdk` for `conversations.history` / `conversations.replies` batch fetching, independent from the Chat SDK bot.
+
+**Why:** Chat SDK is TypeScript and designed for real-time webhooks, not batch history fetching. The ingestion pipeline (M3) is Python/ADK, so the adapter must be Python. Both paths produce `NormalizedMessage` for downstream processing.
+
+**Alternative considered:** Calling Slack API from TypeScript bot and forwarding to Python — rejected because it adds unnecessary network hops and the ingestion pipeline is entirely Python.
+
+### D3: SSE streaming for the Ask endpoint
+**Choice:** `POST /api/channels/:id/ask` returns `text/event-stream` with typed events: `thinking`, `tool_call`, `response_delta`, `citations`, `metadata`, `done`, `error`.
+
+**Why:** The v2 API spec requires SSE streaming. This matches the ADK Runner's streaming output and enables real-time UX in both the React frontend and the Chat SDK bot (which can post-then-edit as tokens arrive).
+
+**Alternative considered:** WebSocket — rejected because SSE is simpler, unidirectional (sufficient for Q&A), and works better with HTTP/2 proxies.
+
+### D4: Echo agent as ADK pipeline validator
+**Choice:** Minimal `LlmAgent` that receives a question and returns a formatted echo response with mock metadata (route, confidence, cost). No tools, no memory stores.
+
+**Why:** Validates the full ADK Runner → SSE streaming → response formatting pipeline without requiring Weaviate/Neo4j infrastructure. Can be swapped for the real `query_router_agent` in M3.
+
+**Alternative considered:** Stub the entire ADK layer and return hardcoded JSON — rejected because it doesn't validate the actual ADK Runner streaming behavior.
+
+### D5: BaseAdapter ABC with platform-agnostic interface
+**Choice:** Abstract `BaseAdapter` with methods: `fetch_history()`, `fetch_thread()`, `normalize_message()`, `get_channel_info()`, `list_channels()`. Platform adapters implement these.
+
+**Why:** The v2 spec lists Slack, Teams, Discord adapters. A common interface ensures the ingestion pipeline doesn't care which platform messages come from. New adapters just implement the ABC.
+
+### D6: React channel workspace with tab layout
+**Choice:** `/channels/:id` route with tab bar (Wiki | Ask | Memories | Graph | Settings). M2 implements Ask tab with SSE streaming consumer; Wiki tab as placeholder.
+
+**Why:** The v2 frontend spec requires this layout. Building the tab infrastructure now lets M3-M5 fill in tabs incrementally.
+
+## Risks / Trade-offs
+
+- **[Chat SDK beta stability]** → Mitigation: Pin exact versions, keep adapter logic thin so we can swap if needed. The core bot logic (forward to backend, format response) is simple.
+- **[Slack app permissions]** → Mitigation: Document required OAuth scopes. For M2, single-workspace bot token is sufficient.
+- **[SSE connection limits]** → Mitigation: Not a concern for M2 (dev/test scale). Production rate limiting addressed in M7.
+- **[ADK echo agent diverges from real agent]** → Mitigation: Echo agent uses the same `LlmAgent` class and Runner streaming interface as the real agent. Only the agent's instructions and tools differ.
+- **[Two Slack integration paths]** → Mitigation: Clear separation — Chat SDK owns real-time (TypeScript, webhooks), Python adapter owns batch (history fetch). They share no state except the messages themselves.
diff --git a/openspec/changes/m2-chatbot-echo-query/proposal.md b/openspec/changes/m2-chatbot-echo-query/proposal.md
new file mode 100644
index 00000000..eb1bf22f
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/proposal.md
@@ -0,0 +1,32 @@
+## Why
+
+Milestone 1 established the skeleton and health pulse. Now we need the end-to-end interaction loop: a user @mentions Beever in Slack, the system processes the query through ADK agents, and returns a response. This is the foundation for all future retrieval — without a working chat-to-response path, no downstream milestone (ingestion, retrieval, wiki) can be validated by real users. We also need the Python-side `NormalizedMessage` adapter layer to support batch ingestion of Slack history, which Milestone 3 depends on.
+
+## What Changes
+
+- **Chat SDK bot service** (`bot/`): Replace the Redis-only placeholder with a full Chat SDK (`chat` npm package) integration using `@chat-adapter/slack` and `@chat-adapter/state-redis`. Wire `onNewMention` and `onSubscribedMessage` handlers that forward queries to the Python backend's `/api/channels/:id/ask` endpoint and render responses as Slack Block Kit messages.
+- **SSE streaming Q&A endpoint**: `POST /api/channels/:id/ask` accepts a question and streams ADK Runner output as Server-Sent Events (`thinking`, `tool_call`, `response_delta`, `citations`, `metadata`, `done`, `error`).
+- **ADK echo agent**: A minimal ADK agent (query_router_agent shell) that receives a question via the SSE endpoint and returns an echo response. This validates the full ADK Runner wiring without requiring Weaviate/Neo4j.
+- **NormalizedMessage & SlackAdapter (Python)**: The platform adapter layer (`src/beever_atlas/adapters/`) with `NormalizedMessage` dataclass, `BaseAdapter` ABC, and `SlackAdapter` implementation for batch history fetching via `slack-sdk`.
+- **React channel workspace**: Tab layout (`/channels/:id`) with Wiki tab (placeholder), Ask tab (streaming SSE consumer), and basic channel list sidebar.
+
+## Capabilities
+
+### New Capabilities
+- `chat-bot`: Chat SDK bot setup with Slack adapter, webhook routing, @mention/subscription handlers, and multi-adapter architecture for future platforms (Teams, Discord, Linear)
+- `ask-endpoint`: SSE streaming `/api/channels/:id/ask` endpoint with ADK Runner integration and event protocol
+- `adk-echo-agent`: Minimal ADK agent that echoes queries back, validating the full agent pipeline wiring
+- `normalized-message`: `NormalizedMessage` model, `BaseAdapter` ABC, and `SlackAdapter` for batch message history fetching
+- `channel-workspace`: React channel workspace with tab layout, Ask tab with streaming consumer, and channel list
+
+### Modified Capabilities
+<!-- No existing specs to modify -->
+
+## Impact
+
+- **bot/**: Major rewrite — add `chat`, `@chat-adapter/slack`, `@chat-adapter/state-redis` dependencies; new webhook route, event handlers, Slack Block Kit formatter
+- **src/beever_atlas/**: New `adapters/` package, new `api/ask.py` endpoint, new `agents/echo.py` ADK agent
+- **web/**: New channel workspace page, Ask tab component, SSE streaming hook
+- **docker-compose.yml**: Bot service needs `SLACK_BOT_TOKEN`, `SLACK_SIGNING_SECRET` env vars
+- **Dependencies**: `chat`, `@chat-adapter/slack`, `@chat-adapter/state-redis` (npm); `slack-sdk` (Python)
+- **Linear issues**: RES-96 (Chat SDK bot), RES-101 (SSE endpoint), RES-102 (React workspace), RES-75 (NormalizedMessage + SlackAdapter)
diff --git a/openspec/changes/m2-chatbot-echo-query/specs/adk-echo-agent/spec.md b/openspec/changes/m2-chatbot-echo-query/specs/adk-echo-agent/spec.md
new file mode 100644
index 00000000..f4a9fad2
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/specs/adk-echo-agent/spec.md
@@ -0,0 +1,33 @@
+## ADDED Requirements
+
+### Requirement: Echo agent as root ADK agent
+The system SHALL provide an ADK `LlmAgent` named `query_router_agent` that receives a question from session state and returns a formatted echo response. The agent SHALL use the fast-tier model (`gemini-2.0-flash-lite` via LiteLLM).
+
+#### Scenario: Echo agent processes a question
+- **WHEN** the ADK Runner invokes the echo agent with session state `{"question": "what is our tech stack?"}`
+- **THEN** the agent returns a response containing the original question echoed back with a preamble indicating it is an echo response
+
+### Requirement: Echo response metadata
+The echo agent SHALL include metadata in its response: `route: "echo"`, `confidence: 1.0`, `cost_usd: 0.0`. This validates the metadata pipeline without real routing.
+
+#### Scenario: Echo agent returns metadata
+- **WHEN** the echo agent completes processing
+- **THEN** the response includes metadata with route "echo", confidence 1.0, and cost_usd 0.0
+
+### Requirement: Agent module structure for future replacement
+The echo agent SHALL be defined in `src/beever_atlas/agents/echo.py` and exported via `src/beever_atlas/agents/__init__.py` as `root_agent`. The module structure SHALL match the v2 ADK integration spec so the echo agent can be replaced by the real `query_router_agent` in M3/M4 without changing the Runner wiring.
+
+#### Scenario: Swapping echo agent for real agent
+- **WHEN** a developer replaces the echo agent implementation in `agents/__init__.py` with the real `query_router_agent`
+- **THEN** the ask endpoint and ADK Runner continue to work without modification because they reference `root_agent` from the agents package
+
+### Requirement: Agent configuration via environment
+The agent SHALL read its LLM model configuration from environment variables (`LLM_FAST_MODEL`, `LLM_QUALITY_MODEL`) with defaults matching the v2 spec (`gemini-2.0-flash-lite`, `gemini-2.0-flash`).
+
+#### Scenario: Default model configuration
+- **WHEN** no model environment variables are set
+- **THEN** the agent uses `gemini-2.0-flash-lite` as the default model
+
+#### Scenario: Custom model override
+- **WHEN** `LLM_FAST_MODEL` is set to `claude-haiku-4-5`
+- **THEN** the agent uses `claude-haiku-4-5` as its model
diff --git a/openspec/changes/m2-chatbot-echo-query/specs/ask-endpoint/spec.md b/openspec/changes/m2-chatbot-echo-query/specs/ask-endpoint/spec.md
new file mode 100644
index 00000000..bdf9f973
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/specs/ask-endpoint/spec.md
@@ -0,0 +1,48 @@
+## ADDED Requirements
+
+### Requirement: POST /api/channels/:id/ask endpoint
+The system SHALL expose `POST /api/channels/:id/ask` that accepts a JSON body with `question` (string, required), `include_citations` (boolean, default true), and `max_results` (integer, default 10). The endpoint SHALL return `Content-Type: text/event-stream`.
+
+#### Scenario: Valid ask request
+- **WHEN** a client sends `POST /api/channels/C123/ask` with `{"question": "what is our tech stack?"}`
+- **THEN** the endpoint returns HTTP 200 with `Content-Type: text/event-stream` and begins streaming SSE events
+
+#### Scenario: Missing question field
+- **WHEN** a client sends `POST /api/channels/C123/ask` with `{}`
+- **THEN** the endpoint returns HTTP 422 with a validation error
+
+#### Scenario: Empty question string
+- **WHEN** a client sends `POST /api/channels/C123/ask` with `{"question": ""}`
+- **THEN** the endpoint returns HTTP 422 with a validation error
+
+### Requirement: SSE event protocol
+The endpoint SHALL stream events in the following format: `event: <type>\ndata: <json>\n\n`. The event types SHALL be:
+- `thinking`: `{"text": "<reasoning step>"}` — agent's chain-of-thought
+- `tool_call`: `{"name": "<tool>", "result_summary": "<brief>"}` — tool invocation
+- `response_delta`: `{"delta": "<text chunk>"}` — incremental answer tokens
+- `citations`: `{"items": [<citation objects>]}` — citation list
+- `metadata`: `{"route": "<route>", "confidence": <float>, "cost_usd": <float>}` — response metadata
+- `done`: `{}` — stream complete
+- `error`: `{"message": "<error>", "code": "<error_code>"}` — error occurred
+
+#### Scenario: Successful streaming response
+- **WHEN** the ADK agent processes a query successfully
+- **THEN** the endpoint streams events in order: zero or more `thinking` events, zero or more `tool_call` events, one or more `response_delta` events, one `citations` event, one `metadata` event, and one `done` event
+
+#### Scenario: Agent error during processing
+- **WHEN** the ADK agent raises an exception during processing
+- **THEN** the endpoint streams an `error` event with the error message and closes the stream
+
+### Requirement: ADK Runner integration
+The endpoint SHALL create an ADK Runner instance, invoke the root agent with the user's question as input, and stream the agent's output as SSE events. The Runner SHALL use ADK session state to pass the channel ID and question to the agent.
+
+#### Scenario: Runner invokes the root agent
+- **WHEN** a request arrives at the ask endpoint
+- **THEN** the system creates an ADK Runner, sets `channel_id` and `question` in session state, and runs the root agent
+
+### Requirement: Request cancellation on client disconnect
+The endpoint SHALL detect when the client closes the SSE connection and cancel the in-progress ADK Runner execution to avoid wasted compute.
+
+#### Scenario: Client disconnects mid-stream
+- **WHEN** the client closes the SSE connection before the `done` event
+- **THEN** the server cancels the ADK Runner execution and cleans up resources
diff --git a/openspec/changes/m2-chatbot-echo-query/specs/channel-workspace/spec.md b/openspec/changes/m2-chatbot-echo-query/specs/channel-workspace/spec.md
new file mode 100644
index 00000000..8e8e238d
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/specs/channel-workspace/spec.md
@@ -0,0 +1,59 @@
+## ADDED Requirements
+
+### Requirement: Channel workspace route
+The frontend SHALL provide a route at `/channels/:id` that renders the channel workspace layout. The layout SHALL include a channel header (channel name and platform badge) and a tab bar.
+
+#### Scenario: Navigate to a channel workspace
+- **WHEN** a user navigates to `/channels/C123`
+- **THEN** the channel workspace renders with the channel name in the header and the tab bar visible
+
+### Requirement: Tab bar with five tabs
+The channel workspace SHALL display a tab bar with tabs: Wiki, Ask, Memories, Graph, Settings. The default active tab SHALL be Wiki. Clicking a tab SHALL switch the content area to that tab's component.
+
+#### Scenario: Default tab is Wiki
+- **WHEN** a user navigates to `/channels/C123` without a tab parameter
+- **THEN** the Wiki tab is active and its content is displayed
+
+#### Scenario: Switch to Ask tab
+- **WHEN** a user clicks the "Ask" tab
+- **THEN** the Ask tab becomes active and the Ask component is displayed
+
+### Requirement: Ask tab with streaming input
+The Ask tab SHALL provide a text input for questions and a response area. When a user submits a question, the tab SHALL call `POST /api/channels/:id/ask` and consume the SSE stream, rendering response tokens incrementally as they arrive.
+
+#### Scenario: Submit a question and see streaming response
+- **WHEN** a user types "what is our tech stack?" and submits
+- **THEN** the Ask tab shows a loading indicator, then progressively renders the response text as `response_delta` events arrive, and displays citations and metadata after the `done` event
+
+#### Scenario: Display thinking steps
+- **WHEN** the SSE stream includes `thinking` events
+- **THEN** the Ask tab renders thinking steps in a collapsible section above the response
+
+#### Scenario: Display error from stream
+- **WHEN** the SSE stream includes an `error` event
+- **THEN** the Ask tab displays an error message to the user
+
+### Requirement: Wiki tab placeholder
+The Wiki tab SHALL display a placeholder message indicating that wiki content will be available after channel sync (M3). This is a non-functional placeholder for M2.
+
+#### Scenario: View Wiki tab
+- **WHEN** a user clicks the Wiki tab
+- **THEN** a placeholder message is displayed: "Wiki will be available after channel sync."
+
+### Requirement: Channel list sidebar
+The frontend SHALL provide a sidebar listing available channels. Each channel entry SHALL show the channel name and platform icon. Clicking a channel SHALL navigate to `/channels/:id`.
+
+#### Scenario: View channel list
+- **WHEN** the frontend loads
+- **THEN** the sidebar displays a list of channels fetched from `GET /api/channels`
+
+#### Scenario: Click a channel to navigate
+- **WHEN** a user clicks "general" in the channel list
+- **THEN** the browser navigates to `/channels/C123` and the workspace loads
+
+### Requirement: SSE streaming hook
+The frontend SHALL provide a custom React hook `useAsk(channelId)` that manages the SSE connection to `/api/channels/:id/ask`. The hook SHALL return: `ask(question)` function, `response` (accumulated text), `thinking` (array of thinking steps), `citations` (array), `metadata` (object), `isStreaming` (boolean), and `error` (string | null).
+
+#### Scenario: Hook manages SSE lifecycle
+- **WHEN** `ask("what is X?")` is called
+- **THEN** the hook opens an SSE connection, accumulates `response_delta` events into `response`, sets `isStreaming` to true during streaming and false after `done`, and populates `citations` and `metadata` from their respective events
diff --git a/openspec/changes/m2-chatbot-echo-query/specs/chat-bot/spec.md b/openspec/changes/m2-chatbot-echo-query/specs/chat-bot/spec.md
new file mode 100644
index 00000000..8927640e
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/specs/chat-bot/spec.md
@@ -0,0 +1,59 @@
+## ADDED Requirements
+
+### Requirement: Chat SDK initialization with Slack adapter
+The bot service SHALL initialize a `Chat` instance with `@chat-adapter/slack` and `@chat-adapter/state-redis`. The bot SHALL read `SLACK_BOT_TOKEN`, `SLACK_SIGNING_SECRET`, and `REDIS_URL` from environment variables. The Chat instance SHALL be configured with `userName: "beever"`.
+
+#### Scenario: Bot starts successfully with valid credentials
+- **WHEN** the bot service starts with valid `SLACK_BOT_TOKEN`, `SLACK_SIGNING_SECRET`, and `REDIS_URL`
+- **THEN** the Chat SDK initializes without error and the bot is ready to receive webhooks
+
+#### Scenario: Bot fails gracefully with missing credentials
+- **WHEN** the bot service starts without `SLACK_BOT_TOKEN` or `SLACK_SIGNING_SECRET`
+- **THEN** the bot SHALL log an error message and exit with a non-zero code
+
+### Requirement: Webhook route for Slack events
+The bot service SHALL expose a POST route at `/api/slack` that delegates to `bot.webhooks.slack` for Slack event processing. The route SHALL handle webhook verification challenges automatically via the Chat SDK.
+
+#### Scenario: Slack sends a URL verification challenge
+- **WHEN** Slack sends a `url_verification` challenge to `POST /api/slack`
+- **THEN** the bot responds with the challenge token and HTTP 200
+
+#### Scenario: Slack sends an event payload
+- **WHEN** Slack sends an `event_callback` payload to `POST /api/slack`
+- **THEN** the Chat SDK parses and routes the event to the appropriate handler
+
+### Requirement: @mention handler forwards query to backend
+The bot SHALL register an `onNewMention` handler that extracts the user's question text (stripping the @mention prefix), calls `POST /api/channels/:id/ask` on the Python backend with the question, and posts the streamed response back to the Slack thread. The handler SHALL call `thread.subscribe()` to enable follow-up messages.
+
+#### Scenario: User @mentions Beever with a question
+- **WHEN** a user sends "@beever what is our deployment process?" in a Slack channel
+- **THEN** the bot extracts "what is our deployment process?", calls the backend ask endpoint with the channel ID, and posts the response in the same thread
+
+#### Scenario: Bot subscribes to thread after first mention
+- **WHEN** the bot processes an @mention
+- **THEN** the bot calls `thread.subscribe()` so subsequent messages in the thread trigger `onSubscribedMessage`
+
+### Requirement: Subscribed message handler for follow-up queries
+The bot SHALL register an `onSubscribedMessage` handler that forwards follow-up messages in subscribed threads to the backend ask endpoint, maintaining conversational context within the thread.
+
+#### Scenario: User sends a follow-up in a subscribed thread
+- **WHEN** a user sends a follow-up message in a thread where the bot was previously mentioned
+- **THEN** the bot forwards the message to the backend ask endpoint and posts the response in the same thread
+
+### Requirement: Response formatting as Slack Block Kit
+The bot SHALL format backend responses as Slack Block Kit blocks containing: an answer section (markdown text), a citations section (if citations are present), and a route badge (semantic/graph/hybrid). The bot SHALL use `thread.post()` to send formatted responses.
+
+#### Scenario: Backend returns a response with citations
+- **WHEN** the backend streams a response with answer text and citation objects
+- **THEN** the bot posts a Slack message with the answer as a markdown section block and citations as a context block
+
+#### Scenario: Backend returns an error
+- **WHEN** the backend returns an error event in the SSE stream
+- **THEN** the bot posts an error message in the thread indicating the query could not be processed
+
+### Requirement: Multi-adapter architecture
+The Chat SDK setup SHALL be structured so that additional adapters (Teams, Discord, Linear) can be added by importing the adapter package and adding it to the `adapters` config object. No changes to handler logic SHALL be required when adding a new platform adapter.
+
+#### Scenario: Adding a new platform adapter
+- **WHEN** a developer adds `@chat-adapter/discord` to dependencies and adds `discord: createDiscordAdapter()` to the adapters config
+- **THEN** the existing `onNewMention` and `onSubscribedMessage` handlers work for Discord without modification
diff --git a/openspec/changes/m2-chatbot-echo-query/specs/normalized-message/spec.md b/openspec/changes/m2-chatbot-echo-query/specs/normalized-message/spec.md
new file mode 100644
index 00000000..4f45ce73
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/specs/normalized-message/spec.md
@@ -0,0 +1,106 @@
+## ADDED Requirements
+
+### Requirement: NormalizedMessage dataclass
+The system SHALL provide a `NormalizedMessage` dataclass in `src/beever_atlas/adapters/base.py` with the following fields: `content` (str), `author` (str), `platform` (str enum: "slack" | "teams" | "discord"), `channel_id` (str), `channel_name` (str), `message_id` (str), `timestamp` (datetime), `thread_id` (str | None), `attachments` (list), `reactions` (list), `reply_count` (int), `raw_metadata` (dict).
+
+#### Scenario: Create a NormalizedMessage from Slack data
+- **WHEN** a Slack message payload is normalized
+- **THEN** a `NormalizedMessage` is created with `platform="slack"`, `channel_id` from the Slack channel, and `timestamp` parsed from Slack's `ts` field
+
+### Requirement: BaseAdapter abstract class
+The system SHALL provide a `BaseAdapter` ABC in `src/beever_atlas/adapters/base.py` with the following abstract methods:
+- `async fetch_history(channel_id, since, limit) -> list[NormalizedMessage]`
+- `async fetch_thread(channel_id, thread_id) -> list[NormalizedMessage]`
+- `async get_channel_info(channel_id) -> ChannelInfo`
+- `async list_channels() -> list[ChannelInfo]`
+
+And a concrete method:
+- `normalize_message(raw) -> NormalizedMessage` — platform-specific normalization
+
+#### Scenario: Implementing a new platform adapter
+- **WHEN** a developer creates a `TeamsAdapter` extending `BaseAdapter`
+- **THEN** the developer MUST implement `fetch_history`, `fetch_thread`, `get_channel_info`, `list_channels`, and `normalize_message`
+
+### Requirement: ChannelInfo model
+The system SHALL provide a `ChannelInfo` dataclass with fields: `channel_id` (str), `name` (str), `platform` (str), `member_count` (int | None), `topic` (str | None), `purpose` (str | None).
+
+#### Scenario: Retrieve channel info
+- **WHEN** `get_channel_info("C123")` is called on a SlackAdapter
+- **THEN** a `ChannelInfo` is returned with the channel's name, member count, topic, and purpose from the Slack API
+
+### Requirement: SlackAdapter implementation
+The system SHALL provide a `SlackAdapter` in `src/beever_atlas/adapters/slack.py` that extends `BaseAdapter` and uses `slack_sdk.web.async_client.AsyncWebClient` for API calls. The adapter SHALL:
+- Read `SLACK_BOT_TOKEN` from environment
+- Implement `fetch_history` using `conversations.history` API with pagination
+- Implement `fetch_thread` using `conversations.replies` API
+- Implement `get_channel_info` using `conversations.info` API
+- Implement `list_channels` using `conversations.list` API
+- Handle Slack rate limits with exponential backoff
+
+#### Scenario: Fetch channel history with pagination
+- **WHEN** `fetch_history("C123", limit=500)` is called and the channel has 500+ messages
+- **THEN** the adapter paginates through Slack API responses using `cursor` until the limit is reached or no more messages exist
+
+#### Scenario: Fetch history since a timestamp
+- **WHEN** `fetch_history("C123", since=datetime(2024,1,1))` is called
+- **THEN** the adapter passes `oldest` parameter to the Slack API to only fetch messages after that timestamp
+
+#### Scenario: Handle Slack rate limiting
+- **WHEN** the Slack API returns HTTP 429 with a `Retry-After` header
+- **THEN** the adapter waits for the specified duration before retrying the request
+
+#### Scenario: Missing SLACK_BOT_TOKEN
+- **WHEN** a `SlackAdapter` is created without `SLACK_BOT_TOKEN` in the environment
+- **THEN** the adapter raises a `ConfigurationError` with a descriptive message
+
+### Requirement: MockAdapter with fixture data
+The system SHALL provide a `MockAdapter` in `src/beever_atlas/adapters/mock.py` that extends `BaseAdapter` and reads from JSON fixture files instead of calling any platform API. The `MockAdapter` SHALL be activated when `ADAPTER_MOCK=true` is set in the environment. It SHALL be usable as a drop-in replacement for `SlackAdapter` in tests, local development, and CI/CD without requiring platform credentials.
+
+#### Scenario: MockAdapter loads fixture conversations
+- **WHEN** `MockAdapter` is initialized with `ADAPTER_MOCK=true`
+- **THEN** it loads conversation data from `tests/fixtures/slack_conversations.json` and serves it via `fetch_history`, `fetch_thread`, `get_channel_info`, and `list_channels`
+
+#### Scenario: MockAdapter used in tests without Slack credentials
+- **WHEN** tests run in CI/CD without `SLACK_BOT_TOKEN`
+- **THEN** the system uses `MockAdapter` and all adapter-dependent tests pass using fixture data
+
+#### Scenario: MockAdapter returns realistic multi-person conversations
+- **WHEN** `fetch_history("C_MOCK_GENERAL", limit=100)` is called on `MockAdapter`
+- **THEN** it returns `NormalizedMessage` objects from multiple authors with varied timestamps, thread replies, reactions, and realistic content
+
+### Requirement: Conversation fixture data
+The system SHALL provide fixture files at `tests/fixtures/` containing realistic multi-person Slack conversations. The fixtures SHALL include:
+- At least 6 mock users with distinct roles (e.g., engineer, PM, designer, QA, tech lead, DevOps)
+- At least 2 mock channels (`#general`, `#engineering`)
+- At least 100 messages spanning multiple days
+- Thread conversations (parent + replies) with 3+ participants per thread
+- Message patterns: technical discussions, architecture decisions, bug reports, standup updates, casual chat
+- Reactions on messages (thumbsup, eyes, white_check_mark)
+- Code snippet messages and link-sharing messages
+- Messages with varying signal-to-noise ratio (some chatter, some high-value decisions)
+
+#### Scenario: Fixtures contain decision-making conversations
+- **WHEN** a developer reads `tests/fixtures/slack_conversations.json`
+- **THEN** the data includes at least 3 decision threads where team members discuss options and reach a conclusion (useful for M3-M6 ingestion and retrieval testing)
+
+#### Scenario: Fixtures contain temporal patterns
+- **WHEN** fixture messages are sorted by timestamp
+- **THEN** they span at least 14 days with realistic distribution (weekday clusters, quiet weekends)
+
+### Requirement: Adapter factory with mock support
+The system SHALL provide an `get_adapter(platform)` factory function in `src/beever_atlas/adapters/__init__.py` that returns `MockAdapter` when `ADAPTER_MOCK=true`, otherwise returns the real platform adapter (e.g., `SlackAdapter`). This allows all API endpoints and the bot to seamlessly switch between real and mock data.
+
+#### Scenario: Factory returns MockAdapter in dev mode
+- **WHEN** `get_adapter("slack")` is called with `ADAPTER_MOCK=true` in environment
+- **THEN** a `MockAdapter` instance is returned
+
+#### Scenario: Factory returns SlackAdapter in production
+- **WHEN** `get_adapter("slack")` is called without `ADAPTER_MOCK` in environment
+- **THEN** a `SlackAdapter` instance is returned
+
+### Requirement: Adapters package exports
+The `src/beever_atlas/adapters/__init__.py` SHALL export `NormalizedMessage`, `BaseAdapter`, `ChannelInfo`, `SlackAdapter`, `MockAdapter`, and `get_adapter`.
+
+#### Scenario: Import adapter classes
+- **WHEN** a developer writes `from beever_atlas.adapters import SlackAdapter, MockAdapter, NormalizedMessage, get_adapter`
+- **THEN** the imports resolve successfully
diff --git a/openspec/changes/m2-chatbot-echo-query/tasks.md b/openspec/changes/m2-chatbot-echo-query/tasks.md
new file mode 100644
index 00000000..f5e3a45c
--- /dev/null
+++ b/openspec/changes/m2-chatbot-echo-query/tasks.md
@@ -0,0 +1,65 @@
+## 1. Python Adapter Layer (NormalizedMessage + SlackAdapter)
+
+- [x] 1.1 Create `src/beever_atlas/adapters/__init__.py` with package exports
+- [x] 1.2 Create `src/beever_atlas/adapters/base.py` with `NormalizedMessage` dataclass, `ChannelInfo` dataclass, and `BaseAdapter` ABC
+- [x] 1.3 Add `slack-sdk` dependency to `pyproject.toml`
+- [x] 1.4 Create `src/beever_atlas/adapters/slack.py` with `SlackAdapter` implementing `fetch_history`, `fetch_thread`, `get_channel_info`, `list_channels` using `AsyncWebClient`
+- [x] 1.5 Add rate-limit handling with exponential backoff in `SlackAdapter`
+- [x] 1.6 Create `src/beever_atlas/adapters/mock.py` with `MockAdapter` that reads from fixture JSON files, activated via `ADAPTER_MOCK=true`
+- [x] 1.7 Add `get_adapter(platform)` factory function in `__init__.py` that returns `MockAdapter` or real adapter based on env
+- [x] 1.8 Create `tests/fixtures/slack_conversations.json` with realistic multi-person conversations (6+ users, 2+ channels, 100+ messages, threads, reactions, decisions, code snippets, spanning 14+ days)
+- [x] 1.9 Write tests for `NormalizedMessage`, `SlackAdapter` (mocked API), `MockAdapter` (fixture data), and `get_adapter` factory
+
+## 2. ADK Echo Agent
+
+- [x] 2.1 Create `src/beever_atlas/agents/__init__.py` exporting `root_agent`
+- [x] 2.2 Create `src/beever_atlas/agents/echo.py` with echo `LlmAgent` that reads question from session state and returns formatted echo response with metadata
+- [x] 2.3 Add model configuration via `LLM_FAST_MODEL` / `LLM_QUALITY_MODEL` env vars with defaults
+- [x] 2.4 Write tests for echo agent (verify response format and metadata)
+
+## 3. SSE Streaming Ask Endpoint
+
+- [x] 3.1 Create `src/beever_atlas/api/ask.py` with `POST /api/channels/:id/ask` endpoint
+- [x] 3.2 Implement SSE event streaming with typed events (`thinking`, `response_delta`, `citations`, `metadata`, `done`, `error`)
+- [x] 3.3 Wire ADK Runner to invoke `root_agent` and stream output as SSE events
+- [x] 3.4 Add request validation (question required, non-empty)
+- [x] 3.5 Implement client disconnect detection and Runner cancellation
+- [x] 3.6 Register the ask router in the FastAPI app
+- [x] 3.7 Write tests for ask endpoint (SSE event format, validation errors, streaming)
+
+## 4. Chat SDK Bot (Slack Adapter)
+
+- [x] 4.1 Add `chat`, `@chat-adapter/slack`, `@chat-adapter/state-redis` dependencies to `bot/package.json`
+- [x] 4.2 Rewrite `bot/src/index.ts` to initialize Chat SDK with Slack adapter and Redis state
+- [x] 4.3 Add webhook route handler (`POST /api/slack` → `bot.webhooks.slack`)
+- [x] 4.4 Implement `onNewMention` handler: extract question, call backend `/api/channels/:id/ask`, post response
+- [x] 4.5 Implement `onSubscribedMessage` handler for follow-up messages in threads
+- [x] 4.6 Create Slack Block Kit response formatter (answer block, citations block, route badge)
+- [x] 4.7 Add SSE client to consume backend streaming response and accumulate for posting
+- [x] 4.8 Add environment variable validation and graceful startup/shutdown
+- [x] 4.9 Write tests for bot handlers and response formatting
+
+## 5. Channels & Messages API Endpoints
+
+- [x] 5.1 Create `src/beever_atlas/api/channels.py` with `GET /api/channels` (list channels via adapter) and `GET /api/channels/:id` (channel info)
+- [x] 5.2 Add `GET /api/channels/:id/messages` endpoint returning paginated `NormalizedMessage` list from `SlackAdapter.fetch_history()`
+- [x] 5.3 Register channels router in the FastAPI app
+- [x] 5.4 Write tests for channels and messages endpoints
+
+## 6. React Channel Workspace
+
+- [x] 6.1 Create `useAsk(channelId)` custom hook for SSE streaming (ask function, response accumulation, thinking, citations, metadata, isStreaming, error)
+- [x] 6.2 Create `ChannelWorkspace` component with tab bar (Wiki, Ask, Messages, Graph, Settings)
+- [x] 6.3 Create `AskTab` component with question input, streaming response display, collapsible thinking steps, citations, and metadata
+- [x] 6.4 Create `WikiTab` placeholder component
+- [x] 6.5 Create `MessagesTab` component fetching from `GET /api/channels/:id/messages` with pagination and message display
+- [x] 6.6 Create `ChannelList` sidebar component fetching from `GET /api/channels`
+- [x] 6.7 Add `/channels/:id` route to the React router
+- [x] 6.8 Write tests for `useAsk` hook and component rendering
+
+## 7. Integration & Docker
+
+- [x] 7.1 Update `docker-compose.yml` with Slack env vars (`SLACK_BOT_TOKEN`, `SLACK_SIGNING_SECRET`) and bot service backend URL
+- [x] 7.2 Update bot `Dockerfile` to install Chat SDK dependencies and build TypeScript
+- [x] 7.3 Run full integration test: bot startup → mock Slack webhook → backend ask endpoint → SSE response
+- [x] 7.4 Update Linear issue statuses as tasks are completed
diff --git a/openspec/changes/messages-tab-enhancement/.openspec.yaml b/openspec/changes/messages-tab-enhancement/.openspec.yaml
new file mode 100644
index 00000000..c430c5fa
--- /dev/null
+++ b/openspec/changes/messages-tab-enhancement/.openspec.yaml
@@ -0,0 +1,2 @@
+schema: spec-driven
+created: 2026-04-03
diff --git a/openspec/changes/messages-tab-enhancement/design.md b/openspec/changes/messages-tab-enhancement/design.md
new file mode 100644
index 00000000..51639ab2
--- /dev/null
+++ b/openspec/changes/messages-tab-enhancement/design.md
@@ -0,0 +1,55 @@
+## Context
+
+The Messages tab (`MessagesTab.tsx`) fetches up to 100 messages from `GET /api/channels/{id}/messages` and renders them oldest-first in a card layout. The API proxies to the bot bridge (`SlackBridge` or `DiscordBridge`), which calls the platform's message history API. There is no pagination, no filtering, no sort control, and timestamps show only relative time ("3h ago").
+
+The bot bridge already normalizes messages into a `NormalizedMessage` format across platforms. The Discord bridge uses REST API directly; the Slack bridge uses the Slack SDK's `conversations.history`.
+
+## Goals / Non-Goals
+
+**Goals:**
+- Users can browse full message history with cursor-based pagination
+- Users can sort messages newest-first (default) or oldest-first
+- Users can search and filter messages by text, author, date range, and attachments
+- Messages display full timestamps and are grouped by date
+- Message list auto-refreshes to show new activity
+- Activity sparkline gives at-a-glance volume context
+
+**Non-Goals:**
+- Server-side full-text search (out of scope — client-side filter on loaded messages only)
+- Real-time WebSocket streaming (polling is sufficient for this phase)
+- Message editing or deletion from the UI
+- Infinite scroll (explicit "Load more" button preferred for predictability)
+
+## Decisions
+
+### 1. Cursor-based pagination using `before` message ID
+**Choice**: Use `before=<message_id>` cursor instead of offset-based pagination.
+**Why**: Both Discord and Slack APIs natively support `before`/`latest` cursor params. Offset pagination breaks when messages are added/deleted between pages. Cursor pagination is stable and maps 1:1 to platform APIs.
+**Alternative rejected**: Offset pagination (`skip=100`) — fragile under concurrent writes, not supported natively by platform APIs.
+
+### 2. Client-side filtering only
+**Choice**: Search and filters operate on currently loaded messages in the browser.
+**Why**: Avoids building a search index. Messages are already loaded into state. For most channels, loading 200-500 messages covers the useful history. Platform APIs don't support text search on history.
+**Alternative rejected**: Server-side Elasticsearch — over-engineered for this feature, adds infrastructure dependency.
+
+### 3. Sort order via API `order` param
+**Choice**: Add `order=desc|asc` to the API. Default `desc` (newest first). The bridge fetches accordingly.
+**Why**: Slack returns newest-first by default; Discord returns newest-first by default. Matching the default avoids a re-sort. The API param lets the frontend toggle without client-side re-sorting of potentially large lists.
+**Alternative rejected**: Client-side sort only — breaks with pagination (you'd need all messages loaded to sort properly).
+
+### 4. Auto-refresh via polling with deduplication
+**Choice**: Poll every 30s for messages newer than the latest loaded message ID. Prepend new messages with a toast notification.
+**Why**: Simple, reliable, no WebSocket infrastructure needed. The `since` param already exists in the API.
+**Alternative rejected**: WebSocket/SSE push — requires new infrastructure, overkill for a monitoring UI.
+
+### 5. SVG sparkline without external chart library
+**Choice**: Render the activity sparkline as a simple inline SVG polyline.
+**Why**: Avoids adding recharts (~200KB) for a single tiny chart. A 7-day bar/line sparkline is trivial with SVG.
+**Alternative rejected**: recharts/visx — heavy dependency for minimal use.
+
+## Risks / Trade-offs
+
+- **[Client-side filter on large message sets]** → If a channel has 10K+ messages loaded via repeated "Load more", filtering may lag. Mitigation: Cap loaded messages at ~1000, add a "showing X of Y" indicator.
+- **[Rate limiting on pagination]** → Rapid "Load more" clicks could hit Discord/Slack rate limits. Mitigation: Debounce the button, show loading state, retry with backoff (already implemented for Discord).
+- **[Auto-refresh race condition]** → Polling while user is loading more could cause duplicates. Mitigation: Deduplicate by message_id in state, pause polling during pagination loads.
+- **[Sparkline data source]** → No pre-aggregated message counts exist. Mitigation: Compute from loaded messages for now; this gives a rough activity shape for the loaded window.
diff --git a/openspec/changes/messages-tab-enhancement/proposal.md b/openspec/changes/messages-tab-enhancement/proposal.md
new file mode 100644
index 00000000..0534d159
--- /dev/null
+++ b/openspec/changes/messages-tab-enhancement/proposal.md
@@ -0,0 +1,31 @@
+## Why
+
+The Messages tab currently loads a flat list of 100 messages in oldest-first order with no pagination, no filtering, and only relative timestamps. Users cannot browse historical messages, find specific conversations, or understand temporal context. This makes the channel message view unusable for any channel with meaningful volume.
+
+## What Changes
+
+- Default message order reversed to newest-first (most recent messages shown first)
+- Full absolute timestamps shown on hover, with date group separators ("Today", "Yesterday", "Mar 28, 2026")
+- Cursor-based pagination via `before` parameter, with "Load more" button and total count display
+- Sort toggle (newest/oldest first) in the message list header
+- Client-side search and filter bar: text search, filter by author, date range, has-attachments
+- Jump-to-date picker for navigating to a specific date's messages
+- Auto-refresh polling (30-60s) with "New messages" toast notification
+- Message volume sparkline chart showing activity per day
+
+## Capabilities
+
+### New Capabilities
+- `message-pagination`: Cursor-based pagination with `before` parameter across API, bridge, and UI layers
+- `message-filtering`: Client-side search, author filter, date range filter, attachment filter in the Messages tab
+- `message-display-enhancements`: Date separators, full timestamps, sort toggle, auto-refresh, and activity sparkline
+
+### Modified Capabilities
+<!-- No existing spec-level capabilities are changing requirements -->
+
+## Impact
+
+- **API**: `GET /api/channels/{channel_id}/messages` gains `before` (cursor) and `order` params
+- **Bot bridge**: `DiscordBridge.getMessages` and `SlackBridge.getMessages` need `before` param forwarding
+- **Frontend**: `MessagesTab.tsx` — major refactor for pagination state, filters, date separators, sparkline
+- **Dependencies**: May need a lightweight chart library (e.g., recharts) for the sparkline, or implement with SVG
diff --git a/openspec/changes/messages-tab-enhancement/specs/message-display-enhancements/spec.md b/openspec/changes/messages-tab-enhancement/specs/message-display-enhancements/spec.md
new file mode 100644
index 00000000..1e7adddc
--- /dev/null
+++ b/openspec/changes/messages-tab-enhancement/specs/message-display-enhancements/spec.md
@@ -0,0 +1,59 @@
+## ADDED Requirements
+
+### Requirement: Date group separators between messages
+The Messages tab SHALL insert visual date separators between messages from different calendar days. Separators SHALL display "Today", "Yesterday", or the formatted date (e.g., "Mar 28, 2026").
+
+#### Scenario: Messages spanning multiple days
+- **WHEN** loaded messages span 3 different calendar days
+- **THEN** 3 date separator headers are rendered between the appropriate message groups
+
+#### Scenario: All messages from today
+- **WHEN** all loaded messages are from today
+- **THEN** a single "Today" separator is shown above the first message
+
+### Requirement: Full timestamp on hover
+Each message SHALL display the full absolute timestamp (e.g., "Apr 3, 2026, 1:45:32 PM") in a tooltip when the user hovers over the relative time display.
+
+#### Scenario: Hover to see full timestamp
+- **WHEN** user hovers over "3h ago" on a message
+- **THEN** a tooltip shows "Apr 3, 2026, 1:45:32 PM"
+
+### Requirement: Sort order toggle
+The Messages tab header SHALL include a toggle to switch between newest-first and oldest-first sort order. The default order SHALL be newest-first. Changing the sort order SHALL re-fetch messages from the API with the new order.
+
+#### Scenario: Toggle to oldest first
+- **WHEN** user clicks the sort toggle from "Newest first" to "Oldest first"
+- **THEN** messages are re-fetched with `order=asc` and displayed oldest-first
+
+#### Scenario: Default sort order
+- **WHEN** user opens the Messages tab for the first time
+- **THEN** messages are displayed newest-first
+
+### Requirement: Auto-refresh with new message notification
+The Messages tab SHALL poll for new messages every 30 seconds while the tab is active. When new messages are detected, a toast/banner SHALL appear indicating the count of new messages. Clicking the toast SHALL scroll to or reveal the new messages.
+
+#### Scenario: New messages detected
+- **WHEN** polling detects 3 new messages since the last fetch
+- **THEN** a banner appears: "3 new messages" with a click-to-reveal action
+
+#### Scenario: Tab not active
+- **WHEN** user navigates away from the Messages tab
+- **THEN** polling SHALL stop to avoid unnecessary API calls
+
+### Requirement: Message activity sparkline
+The Messages tab header area SHALL display a small sparkline chart showing message volume per day for the loaded messages. The sparkline SHALL be rendered as inline SVG without external chart dependencies.
+
+#### Scenario: Sparkline rendering
+- **WHEN** loaded messages span 7 days with varying daily counts
+- **THEN** a sparkline with 7 bars/points is rendered showing relative daily volume
+
+#### Scenario: Single day of messages
+- **WHEN** all loaded messages are from one day
+- **THEN** sparkline shows a single bar/point
+
+### Requirement: Jump to date
+The Messages tab SHALL provide a date picker that, when a date is selected, fetches messages from that date. The fetch SHALL use appropriate API parameters to load messages around the selected date.
+
+#### Scenario: Jump to specific date
+- **WHEN** user selects "Mar 15, 2026" from the date picker
+- **THEN** messages from March 15, 2026 are fetched and displayed, replacing the current view
diff --git a/openspec/changes/messages-tab-enhancement/specs/message-filtering/spec.md b/openspec/changes/messages-tab-enhancement/specs/message-filtering/spec.md
new file mode 100644
index 00000000..760ae44e
--- /dev/null
+++ b/openspec/changes/messages-tab-enhancement/specs/message-filtering/spec.md
@@ -0,0 +1,44 @@
+## ADDED Requirements
+
+### Requirement: Client-side text search across loaded messages
+The Messages tab SHALL provide a search input that filters the displayed messages by matching against message content. Filtering SHALL be case-insensitive and update results as the user types.
+
+#### Scenario: Search by keyword
+- **WHEN** user types "deployment" in the search box
+- **THEN** only messages containing "deployment" (case-insensitive) are displayed
+
+#### Scenario: Clear search
+- **WHEN** user clears the search input
+- **THEN** all loaded messages are displayed again
+
+### Requirement: Filter messages by author
+The Messages tab SHALL provide an author filter dropdown populated with the unique authors from loaded messages. Selecting an author SHALL show only their messages.
+
+#### Scenario: Filter by single author
+- **WHEN** user selects "Alice" from the author filter
+- **THEN** only messages authored by "Alice" are displayed
+
+#### Scenario: Clear author filter
+- **WHEN** user clears the author filter
+- **THEN** all loaded messages are displayed
+
+### Requirement: Filter messages by date range
+The Messages tab SHALL provide date range inputs (from/to) that filter displayed messages to only those within the selected range.
+
+#### Scenario: Filter by date range
+- **WHEN** user sets date range from "2026-03-01" to "2026-03-15"
+- **THEN** only messages with timestamps within that range are displayed
+
+### Requirement: Filter messages with attachments
+The Messages tab SHALL provide a toggle to show only messages that contain attachments.
+
+#### Scenario: Toggle attachment filter
+- **WHEN** user enables "Has attachments" filter
+- **THEN** only messages with at least one attachment are displayed
+
+### Requirement: Combined filters work together
+All filters (search, author, date range, attachments) SHALL be combinable. The displayed messages SHALL be the intersection of all active filters.
+
+#### Scenario: Multiple filters active
+- **WHEN** user searches "bug" AND filters by author "Bob" AND enables "Has attachments"
+- **THEN** only messages from Bob containing "bug" that have attachments are displayed
diff --git a/openspec/changes/messages-tab-enhancement/specs/message-pagination/spec.md b/openspec/changes/messages-tab-enhancement/specs/message-pagination/spec.md
new file mode 100644
index 00000000..35b62cdb
--- /dev/null
+++ b/openspec/changes/messages-tab-enhancement/specs/message-pagination/spec.md
@@ -0,0 +1,45 @@
+## ADDED Requirements
+
+### Requirement: API supports cursor-based pagination with before parameter
+The `GET /api/channels/{channel_id}/messages` endpoint SHALL accept an optional `before` query parameter containing a message ID. When provided, the API SHALL return messages older than the specified message. The endpoint SHALL also accept an `order` parameter (`asc` or `desc`) defaulting to `desc`.
+
+#### Scenario: Fetch first page (no cursor)
+- **WHEN** client requests `/api/channels/{id}/messages?limit=50`
+- **THEN** API returns the 50 most recent messages in newest-first order
+
+#### Scenario: Fetch next page with before cursor
+- **WHEN** client requests `/api/channels/{id}/messages?limit=50&before=msg_abc123`
+- **THEN** API returns 50 messages older than `msg_abc123` in newest-first order
+
+#### Scenario: Fetch messages in ascending order
+- **WHEN** client requests `/api/channels/{id}/messages?limit=50&order=asc`
+- **THEN** API returns the 50 oldest messages in oldest-first order
+
+### Requirement: Bridge forwards before parameter to platform APIs
+The bot bridge SHALL forward the `before` parameter to the underlying platform API (Discord REST `?before=`, Slack `conversations.history` `latest` param) when provided.
+
+#### Scenario: Discord bridge pagination
+- **WHEN** bridge receives a getMessages request with `before=msg_id`
+- **THEN** bridge calls Discord REST API with `?before=msg_id&limit=N`
+
+#### Scenario: Slack bridge pagination
+- **WHEN** bridge receives a getMessages request with `before=msg_ts`
+- **THEN** bridge calls Slack `conversations.history` with `latest=msg_ts`
+
+### Requirement: UI displays Load More button for pagination
+The Messages tab SHALL display a "Load more" button at the bottom of the message list when the current page returned exactly `limit` messages (indicating more may exist). Clicking it SHALL fetch the next page using the oldest loaded message's ID as the `before` cursor.
+
+#### Scenario: Load more messages
+- **WHEN** user clicks "Load more" and 50 messages were previously loaded
+- **THEN** UI fetches next page with `before=<oldest_message_id>` and appends results to the list
+
+#### Scenario: No more messages available
+- **WHEN** a fetch returns fewer messages than the requested limit
+- **THEN** the "Load more" button SHALL not be displayed
+
+### Requirement: UI displays message count context
+The Messages tab header SHALL display the count of currently loaded messages (e.g., "150 messages loaded").
+
+#### Scenario: Message count display
+- **WHEN** user has loaded 150 messages across 3 pages
+- **THEN** header shows "150 messages loaded"
diff --git a/openspec/changes/messages-tab-enhancement/tasks.md b/openspec/changes/messages-tab-enhancement/tasks.md
new file mode 100644
index 00000000..a60a8c93
--- /dev/null
+++ b/openspec/changes/messages-tab-enhancement/tasks.md
@@ -0,0 +1,55 @@
+## 1. API & Bridge — Pagination Support
+
+- [x] 1.1 Add `before` and `order` query params to `GET /api/channels/{channel_id}/messages` in `channels.py`
+- [x] 1.2 Update `DiscordBridge.getMessages` in `bridge.ts` to forward `before` param to Discord REST API
+- [x] 1.3 Update `SlackBridge.getMessages` in `bridge.ts` to forward `before` as `latest` param to Slack SDK
+- [x] 1.4 Update bridge `/bridge/connections/{connId}/channels/{id}/messages` route to parse and forward `before` and `order` query params
+
+## 2. Frontend — Sort & Pagination
+
+- [x] 2.1 Change default message order to newest-first (`order=desc`) in `MessagesTab.tsx`
+- [x] 2.2 Add sort toggle button (Newest/Oldest) to the message list header that re-fetches with `order` param
+- [x] 2.3 Implement "Load more" button that fetches next page using `before=<oldest_message_id>` cursor
+- [x] 2.4 Track pagination state: append loaded pages, disable button when no more messages, show loading state
+- [x] 2.5 Update header to show loaded message count (e.g., "150 messages loaded")
+
+## 3. Frontend — Date Separators & Timestamps
+
+- [x] 3.1 Add date group separators between messages from different calendar days ("Today", "Yesterday", "Mar 28, 2026")
+- [x] 3.2 Add `title` attribute with full absolute timestamp to the relative time display for hover tooltip
+- [x] 3.3 Ensure date separators work correctly in both newest-first and oldest-first sort orders
+
+## 4. Frontend — Search & Filters
+
+- [x] 4.1 Add search input to message list header for client-side text filtering (case-insensitive content match)
+- [x] 4.2 Add author filter dropdown populated from unique authors in loaded messages
+- [x] 4.3 Add date range filter (from/to date inputs)
+- [x] 4.4 Add "Has attachments" toggle filter
+- [x] 4.5 Combine all filters with intersection logic — update displayed messages reactively
+
+## 5. Frontend — Auto-Refresh
+
+- [x] 5.1 Implement 30-second polling interval that fetches messages newer than the latest loaded message
+- [x] 5.2 Deduplicate incoming messages by `message_id` before prepending to state
+- [x] 5.3 Show "N new messages" toast/banner when new messages arrive, with click-to-reveal
+- [x] 5.4 Pause polling when Messages tab is not active; resume on tab focus
+
+## 6. Frontend — Jump to Date
+
+- [x] 6.1 Add date picker component in the header area
+- [x] 6.2 On date selection, fetch messages from that date using `since` param and replace current view
+- [x] 6.3 Reset pagination state and filters when jumping to a new date
+
+## 7. Frontend — Activity Sparkline
+
+- [x] 7.1 Compute daily message counts from loaded messages
+- [x] 7.2 Render inline SVG sparkline (bar chart) in the header showing daily volume
+- [x] 7.3 Update sparkline when more messages are loaded via pagination
+
+## 8. Testing & Verification
+
+- [x] 8.1 Verify pagination works end-to-end: API → Bridge → Discord/Slack → UI
+- [x] 8.2 Verify sort toggle re-fetches and displays messages in correct order
+- [x] 8.3 Verify all filters work individually and in combination
+- [x] 8.4 Verify auto-refresh detects and displays new messages without duplicates
+- [x] 8.5 Verify date separators render correctly across timezone boundaries
diff --git a/openspec/changes/multi-workspace-connections/.openspec.yaml b/openspec/changes/multi-workspace-connections/.openspec.yaml
new file mode 100644
index 00000000..0f528039
--- /dev/null
+++ b/openspec/changes/multi-workspace-connections/.openspec.yaml
@@ -0,0 +1,2 @@
+schema: spec-driven
+created: 2026-04-01
diff --git a/openspec/changes/multi-workspace-connections/design.md b/openspec/changes/multi-workspace-connections/design.md
new file mode 100644
index 00000000..e592ec22
--- /dev/null
+++ b/openspec/changes/multi-workspace-connections/design.md
@@ -0,0 +1,98 @@
+## Context
+
+Beever Atlas ingests messages from team communication platforms (Slack, Discord, Teams, Telegram) to build a knowledge graph. The system has three layers:
+
+1. **Backend** (Python/FastAPI + MongoDB) — manages platform connections, stores encrypted credentials, exposes REST API, runs ingestion via `ChatBridgeAdapter`
+2. **Bot** (TypeScript) — runs chat adapters via a `ChatManager` registry, proxies messages through a bridge HTTP server, receives webhooks
+3. **Frontend** (React/Vite) — Settings page for managing connections, wizard for onboarding new platforms
+
+Currently, a `UNIQUE` index on `platform` in MongoDB and an explicit API check enforce one connection per platform. The bot's `ChatManager` uses a `Map<string, AdapterEntry>` keyed by platform name, which overwrites on duplicate registration. The frontend renders a fixed 2x2 grid of platform cards. Webhooks arrive at per-platform endpoints (`POST /api/slack`) and the ingestion pipeline uses `ChatBridgeAdapter` which calls legacy bridge routes without connection context.
+
+### Key SDK constraint (investigated)
+
+The Chat SDK (v4.23.0) stores adapters in a flat `Record<string, Adapter>` and creates webhook handlers dynamically keyed by the same string. It does **not** parse adapter keys — they are opaque strings. Thread IDs are generated by adapters with a hardcoded platform prefix (e.g., `slack:CHANNEL:TS`), independent of the adapter key. Composite keys like `"slack:conn-1"` work for storage and webhook dispatch (`bot.webhooks["slack:conn-1"](req)`), but `bot.webhooks.slack` would no longer exist.
+
+## Goals / Non-Goals
+
+**Goals:**
+- Allow N connections per platform across all layers (backend, bot, frontend)
+- Make `display_name` required for UI-created connections so users can distinguish workspaces
+- Maintain backward compatibility for env-sourced connections (single Slack from `.env` still works)
+- Keep the Chat SDK's immutable-adapter constraint satisfied (rebuild on any change)
+- Correctly route webhooks and bridge requests to the specific connection that owns the data
+
+**Non-Goals:**
+- Cross-workspace search or unified channel views (future work)
+- OAuth-based connection flows (currently manual token entry)
+- Connection health monitoring or auto-reconnect (existing behavior unchanged)
+- Multi-tenant / org-level isolation
+
+## Decisions
+
+### 1. Composite adapter keys in ChatManager
+
+**Decision**: Use `{platform}:{connectionId}` as the adapter map key (e.g., `slack:abc-123`).
+
+**Rationale**: The Chat SDK accepts `adapters: Record<string, Adapter>`. Using composite keys avoids collisions while keeping the SDK's flat map structure. The ChatManager already rebuilds the entire Chat instance on any change, so the key format is internal. SDK investigation confirmed keys are opaque strings — no parsing occurs.
+
+**Critical implementation detail**: The `rebuild()` method currently uses `if (platform === "slack")` to select adapter factories. After this change, the loop must **parse the platform portion** from the composite key (`key.split(":")[0]`) for factory selection, then pass the full composite key to the Chat SDK adapter map.
+
+**Alternative considered**: Nested `Map<string, Map<string, AdapterEntry>>` — rejected because it adds complexity without benefit since the Chat SDK needs a flat map anyway.
+
+### 2. Per-connection webhook endpoints
+
+**Decision**: Add `POST /api/webhooks/{connectionId}` as the primary webhook route. Each connection gets its own webhook URL. Keep legacy `POST /api/slack` etc. working by trying all adapters for that platform (for backward compat during migration).
+
+**Rationale**: With multiple Slack apps, each has a different signing secret. The adapter's `handleWebhook()` verifies signatures internally. A per-connection endpoint routes directly to the right adapter. This is also the natural enterprise pattern — each Slack app is configured with its own webhook URL.
+
+**Alternative considered**: Single platform endpoint + try-all-adapters — rejected as primary approach because it's wasteful and could have false-positive verification issues. Kept as legacy fallback only.
+
+### 3. Connection-ID-routed bridge API
+
+**Decision**: Add connection-scoped bridge routes: `GET /bridge/connections/{connectionId}/channels`, `GET /bridge/connections/{connectionId}/channels/{channelId}/messages`, etc. Update backend helper functions (`_register_adapter`, `_unregister_adapter`, `_list_bridge_channels`) to accept and use connection ID. Keep legacy `/bridge/platforms/{platform}/channels` aggregating across all connections with `connection_id` in each response object.
+
+**Rationale**: The backend's `list_connection_channels()` and `validate_connection()` endpoints have the connection ID available but currently call platform-scoped bridge routes, which would return channels from the wrong workspace. Connection-scoped routes solve this precisely.
+
+**Alternative considered**: Only updating legacy routes to filter server-side — rejected because the bridge doesn't know which connection owns which channel without the caller telling it.
+
+### 4. ChatBridgeAdapter connection-awareness
+
+**Decision**: Make `ChatBridgeAdapter` (the Python ingestion adapter) connection-aware. Add an optional `connection_id` parameter. When set, route requests through connection-scoped bridge routes (`/bridge/connections/{connectionId}/channels/{id}/messages`). When unset, fall back to legacy routes for backward compatibility.
+
+**Rationale**: The ingestion pipeline uses `ChatBridgeAdapter` to fetch message history. With multiple connections, `fetch_history()` calling `/bridge/channels/{channel_id}/messages` would use `getFirstBridge()` in the bot, which arbitrarily picks one adapter. This would fail for channels belonging to other connections.
+
+### 5. Settings page: connection list with grouped headers
+
+**Decision**: Replace the fixed platform grid with a flat list of active connections, each showing platform icon + display name + status. An "Add Connection" button opens a platform picker dialog, then the existing ConnectionWizard.
+
+**Rationale**: When you have 5 Slack workspaces, a per-platform card doesn't scale. A connection-centric list naturally handles 1 or 20 connections. The "Add Connection" flow replaces the per-card Connect button.
+
+**Alternative considered**: Expandable platform cards showing nested connections — rejected because it's more complex and the empty-state (no connections yet) is awkward.
+
+### 6. display_name required for UI connections
+
+**Decision**: Make `display_name` a required field in the ConnectionWizard (step 1). Env-sourced connections auto-name as `"{Platform} (env)"`.
+
+**Rationale**: With multiple workspaces, users need to tell them apart. Making it required avoids "Slack", "Slack (2)", "Slack (3)" anti-patterns.
+
+### 7. Drop unique index, add compound index
+
+**Decision**: Drop the `UNIQUE` index on `platform`. Add a non-unique index on `(platform, source)` to support env-migration queries.
+
+**Rationale**: The unique index is the root database blocker. The compound index replaces the query pattern in `_migrate_env_connection()` which needs to check "does an env-sourced connection for this specific platform already exist?"
+
+## Risks / Trade-offs
+
+- **Chat SDK rebuild cost** → Each new connection triggers a full Chat instance rebuild (shutdown + create). With many connections this could cause brief message gaps. Mitigation: rebuilds are already fast (<1s); monitor in production.
+- **Env migration idempotency** → Changing the migration check from "any env connection exists" to "env connection for this platform exists" could create duplicate env connections on redeploy if someone also added the same platform via UI. Mitigation: env migration checks by `(platform, source="env")` specifically.
+- **Legacy webhook try-all cost** → Legacy `POST /api/slack` tries each Slack adapter sequentially. With many connections, the first N-1 will fail verification before the right one succeeds. Mitigation: legacy endpoints are transitional; new connections should use per-connection webhook URLs.
+- **Bridge route ambiguity** → Legacy routes like `GET /bridge/platforms/slack/channels` aggregate across multiple Slack connections. If channels have the same name in different workspaces, the response may contain duplicates. Mitigation: include `connection_id` field in channel response objects. Backend endpoints (`list_connection_channels`, `validate_connection`) use connection-scoped routes to avoid this.
+- **Adapter rollback during failed creation** → If `_register_adapter` succeeds but connection persistence fails, the rollback calls `_unregister_adapter`. Post-change, this must use the connection ID, but the connection hasn't been persisted yet. Mitigation: generate the connection ID before registration and pass it throughout the create flow.
+
+## Migration Plan
+
+1. **Database**: On startup, `platform_store.startup()` drops the old unique index and creates the new compound index. No data migration needed — existing single-connection data is valid.
+2. **Backend API**: Deploy backend first. The 409 check removal is backward compatible (fewer errors, not more). Updated helper functions pass connection IDs.
+3. **Bot**: Deploy bot with updated ChatManager. Startup sync passes connection IDs. New per-connection webhook routes added alongside legacy ones.
+4. **Frontend**: Deploy frontend last. New Settings page works with both single and multiple connections.
+5. **Rollback**: Re-add the unique index constraint. Any extra connections created during the window would need manual cleanup, but this is low-risk given the short window.
diff --git a/openspec/changes/multi-workspace-connections/proposal.md b/openspec/changes/multi-workspace-connections/proposal.md
new file mode 100644
index 00000000..e2ce6473
--- /dev/null
+++ b/openspec/changes/multi-workspace-connections/proposal.md
@@ -0,0 +1,31 @@
+## Why
+
+Enterprise organizations routinely operate multiple workspaces on the same platform — separate Slack workspaces per department, multiple Discord servers for different communities, distinct Teams tenants for subsidiaries. The current one-connection-per-platform constraint blocks enterprise adoption. This needs to ship before any enterprise pilot.
+
+## What Changes
+
+- **BREAKING**: Remove the unique-per-platform constraint at database and API levels, allowing N connections per platform
+- **BREAKING**: Redesign the bot's `ChatManager` registry from `Map<string, AdapterEntry>` (keyed by platform) to support composite keys (`platform:connectionId`), enabling multiple adapters of the same platform type
+- **BREAKING**: Redesign the Settings page from a fixed 4-card platform grid to a dynamic connection list with an "Add Connection" flow
+- Require `display_name` for UI-created connections (needed to distinguish multiple workspaces of the same platform)
+- Add connection ID to bot bridge routes and the internal credentials API response
+- Extend the `PlatformConnection` model's platform literal to include `"teams" | "telegram"`
+
+## Capabilities
+
+### New Capabilities
+- `multi-connection-backend`: Backend support for multiple connections per platform — drop unique index, remove 409 guard, add connection ID to bridge adapter registration/unregistration, update credentials endpoint response
+- `multi-connection-bot`: Bot ChatManager and bridge refactor — composite adapter keys, connection-ID-aware routes and handlers, multi-adapter startup sync
+- `multi-connection-frontend`: Settings page redesign — dynamic connection list grouped by platform, "Add Connection" button opening platform picker, updated PlatformCard to always show connection identity
+
+### Modified Capabilities
+
+_(none — no existing specs)_
+
+## Impact
+
+- **Database**: Drop `UNIQUE` index on `platform` in `platform_connections` collection; add compound index on `(platform, source)` for env-migration queries
+- **Backend API**: `POST /api/connections` removes 409 duplicate check; `DELETE /bridge/adapters/{platform}` becomes `DELETE /bridge/adapters/{connectionId}`; internal credentials endpoint adds `connection_id` field
+- **Bot**: `ChatManager` registry key changes from `platform` to `platform:connectionId`; all bridge handler functions gain `connectionId` parameter; Chat SDK adapter map uses composite keys
+- **Frontend**: `SettingsPage` completely rewritten; `PlatformCard` updated to always render from a connection (no more "empty platform" state — that moves to the add-connection flow); `ConnectionWizard` requires display_name
+- **Env migration**: `_migrate_env_connection()` filters by platform+source instead of just source
diff --git a/openspec/changes/multi-workspace-connections/specs/multi-connection-backend/spec.md b/openspec/changes/multi-workspace-connections/specs/multi-connection-backend/spec.md
new file mode 100644
index 00000000..bf7c1337
--- /dev/null
+++ b/openspec/changes/multi-workspace-connections/specs/multi-connection-backend/spec.md
@@ -0,0 +1,99 @@
+## ADDED Requirements
+
+### Requirement: Multiple connections per platform
+The system SHALL allow creating multiple `PlatformConnection` records with the same `platform` value. The database SHALL NOT enforce a unique constraint on the `platform` field.
+
+#### Scenario: Create second Slack connection
+- **WHEN** a user creates a connection with `platform: "slack"` and a Slack connection already exists
+- **THEN** the system SHALL create the new connection and return `201 Created`
+
+#### Scenario: Database index migration
+- **WHEN** the application starts and a unique index on `platform` exists
+- **THEN** the system SHALL drop the unique index and create a non-unique compound index on `(platform, source)`
+
+### Requirement: Connection ID in adapter registration
+The `_register_adapter()` helper SHALL pass `connection_id` to the bot bridge when registering adapters.
+
+#### Scenario: Register adapter with connection ID
+- **WHEN** a new connection is created via `POST /api/connections`
+- **THEN** the system SHALL call `POST /bridge/adapters` with `{ platform, credentials, connectionId }` where `connectionId` is the connection's `id` field
+
+#### Scenario: Validate connection passes connection ID
+- **WHEN** `POST /api/connections/{id}/validate` is called
+- **THEN** `_register_adapter()` SHALL be called with the connection's `id` as `connection_id`
+
+### Requirement: Connection ID in adapter unregistration
+The `_unregister_adapter()` helper SHALL use connection ID to target the specific adapter.
+
+#### Scenario: Unregister adapter by connection ID
+- **WHEN** a connection is deleted via `DELETE /api/connections/{id}`
+- **THEN** the system SHALL call `DELETE /bridge/adapters/{connectionId}` using the connection's `id`
+
+#### Scenario: Rollback on failed creation
+- **WHEN** a connection creation fails after adapter registration succeeds
+- **THEN** `_unregister_adapter()` SHALL use the connection's `id` (generated before registration) to unregister the correct adapter
+
+### Requirement: Connection-scoped channel listing
+The `_list_bridge_channels()` helper SHALL support connection-scoped requests.
+
+#### Scenario: List channels for a specific connection
+- **WHEN** `list_connection_channels()` is called with a connection ID
+- **THEN** it SHALL call `GET /bridge/connections/{connectionId}/channels` instead of the platform-aggregated route
+
+#### Scenario: Validate connection uses connection-scoped channels
+- **WHEN** `validate_connection()` lists channels to verify access
+- **THEN** it SHALL use the connection-scoped channel route with the connection's ID
+
+### Requirement: Connection ID in credentials endpoint
+The internal credentials endpoint SHALL include connection ID in each entry.
+
+#### Scenario: Fetch credentials for startup sync
+- **WHEN** the bot calls `GET /api/internal/connections/credentials`
+- **THEN** each entry in the response array SHALL include `connection_id`, `platform`, `credentials`, and `status`
+
+### Requirement: display_name required for UI connections
+The system SHALL require a non-empty `display_name` for connections created via the UI (`source: "ui"`).
+
+#### Scenario: Create connection without display_name
+- **WHEN** a user creates a connection via `POST /api/connections` with empty or missing `display_name`
+- **THEN** the system SHALL return `422 Unprocessable Entity` with an error message
+
+#### Scenario: Env-sourced connections auto-name
+- **WHEN** the system creates an env-sourced connection during startup migration
+- **THEN** the connection SHALL have `display_name` set to `"{Platform} (env)"` (e.g., `"Slack (env)"`)
+
+### Requirement: Platform literal includes all supported platforms
+The `PlatformConnection` model's `platform` field SHALL accept `"slack" | "discord" | "teams" | "telegram"`.
+
+#### Scenario: Create Teams connection
+- **WHEN** a user creates a connection with `platform: "teams"`
+- **THEN** the system SHALL accept and persist the connection
+
+### Requirement: Env migration scoped to platform
+The env migration logic SHALL only skip creation if an env-sourced connection for the same platform already exists.
+
+#### Scenario: Env Slack migration with existing UI Slack
+- **WHEN** `SLACK_BOT_TOKEN` is set in env and a UI-sourced Slack connection exists but no env-sourced Slack connection exists
+- **THEN** the system SHALL create the env-sourced Slack connection
+
+### Requirement: ChatBridgeAdapter connection-awareness
+The `ChatBridgeAdapter` SHALL accept an optional `connection_id` parameter and route requests through connection-scoped bridge endpoints when set.
+
+#### Scenario: Fetch history for a specific connection
+- **WHEN** `fetch_history()` is called on an adapter with `connection_id` set
+- **THEN** it SHALL request `GET /bridge/connections/{connectionId}/channels/{channelId}/messages`
+
+#### Scenario: List channels for a specific connection
+- **WHEN** `list_channels()` is called on an adapter with `connection_id` set
+- **THEN** it SHALL request `GET /bridge/connections/{connectionId}/channels`
+
+#### Scenario: Backward-compatible mode
+- **WHEN** `ChatBridgeAdapter` is created without `connection_id`
+- **THEN** it SHALL use legacy routes (`/bridge/channels/...`) as today
+
+### Requirement: Generate connection ID before registration
+The connection creation flow SHALL generate the connection ID before calling `_register_adapter()` so that the same ID is used for both registration and persistence.
+
+#### Scenario: Connection ID consistency
+- **WHEN** `POST /api/connections` is called
+- **THEN** the connection ID SHALL be generated first, passed to `_register_adapter()`, and then used to persist the connection document
diff --git a/openspec/changes/multi-workspace-connections/specs/multi-connection-bot/spec.md b/openspec/changes/multi-workspace-connections/specs/multi-connection-bot/spec.md
new file mode 100644
index 00000000..c1076ec2
--- /dev/null
+++ b/openspec/changes/multi-workspace-connections/specs/multi-connection-bot/spec.md
@@ -0,0 +1,124 @@
+## ADDED Requirements
+
+### Requirement: Composite adapter keys in ChatManager
+The `ChatManager` SHALL use `{platform}:{connectionId}` as the adapter registry key, allowing multiple adapters of the same platform type.
+
+#### Scenario: Register two Slack adapters
+- **WHEN** `register("slack", creds1, "conn-1")` and `register("slack", creds2, "conn-2")` are called
+- **THEN** the adapter map SHALL contain both entries keyed as `"slack:conn-1"` and `"slack:conn-2"`
+- **AND** the rebuilt Chat instance SHALL have both adapters active
+
+#### Scenario: Unregister one of multiple adapters
+- **WHEN** `unregister("slack", "conn-1")` is called and `"slack:conn-2"` exists
+- **THEN** only `"slack:conn-1"` SHALL be removed
+- **AND** the Chat instance SHALL rebuild with `"slack:conn-2"` still active
+
+### Requirement: Rebuild extracts platform from composite key
+The `rebuild()` method SHALL parse the platform portion from composite keys (e.g., `"slack"` from `"slack:conn-1"`) to select the correct adapter factory, while passing the full composite key to the Chat SDK adapter map.
+
+#### Scenario: Adapter factory selection with composite keys
+- **WHEN** `rebuild()` iterates over adapters with key `"slack:conn-1"`
+- **THEN** it SHALL use `"slack"` (extracted from the key) to select the Slack adapter factory
+- **AND** pass `"slack:conn-1"` as the key in the Chat SDK `adapters` map
+
+#### Scenario: Mixed platform adapters
+- **WHEN** adapters include `"slack:conn-1"`, `"slack:conn-2"`, and `"discord:conn-3"`
+- **THEN** rebuild SHALL create two Slack adapters and one Discord adapter, all with their composite keys
+
+### Requirement: Connection ID in register/unregister API
+The `register` and `unregister` methods SHALL accept an optional `connectionId` parameter. When omitted, the system SHALL fall back to using platform name as the key (backward compatibility for env-sourced connections).
+
+#### Scenario: Register with connection ID
+- **WHEN** the bridge receives `POST /bridge/adapters` with `{ platform: "slack", credentials: {...}, connectionId: "abc-123" }`
+- **THEN** `ChatManager.register("slack", credentials, "abc-123")` SHALL be called
+
+#### Scenario: Register without connection ID (legacy)
+- **WHEN** the bridge receives `POST /bridge/adapters` with `{ platform: "slack", credentials: {...} }` and no `connectionId`
+- **THEN** `ChatManager.register("slack", credentials)` SHALL be called with key `"slack:slack"` (platform as fallback ID)
+
+### Requirement: Per-connection webhook endpoints
+The bot SHALL support `POST /api/webhooks/{connectionId}` to route webhook requests directly to the correct adapter.
+
+#### Scenario: Webhook for specific connection
+- **WHEN** `POST /api/webhooks/abc-123` is received and adapter `"slack:abc-123"` exists
+- **THEN** the bot SHALL call `bot.webhooks["slack:abc-123"](request)` and return the response
+
+#### Scenario: Unknown connection ID
+- **WHEN** `POST /api/webhooks/unknown-id` is received and no adapter with that connection ID exists
+- **THEN** the bot SHALL return `404 Not Found`
+
+### Requirement: Legacy platform webhook fallback
+Legacy webhook endpoints (`POST /api/slack`, etc.) SHALL try all adapters for that platform sequentially.
+
+#### Scenario: Legacy Slack webhook with multiple connections
+- **WHEN** `POST /api/slack` is received and two Slack adapters exist (`"slack:conn-1"`, `"slack:conn-2"`)
+- **THEN** the bot SHALL try each Slack adapter's `handleWebhook()` until one succeeds (returns non-error status)
+- **AND** return that adapter's response
+
+#### Scenario: Legacy webhook with single connection
+- **WHEN** `POST /api/slack` is received and one Slack adapter exists
+- **THEN** the bot SHALL route to that adapter (no change in behavior)
+
+### Requirement: Connection-scoped bridge routes
+The bridge SHALL support routes that target a specific connection by ID.
+
+#### Scenario: Delete adapter by connection ID
+- **WHEN** `DELETE /bridge/adapters/{connectionId}` is received
+- **THEN** the bridge SHALL find the adapter with matching connection ID and unregister it
+
+#### Scenario: List channels for specific connection
+- **WHEN** `GET /bridge/connections/{connectionId}/channels` is received
+- **THEN** the bridge SHALL return channels from only that connection's adapter
+
+#### Scenario: Fetch messages for specific connection
+- **WHEN** `GET /bridge/connections/{connectionId}/channels/{channelId}/messages` is received
+- **THEN** the bridge SHALL fetch messages using only that connection's adapter
+
+#### Scenario: Get channel info for specific connection
+- **WHEN** `GET /bridge/connections/{connectionId}/channels/{channelId}` is received
+- **THEN** the bridge SHALL return channel info from that connection's adapter
+
+### Requirement: Legacy platform routes aggregate connections
+Existing routes that use platform name SHALL aggregate results across all connections for that platform.
+
+#### Scenario: List channels by platform with multiple connections
+- **WHEN** `GET /bridge/platforms/slack/channels` is received and two Slack connections exist
+- **THEN** the response SHALL include channels from both Slack connections
+- **AND** each channel object SHALL include a `connection_id` field identifying which connection it belongs to
+
+### Requirement: Startup sync with connection IDs
+The bot startup sync SHALL use connection IDs from the backend credentials endpoint.
+
+#### Scenario: Load multiple connections at startup
+- **WHEN** the bot fetches `GET /api/internal/connections/credentials` and receives two Slack entries with different `connection_id` values
+- **THEN** `register` SHALL be called once per entry with the respective `connectionId`
+- **AND** the Chat instance SHALL have both adapters active
+
+### Requirement: ChatManager adapter lookup by connection ID
+The `ChatManager` SHALL support looking up adapters by connection ID.
+
+#### Scenario: Get adapter by connection ID
+- **WHEN** `getAdapterByConnectionId("conn-1")` is called and `"slack:conn-1"` is registered
+- **THEN** it SHALL return the adapter entry with platform `"slack"` and connectionId `"conn-1"`
+
+#### Scenario: Get all adapters for a platform
+- **WHEN** `getAdaptersByPlatform("slack")` is called and `"slack:conn-1"` and `"slack:conn-2"` exist
+- **THEN** it SHALL return both adapter entries
+
+### Requirement: ChatManager lists adapters with metadata
+The `listAdapters` method SHALL return platform, connection ID, and adapter reference for each registered adapter.
+
+#### Scenario: List adapters
+- **WHEN** `listAdapters()` is called with two Slack and one Discord adapter registered
+- **THEN** the result SHALL contain three entries, each with `{ platform, connectionId, adapter }`
+
+### Requirement: getBridge connection-awareness
+The `getBridge()` function SHALL support lookup by connection ID in addition to platform name.
+
+#### Scenario: Get bridge by connection ID
+- **WHEN** `getBridge(chatManager, "slack", "conn-1")` is called
+- **THEN** it SHALL return the bridge for the adapter keyed `"slack:conn-1"`
+
+#### Scenario: Get bridge by platform only (legacy)
+- **WHEN** `getBridge(chatManager, "slack")` is called without connection ID and one Slack adapter exists
+- **THEN** it SHALL return the bridge for that adapter (backward compatible)
diff --git a/openspec/changes/multi-workspace-connections/specs/multi-connection-frontend/spec.md b/openspec/changes/multi-workspace-connections/specs/multi-connection-frontend/spec.md
new file mode 100644
index 00000000..5eaf2de9
--- /dev/null
+++ b/openspec/changes/multi-workspace-connections/specs/multi-connection-frontend/spec.md
@@ -0,0 +1,61 @@
+## ADDED Requirements
+
+### Requirement: Dynamic connection list replaces fixed platform grid
+The Settings page SHALL display a list of actual connections instead of a fixed grid of platform cards. Each connection SHALL show its platform icon, display name, status, and actions.
+
+#### Scenario: No connections exist
+- **WHEN** the user opens Settings and no connections exist
+- **THEN** the page SHALL show an empty state with an "Add Connection" call-to-action
+
+#### Scenario: Multiple connections exist
+- **WHEN** the user has 2 Slack connections and 1 Discord connection
+- **THEN** the page SHALL display 3 connection cards, each showing the platform icon, display name, status badge, and action buttons
+
+#### Scenario: Connection cards show identity
+- **WHEN** a connection card is rendered
+- **THEN** it SHALL display the platform name, the display_name, the connection status, and the channel count
+
+### Requirement: Add Connection flow with platform picker
+The Settings page SHALL provide an "Add Connection" button that opens a platform picker, then launches the ConnectionWizard for the chosen platform.
+
+#### Scenario: Add connection flow
+- **WHEN** the user clicks "Add Connection"
+- **THEN** a platform picker SHALL appear showing all supported platforms (Slack, Discord, Teams, Telegram)
+- **AND** selecting a platform SHALL open the ConnectionWizard for that platform
+
+#### Scenario: Add second connection for same platform
+- **WHEN** a Slack connection already exists and the user clicks "Add Connection" and selects Slack
+- **THEN** the ConnectionWizard SHALL open for Slack without any restriction
+
+### Requirement: display_name is required in wizard
+The ConnectionWizard SHALL require a non-empty display_name before allowing the user to proceed past step 1.
+
+#### Scenario: Empty display name blocks progress
+- **WHEN** the user is on step 1 of the wizard and the display name field is empty
+- **THEN** the "Next" button SHALL be disabled
+
+#### Scenario: Display name field has helpful placeholder
+- **WHEN** the wizard opens for Slack
+- **THEN** the display name placeholder SHALL indicate the purpose (e.g., "e.g. Engineering Workspace")
+
+#### Scenario: Display name label is not optional
+- **WHEN** the wizard step 1 is rendered
+- **THEN** the display name label SHALL show "Display name" without "(optional)"
+
+### Requirement: Connection actions
+Each connection card SHALL provide Manage Channels and Disconnect actions.
+
+#### Scenario: Manage channels on a connection
+- **WHEN** the user clicks "Manage Channels" on a specific connection
+- **THEN** the ManageChannelsDialog SHALL open for that connection's ID
+
+#### Scenario: Disconnect a connection
+- **WHEN** the user clicks Disconnect on a connection and confirms
+- **THEN** the connection SHALL be deleted and the list SHALL update
+
+### Requirement: Empty state encourages first connection
+When no connections exist, the Settings page SHALL show a welcoming empty state that guides users to add their first connection.
+
+#### Scenario: First-time user experience
+- **WHEN** the user visits Settings with no connections
+- **THEN** the page SHALL display platform icons, a brief explanation of what connecting does, and a prominent "Add Connection" button
diff --git a/openspec/changes/multi-workspace-connections/tasks.md b/openspec/changes/multi-workspace-connections/tasks.md
new file mode 100644
index 00000000..9b870463
--- /dev/null
+++ b/openspec/changes/multi-workspace-connections/tasks.md
@@ -0,0 +1,86 @@
+## 1. Backend: Database & Model
+
+- [x] 1.1 Update `PlatformConnection` model's `platform` literal to include `"teams" | "telegram"`
+- [x] 1.2 Update `platform_store.py` startup: drop unique index on `platform`, create compound index on `(platform, source)`
+- [x] 1.3 Add `get_connections_by_platform_and_source()` method for env migration queries
+
+## 2. Backend: API Helper Functions
+
+- [x] 2.1 Update `_register_adapter()` to accept `connection_id` param and pass `connectionId` in the bridge request body
+- [x] 2.2 Update `_unregister_adapter()` to accept `connection_id` param and call `DELETE /bridge/adapters/{connectionId}`
+- [x] 2.3 Update `_list_bridge_channels()` to accept optional `connection_id` param — when set, call `GET /bridge/connections/{connectionId}/channels`; when unset, use legacy platform route
+
+## 3. Backend: API Endpoints
+
+- [x] 3.1 Remove the 409 duplicate-platform check in `POST /api/connections`
+- [x] 3.2 Add validation: require non-empty `display_name` for `source="ui"` connections (return 422 if missing)
+- [x] 3.3 Update `create_connection()`: generate connection ID before calling `_register_adapter()`, pass it throughout the flow, use it for rollback
+- [x] 3.4 Update `delete_connection()` at line 274: pass `conn.id` to `_unregister_adapter()` instead of `conn.platform`
+- [x] 3.5 Update `validate_connection()` at line 293: pass `conn.id` to `_register_adapter()` and `_list_bridge_channels()`
+- [x] 3.6 Update `list_connection_channels()` at line 323: pass `conn.id` to `_list_bridge_channels()`
+- [x] 3.7 Update `_InternalConnectionItem` model to include `connection_id: str` field; populate it in the credentials endpoint response
+
+## 4. Backend: Env Migration & ChatBridgeAdapter
+
+- [x] 4.1 Update `_migrate_env_connection()` to check by `(platform="slack", source="env")` instead of just `source="env"`
+- [x] 4.2 Set `display_name` to `"{Platform} (env)"` for env-sourced connections
+- [x] 4.3 Add optional `connection_id` parameter to `ChatBridgeAdapter.__init__()`
+- [x] 4.4 When `connection_id` is set, route `fetch_history()` through `/bridge/connections/{connectionId}/channels/{channelId}/messages`
+- [x] 4.5 When `connection_id` is set, route `list_channels()` through `/bridge/connections/{connectionId}/channels`
+- [x] 4.6 When `connection_id` is set, route `get_channel_info()` through `/bridge/connections/{connectionId}/channels/{channelId}`
+- [x] 4.7 When `connection_id` is set, route `fetch_thread()` through connection-scoped thread endpoint
+
+## 5. Bot: ChatManager Refactor
+
+- [x] 5.1 Change adapter registry type to store `{ platform, connectionId, config }` per entry, keyed by composite `{platform}:{connectionId}`
+- [x] 5.2 Update `register(platform, credentials, connectionId?)` — use composite key, fall back to `{platform}:{platform}` when no connectionId
+- [x] 5.3 Update `unregister(platform, connectionId?)` — remove by composite key
+- [x] 5.4 Update `rebuild()` — parse platform from composite key (`key.split(":")[0]`) for adapter factory selection, pass full composite key to Chat SDK adapter map
+- [x] 5.5 Update `listAdapters()` to return `{ platform, connectionId, adapter }` for each entry
+- [x] 5.6 Add `getAdapterByConnectionId(connectionId)` — find entry where connectionId matches
+- [x] 5.7 Add `getAdaptersByPlatform(platform)` — return all entries for a given platform
+
+## 6. Bot: Webhook Routing
+
+- [x] 6.1 Add `POST /api/webhooks/{connectionId}` route — look up adapter by connection ID, call `bot.webhooks[compositeKey](request)`
+- [x] 6.2 Update `handleSlackWebhook()` to try all Slack adapters when multiple exist (legacy fallback)
+- [x] 6.3 Update `handleGenericWebhook()` to try all adapters for the given platform (legacy fallback)
+- [x] 6.4 Log which connection ID handled each webhook for observability
+
+## 7. Bot: Bridge Routes & Handlers
+
+- [x] 7.1 Update `POST /bridge/adapters` handler to extract `connectionId` from body and pass to `register()`
+- [x] 7.2 Update `DELETE /bridge/adapters/{connectionId}` — find adapter by connection ID and unregister
+- [x] 7.3 Add `GET /bridge/connections/{connectionId}/channels` route and handler
+- [x] 7.4 Add `GET /bridge/connections/{connectionId}/channels/{channelId}/messages` route and handler
+- [x] 7.5 Add `GET /bridge/connections/{connectionId}/channels/{channelId}` route and handler (channel info)
+- [x] 7.6 Add `GET /bridge/connections/{connectionId}/channels/{channelId}/threads/{threadId}/messages` route and handler
+- [x] 7.7 Update `getBridge()` to accept optional `connectionId` — when set, look up by connection ID; when unset, fall back to platform lookup
+- [x] 7.8 Update legacy `GET /bridge/platforms/{platform}/channels` to aggregate across all connections, adding `connection_id` to each channel object
+- [x] 7.9 Update startup sync in `index.ts` to pass `connection_id` from each credentials entry to `register()`
+
+## 8. Frontend: Settings Page Redesign
+
+- [x] 8.1 Rewrite `SettingsPage` to render a dynamic list of connections instead of fixed platform grid
+- [x] 8.2 Create "Add Connection" button and platform picker dialog
+- [x] 8.3 Create empty state component for when no connections exist (platform icons, explanation, CTA)
+- [x] 8.4 Update `PlatformCard` to always render from a connection object (remove null connection state)
+
+## 9. Frontend: ConnectionWizard Updates
+
+- [x] 9.1 Make `display_name` required — remove "(optional)" label, disable "Next" when empty
+- [x] 9.2 Update placeholder text to be more descriptive (e.g., "Engineering Workspace")
+- [x] 9.3 Fix `display_name: displayName || undefined` to send empty string validation to backend instead of `undefined`
+
+## 10. Testing & Verification
+
+- [x] 10.1 Test creating multiple connections for the same platform via API
+- [x] 10.2 Test bot startup sync with multiple connections of the same platform
+- [x] 10.3 Test per-connection webhook endpoint routes to correct adapter
+- [x] 10.4 Test legacy webhook endpoint tries all adapters for platform
+- [x] 10.5 Test connection-scoped bridge routes (channels, messages, threads)
+- [x] 10.6 Test Settings page with 0, 1, and 3+ connections
+- [x] 10.7 Test disconnect of one connection while others remain active
+- [x] 10.8 Test env migration with existing UI connection for same platform
+- [x] 10.9 Test ChatBridgeAdapter with connection_id fetches from correct connection
+- [x] 10.10 Test validate_connection uses connection-scoped channel listing
diff --git a/openspec/changes/oss-cla-copyright-assignment/.openspec.yaml b/openspec/changes/oss-cla-copyright-assignment/.openspec.yaml
new file mode 100644
index 00000000..863bff18
--- /dev/null
+++ b/openspec/changes/oss-cla-copyright-assignment/.openspec.yaml
@@ -0,0 +1,2 @@
+schema: spec-driven
+created: 2026-04-17
diff --git a/openspec/changes/oss-cla-copyright-assignment/design.md b/openspec/changes/oss-cla-copyright-assignment/design.md
new file mode 100644
index 00000000..290a7acd
--- /dev/null
+++ b/openspec/changes/oss-cla-copyright-assignment/design.md
@@ -0,0 +1,117 @@
+## Context
+
+Beever Atlas is transitioning from a closed internal tool to a public
+open-source project. Three IP questions must be answered before that
+transition is complete:
+
+1. **Who owns the copyright?** The Apache 2.0 license header must name an
+   authoritative legal entity, not a diffuse "contributors" collective, so
+   the company can relicense, dual-license, or defend IP in M&A and funding
+   due diligence without chasing individual contributor consents.
+
+2. **What trademarks does the company hold?** Without a documented policy,
+   third parties can name forks after the project, register confusingly
+   similar domains, or imply endorsement. The company needs a public record
+   of the marks it claims, even before formal registration.
+
+3. **What are contributors agreeing to?** DCO `Signed-off-by` trailers
+   certify origin but do not transfer copyright. The company's IP strategy
+   requires copyright assignment with a license-back. That CLA cannot go
+   live until a lawyer approves the text, so Phase A documents the interim
+   state honestly.
+
+## Goals / Non-Goals
+
+**Goals:**
+- Update the copyright line in LICENSE and NOTICE to name "Beever AI Limited"
+- Add a trademark reference in NOTICE pointing to TRADEMARK.md
+- Publish a plain-English trademark policy for Beever-family marks
+- Document the interim contribution IP posture in CONTRIBUTING.md
+- Write a lawyer-facing CLA draft off-main for Phase B
+
+**Non-Goals:**
+- Shipping CLA.md to `main` before legal review (Phase B)
+- Installing CLA-bot or DCO CI enforcement (Phase B)
+- Filing trademark registrations (external, jurisdiction-specific)
+- Retroactively auditing pre-existing external contributions
+
+## Decisions
+
+### D1: Copyright Assignment with License-Back (over DCO-only and Apache ICLA)
+
+**Choice:** Adopt Harmony HA-CAA-I-ANY (Individual Copyright Assignment
+Agreement) as the CLA template, supplemented with FSF-style license-back
+language.
+
+**Rationale:** Only copyright assignment gives the company unilateral
+relicensing capability. DCO-only and Apache ICLA (license grant only) both
+leave copyright with individual contributors, requiring unanimous consent
+for any future relicensing event -- unworkable at scale.
+
+**License-back is non-negotiable** (see D2). The combination of assignment
+plus license-back is the Canonical / Google model and is widely accepted
+in commercial OSS.
+
+**Alternatives considered:**
+- *DCO only*: Rejected -- no copyright transfer; fails M&A due diligence.
+- *Apache ICLA*: Rejected -- license grant only; same relicensing problem.
+- *In-bound = out-bound (Apache 2.0)*: Rejected -- company cannot relicense.
+
+### D2: License-Back is Non-Negotiable
+
+**Choice:** The CLA must grant contributors a perpetual, worldwide,
+non-exclusive, royalty-free, irrevocable license back to their own
+contributions.
+
+**Rationale:** Copyright assignment without a license-back is hostile to
+contributors. The Elasticsearch pre-fork is a cautionary example: SSPL
+relicensing without a license-back caused a hard community fracture.
+Contributors must be able to use their own code in other projects.
+
+### D3: Ontario, Canada Governing Law
+
+**Choice:** The CLA governing law clause specifies the Province of Ontario
+and the federal laws of Canada applicable therein, with exclusive
+jurisdiction of the courts of Ontario.
+
+**Rationale:** Beever AI Limited is incorporated in Toronto, Ontario,
+Canada. Using the company's home jurisdiction is conventional, defensible,
+and avoids ambiguity about which law applies. Ontario has a well-developed
+body of intellectual-property case law, making it a practical venue for
+IP-related disputes.
+
+### D4: A/B Phase Split -- CLA Off-Main Until Legal Review
+
+**Choice:** Phase A ships ownership and trademark documentation. The CLA
+draft lives in `.omc/plans/cla-draft-v1.md` (gitignored) until a qualified
+IP lawyer approves the text. Phase B ships CLA.md + CLA-bot.
+
+**Rationale:** Shipping a DRAFT CLA to `main` creates contributor chilling
+effect without legal benefit. Contributors may refuse to submit PRs if they
+see an unreviewed CLA. Keeping the draft off-main during legal review is
+the responsible approach. The `.omc/` directory is listed in `.gitignore`
+so the draft is never accidentally committed.
+
+**Consequences:** Between Phase A and Phase B, contributions are implicitly
+Apache 2.0 licensed. Contributors retain their copyright during this period.
+CONTRIBUTING.md makes this explicit.
+
+### D5: Beever-Family Trademark Scope Only
+
+**Choice:** TRADEMARK.md covers only "Beever", "Beever Atlas", "Beever AI",
+and the Beever logo. Votee-family marks are explicitly excluded with a
+one-line rationale.
+
+**Rationale:** Overreaching trademark claims (claiming marks the company
+does not actually use in this repository) invite challenge and erode
+community trust. Votee-family marks belong to separate Votee legal entities
+and are not relevant to this repository.
+
+## Risks
+
+| Risk | Likelihood | Impact | Mitigation |
+|------|-----------|--------|------------|
+| CLA enforced prematurely -- maintainer treats the draft as binding before legal review | Medium | High | CLA.md does not exist on `main`. CONTRIBUTING.md explicitly states CLA is "under development". |
+| CLA draft contains hallucinated clauses with no legal basis | Medium | Medium | Every clause in the draft MUST cite a Harmony or FSF source via HTML comment. Acceptance check requires >= 8 citations. |
+| Trademark list overreaches -- claiming marks not used in this repo | Low | Medium | Scoped to Beever-family only. All marks labeled TM (unregistered). |
+| Phase B delayed indefinitely -- legal review never happens | Medium | Medium | CONTRIBUTING.md commits to a 4-week target. Follow-up issue tracked in RES-232. |
diff --git a/openspec/changes/oss-cla-copyright-assignment/proposal.md b/openspec/changes/oss-cla-copyright-assignment/proposal.md
new file mode 100644
index 00000000..2c362e31
--- /dev/null
+++ b/openspec/changes/oss-cla-copyright-assignment/proposal.md
@@ -0,0 +1,57 @@
+## Why
+
+Beever Atlas is opening to the public. Without clear IP posture the project
+cannot relicense, cannot defend its brand, and cannot pass M&A or funding
+due diligence. Copyright assignment was chosen over DCO-only and Apache ICLA
+because only full assignment gives the company unilateral relicensing
+capability. A two-phase rollout avoids shipping unreviewed legal text to
+`main`: Phase A establishes ownership, trademark, and forward-looking CLA
+intent without adding contributor friction; Phase B (follow-up PR, gated on
+legal review) ships the actual CLA and enforcement tooling.
+
+## What Changes
+
+- `LICENSE` copyright line updated to "Copyright 2026 Beever AI Limited"
+- `NOTICE` copyright line updated to match LICENSE; trademark reference to
+  TRADEMARK.md added
+- `TRADEMARK.md` created: unregistered-mark policy scoped to Beever-family
+  marks only (Beever, Beever Atlas, Beever AI, the Beever logo); Votee-family
+  marks explicitly excluded
+- `CONTRIBUTING.md` updated: forward-looking CLA note added (RES-232
+  tracking reference, interim Apache 2.0 terms, 4-week target); false
+  "CI enforces the sign-off" claim replaced with honest state
+- `.omc/plans/cla-draft-v1.md` written off-main for lawyer review (not
+  committed to `main`)
+- This openspec record (5 files)
+
+## Capabilities
+
+### New Capabilities
+
+- `legal-ownership-attribution`: Explicit copyright ownership by Beever AI
+  Limited declared in LICENSE and NOTICE. All subsequent files reference a
+  single authoritative owner string, making relicensing and M&A due diligence
+  tractable.
+- `trademark-policy`: Unregistered trademark policy for Beever-family marks
+  published in TRADEMARK.md. Covers permitted and prohibited uses in plain
+  English. Votee-family marks explicitly out of scope. Contact address
+  provided for licensing inquiries.
+- `contributor-agreement-posture`: Forward-looking CLA policy documented in
+  CONTRIBUTING.md. States current interim position (Apache 2.0, contributors
+  retain copyright), RES-232 tracking reference, and 4-week target for CLA
+  finalization. Does not create any obligation or restriction in Phase A.
+
+### Modified Capabilities
+
+(none -- this is a policy-only change)
+
+## Impact
+
+- **Runtime / code:** Zero. No source files (.ts, .py, .js, etc.) are
+  modified. No build, test, or deployment changes.
+- **Contributor flow:** No new friction in Phase A. Contributors continue
+  to submit PRs under Apache 2.0 terms. Phase B will add a one-time CLA
+  sign-off requirement, communicated in advance.
+- **Legal posture:** Becomes explicit and consistent. Copyright ownership,
+  trademark rights, and contribution terms are now documented in dedicated
+  files that are straightforward to locate and audit.
diff --git a/openspec/changes/oss-cla-copyright-assignment/specs/copyright-posture/spec.md b/openspec/changes/oss-cla-copyright-assignment/specs/copyright-posture/spec.md
new file mode 100644
index 00000000..10ec32e5
--- /dev/null
+++ b/openspec/changes/oss-cla-copyright-assignment/specs/copyright-posture/spec.md
@@ -0,0 +1,80 @@
+## ADDED Requirements
+
+### Requirement: Explicit copyright ownership declared in LICENSE
+The LICENSE file SHALL identify "Beever AI Limited" as the copyright holder
+on line 1, and all subsequent lines of the Apache 2.0 license text SHALL
+remain byte-identical to the upstream Apache 2.0 template.
+
+#### Scenario: LICENSE line 1 names the correct legal entity
+- **WHEN** the LICENSE file is read from the repository root
+- **THEN** line 1 SHALL be exactly `Copyright 2026 Beever AI Limited`
+- **AND** lines 2 onwards SHALL be byte-identical to the Apache License,
+  Version 2.0 standard text
+
+### Requirement: NOTICE declares trademark policy and consistent copyright
+The NOTICE file SHALL contain the updated copyright line naming
+"Beever AI Limited" and SHALL reference TRADEMARK.md for the full trademark
+policy. The third-party attribution block below the separator SHALL remain
+unchanged.
+
+#### Scenario: NOTICE copyright line matches LICENSE
+- **WHEN** the NOTICE file is read from the repository root
+- **THEN** it SHALL contain the string "Copyright 2026 Beever AI Limited"
+- **AND** it SHALL contain the string "TRADEMARK.md"
+
+#### Scenario: Third-party attribution block is preserved verbatim
+- **WHEN** the NOTICE file is compared against the prior version
+- **THEN** the third-party dependency list (from the separator line
+  onwards) SHALL be byte-identical to the previous version
+
+### Requirement: Trademark policy covers Beever-family marks and excludes Votee marks
+TRADEMARK.md SHALL list the Beever-family marks (Beever, Beever Atlas,
+Beever AI, the Beever logo) with TM designation, explicitly state that
+Votee-family marks are not claimed in this repository, and provide a
+contact address for licensing inquiries.
+
+#### Scenario: TRADEMARK.md uses TM, never (R)
+- **WHEN** TRADEMARK.md is scanned for registered trademark symbols
+- **THEN** it SHALL NOT contain "(R)" or the Unicode registered trademark
+  character (U+00AE)
+
+#### Scenario: TRADEMARK.md explicitly excludes Votee marks
+- **WHEN** TRADEMARK.md is read
+- **THEN** it SHALL contain a statement that Votee-family marks are not
+  claimed in this repository (e.g., "not claimed in this repository")
+
+#### Scenario: TRADEMARK.md provides a contact address
+- **WHEN** TRADEMARK.md is read
+- **THEN** it SHALL contain the string "legal@beever.ai"
+
+### Requirement: CONTRIBUTING.md states interim contribution IP posture
+CONTRIBUTING.md SHALL include a forward-looking CLA note that references
+RES-232, states that contributions are accepted under Apache 2.0 terms
+during the interim period, and does NOT contain any sign-off ritual,
+affirmation language, or claim that CI enforces DCO sign-off.
+
+#### Scenario: Forward-looking CLA note is present
+- **WHEN** CONTRIBUTING.md is read
+- **THEN** it SHALL contain the string "under development" in the context
+  of the CLA
+- **AND** it SHALL contain the string "RES-232"
+
+#### Scenario: No pseudo-enforcement language present
+- **WHEN** CONTRIBUTING.md is scanned for enforcement claims
+- **THEN** it SHALL NOT contain the phrase "I have read and agree to the CLA"
+- **AND** it SHALL NOT contain the phrase "CI enforces the sign-off"
+
+### Requirement: CLA draft is available off-main for lawyer review
+The file `.omc/plans/cla-draft-v1.md` SHALL exist in the working tree,
+SHALL NOT be tracked by git (the `.omc/` directory is gitignored), SHALL
+contain a DRAFT banner, at least 8 HTML source citations referencing
+Harmony or FSF precedent, an Ontario, Canada governing law clause, and a
+license-back clause. It SHALL NOT contain enforceability claims.
+
+#### Scenario: CLA draft has DRAFT banner and source citations
+- **WHEN** .omc/plans/cla-draft-v1.md is read
+- **THEN** it SHALL contain the string "DRAFT"
+- **AND** it SHALL contain at least 8 occurrences of "<!-- source:"
+- **AND** it SHALL contain the string "Ontario"
+- **AND** it SHALL contain a license-back clause (string "license-back"
+  or "License-Back")
diff --git a/openspec/changes/oss-cla-copyright-assignment/tasks.md b/openspec/changes/oss-cla-copyright-assignment/tasks.md
new file mode 100644
index 00000000..ce85e208
--- /dev/null
+++ b/openspec/changes/oss-cla-copyright-assignment/tasks.md
@@ -0,0 +1,18 @@
+## Phase A (this PR)
+
+- [x] A1. Update LICENSE line 1 copyright to "Copyright 2026 Beever AI Limited"
+- [x] A2. Update NOTICE copyright line to "Copyright 2026 Beever AI Limited" and add trademark reference to TRADEMARK.md
+- [x] A3. Create TRADEMARK.md with Beever-family marks only (TM, not R), Votee exclusion rationale, `legal@beever.ai` contact
+- [x] A4. Update CONTRIBUTING.md: forward-looking CLA note (RES-232 reference, interim Apache 2.0 terms, 4-week target) + fix false CI-enforcement claim
+- [x] A5. Create openspec record (5 files: .openspec.yaml, proposal.md, design.md, tasks.md, specs/copyright-posture/spec.md)
+- [x] A6. Write .omc/plans/cla-draft-v1.md off-main for lawyer review (DRAFT banner, >= 8 source citations, Ontario, Canada governing law, license-back)
+- [x] A7. Run all Phase A acceptance checks and confirm pass
+
+## Phase B (follow-up PR, gated on legal review)
+
+- [ ] B1. Obtain qualified IP lawyer approval of .omc/plans/cla-draft-v1.md text
+- [ ] B2. Finalize CLA.md from .omc/plans/cla-draft-v1.md (strip <!-- source: --> comments, remove DRAFT banner)
+- [ ] B3. Update CONTRIBUTING.md: replace forward-looking note with CLA-enforcement section and link to CLA.md
+- [ ] B4. Install and configure CLA-bot (.github/workflows/cla.yml or equivalent)
+- [ ] B5. Add DCO CI enforcement alongside CLA-bot
+- [ ] B6. Run all Phase B acceptance checks
diff --git a/openspec/changes/res-177-p0-quality-hardening/.openspec.yaml b/openspec/changes/res-177-p0-quality-hardening/.openspec.yaml
new file mode 100644
index 00000000..c4036b7c
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/.openspec.yaml
@@ -0,0 +1,2 @@
+schema: spec-driven
+created: 2026-04-20
diff --git a/openspec/changes/res-177-p0-quality-hardening/design.md b/openspec/changes/res-177-p0-quality-hardening/design.md
new file mode 100644
index 00000000..b4295ee5
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/design.md
@@ -0,0 +1,203 @@
+## Context
+
+This change closes the seven P0 sub-issues of Linear RES-177 that remain open
+after PR #42 shipped the H1/H2/H3/H4/M1/M6 security fixes. The umbrella
+milestone is **v1.0 OSS Launch**. Each sub-issue was authored by the
+`SECURITY_REVIEW_2026-04-15` audit; this doc consolidates the implementation
+approach because the seven fixes share one branch, one PR, and one shared
+concern: making the repo safe to be read and deployed by strangers after OSS.
+
+Constraints:
+
+- Current baseline: backend `uv run pytest` = 1,140 pass / 2 pre-existing fail
+  (`test_qa_chat_overhaul` — unrelated) / 25 skipped; bot `npm test` = 116 pass
+  / 0 fail. Must not regress.
+- Web tests are currently RED on `main` (RES-214) — this change must make them
+  GREEN, not just "equally broken".
+- Root main branch has moved since audit; two items regressed
+  (`bridge.ts` +633 LoC, `test_graph_store_contract.py` +1 unguarded import).
+  All location references below reflect the current `main` tree.
+- No user-facing API surface changes except `/api/health` (see decision D5).
+
+## Goals / Non-Goals
+
+**Goals:**
+
+- Close all seven RES-177 P0 sub-issues with testable success criteria.
+- Ship one atomic branch with per-ticket commits so each Linear issue can be
+  moved to Done with a clear commit pointer.
+- Add just enough CI enforcement that a future regression on any of these
+  seven items fails a check run rather than landing on `main`.
+- Leave the refactored `bridge.ts` behaviourally identical — same routes,
+  same response envelopes, same platform-error classification, same tokens
+  attached to outbound fetches.
+
+**Non-Goals:**
+
+- Not raising coverage to 80% on every security-sensitive module (RES-208
+  Q3.3 enumerates 9 modules at 0–15%). Scope kept to: landing the ratchet at
+  50% and adding tests for the two most critical (`share_store`,
+  `quality_gates`). Remaining coverage work is tracked as a follow-up.
+- Not rewriting `Makefile`, `deploy.yml` smoke harness, or Nebula nightly
+  service ordering (RES-206 Q1.6 / Q1.7 / Q1.9). Out-of-scope stretch items
+  land as follow-up tickets.
+- Not introducing Renovate. Dependabot extended config is sufficient for
+  digest + chat-SDK grouping.
+- Not replacing `chat` SDK family with `@slack/bolt` / `discord.js` etc.
+  (RES-195 recommendation #3). That's a large migration tracked separately;
+  scope here is pin + verify.
+
+## Decisions
+
+### D1 — One branch, seven commits, one PR
+
+Alternatives: seven sibling branches (one per ticket) merged independently;
+rejected because six of the seven fixes either touch CI or Docker configs
+that must land together to avoid transient CI failures. Commit-per-ticket on
+one branch keeps PR review legible and each Linear close pinned to a single
+SHA.
+
+### D2 — `classifyPlatformError` canonical location
+
+Move the canonical (hardened) implementation to
+`bot/src/bridge/platformError.ts`. `bot/src/bridge-classifier.ts` is
+**deleted** (not a re-export shim) — keeping the file tempts future edits to
+drift it again. Both test files
+(`bot/src/bridge-error-classifier.test.ts` and
+`bot/src/bridge-classifier.test.ts`) are updated to import from the new
+canonical module. If the tests are redundant after the merge, they are
+consolidated into a single test file.
+
+Alternatives: keep `bridge-classifier.ts` as a `export *` shim — rejected,
+since reviewers and IDE-goto-definition would land in the shim and still
+need one extra hop.
+
+### D3 — Route table for `registerBridgeRoutes`
+
+Replace the 201-line `if/else` cascade with a table:
+
+```ts
+const ROUTE_TABLE: Array<{ method: string; pattern: RegExp; handler: Handler }> = [
+  { method: "POST", pattern: /^\/bridge\/send$/, handler: handleSend },
+  // …
+];
+```
+
+Each handler is wrapped with `withPlatformError(handler)`. The three-axis
+duplication (legacy + connection-scoped + platform-prefixed) is collapsed by
+having the pattern resolver normalise the matched URL into a canonical form
+before dispatching.
+
+### D4 — Logger is `bot/src/logger.ts`, not pino
+
+60-line wrapper around `console.*` with level gates driven by
+`LOG_LEVEL` env. Reasons: (a) zero new deps — keeps bot install small; (b)
+no existing structured-log consumer — pino's JSON output would be wasted;
+(c) easy to swap later. The bar is "can silence debug in prod" + "one
+module touches stdout/stderr so a future pino swap is one-line".
+
+### D5 — `/api/health` never raises
+
+Today `infra/health.py:69` raises `redis.exceptions.ConnectionError` when
+Redis is down. The fix wraps each probe in its own typed `try/except`,
+aggregates into `{status: "healthy"|"degraded"|"unhealthy", failing: [...]}`,
+always returns HTTP 200. **This is a behavioural change** visible to health
+checkers — Kubernetes readiness probes that currently interpret 5xx as "not
+ready" will now always see 200 and must switch to reading the JSON status
+field. Updated in ops README.
+
+Alternatives: return HTTP 503 on degraded — rejected because AWS ALB and
+other L7 checkers treat 503 as fail-closed-remove-from-LB; we want degraded
+to be an observable state, not an auto-remove.
+
+### D6 — Docker digest pinning via Dependabot + fall-back manual pin
+
+Dependabot 2 supports digest updates for Docker via `package-ecosystem:
+docker` with `rebase-strategy: auto`. Initial digests captured by running
+`docker buildx imagetools inspect <image>:<tag>` on the build host and
+committed. Dependabot then opens PRs when upstream repoints the tag.
+
+Rejected: Renovate — would require a new bot + config file; for the current
+update volume (two Python, two Node, 5 service images), Dependabot is
+sufficient.
+
+### D7 — CodeQL workflow: fail + upload (do not delete)
+
+Drop both `continue-on-error: true` and `upload: never`. Keep the workflow.
+Reason: without CodeQL we have no JavaScript/TypeScript SAST at all. The
+current configuration is strictly worse than nothing because it advertises
+coverage that doesn't exist.
+
+### D8 — Coverage ratchet starts at 50%, not today's 51%
+
+`pytest --cov-fail-under=50`. Chose 50% over 51% to leave room for landing
+silent-except-pass fixes (Q3.5) which typically **drop** line coverage by
+1–2 pts because they log a previously-untested branch. Once this change
+merges, a follow-up issue raises the floor to 55% and each quarter thereafter.
+
+### D9 — `.gitignore` drops `openspec/`
+
+Recommended fix from RES-213 Q8.1. The alternative ("untrack existing files
+and document explicit exceptions") was rejected because the project is
+already actively working with OpenSpec changes (this doc is one of them)
+and invisibility in `git status` has caused real confusion.
+
+## Risks / Trade-offs
+
+- **Risk:** Digest-pinning delays security updates for base images until a
+  Dependabot PR lands. → **Mitigation:** Dependabot runs daily; auto-merge
+  enabled on the digest-update workflow for the Docker ecosystem only after
+  CI green.
+- **Risk:** Decomposing `bridge.ts` into route modules changes import paths
+  consumed by `bot/src/index.ts` and test files. → **Mitigation:** keep the
+  top-level `bot/src/bridge.ts` as a re-export shim that still exposes
+  `registerBridgeRoutes` and `classifyPlatformError` for backward compat
+  within the bot package only. Delete the shim in a follow-up once all
+  internal consumers are migrated.
+- **Risk:** CodeQL hard-fail surfaces a backlog of alerts that blocks the PR.
+  → **Mitigation:** run CodeQL locally first; triage and either fix or
+  add `# codeql[suppress]` with a tracking ticket before turning hard-fail on.
+- **Risk:** `/api/health` never-raise contract changes healthchecker
+  semantics. → **Mitigation:** document in README + release notes;
+  smoke-test against the deployed EIP before prod cutover.
+- **Trade-off:** Scope cut on RES-208 coverage goal (not hitting 80% on every
+  security-sensitive module) keeps this PR reviewable. Explicit follow-up
+  ticket linked in Q3 close-out comment.
+- **Trade-off:** One big PR has larger blast radius than seven small ones.
+  Acceptable because per-commit atomicity means `git revert` per ticket
+  remains possible, and the CI gates landing in this PR protect the whole
+  bundle.
+
+## Migration Plan
+
+1. Create branch `feature/res-177-p0-quality-hardening` (done).
+2. Phase in commits in the order below; each phase is independently
+   revertible. Each phase ends with the targeted Linear sub-issue moved to
+   **Done** with a comment citing the commit SHA.
+   - Phase 1 — RES-213 (Q8): docs / env / hygiene. Lowest blast radius.
+   - Phase 2 — RES-206 (Q1): CI gates. Must land before later phases so
+     their regressions fail CI.
+   - Phase 3 — RES-194 (H5): Docker digest pins.
+   - Phase 4 — RES-195 (H6): Chat SDK pinning.
+   - Phase 5 — RES-214 (Q9): Web test harness.
+   - Phase 6 — RES-208 (Q3): Backend test baseline.
+   - Phase 7 — RES-209 (Q4): bridge.ts decomposition. Highest risk, landed
+     last with full test safety net from earlier phases.
+3. Final commit: `docs(security): add RES-177 P0 follow-up close-out notes`
+   linking each closed ticket to its commit.
+4. PR to `main`. Mark RES-177 umbrella as "all P0s closed" once merged.
+
+**Rollback:** `git revert` per phase commit. CI contract added in Phase 2
+prevents a phase-specific rollback from reintroducing the fixed regression
+silently (e.g. re-raise on `/api/health` would fail a new health-shape test).
+
+## Open Questions
+
+- **Q:** Do we want Renovate eventually, or is Dependabot our long-term
+  answer? → Decision deferred to a separate ticket; this PR uses Dependabot.
+- **Q:** Should the CodeQL workflow run on every PR or only `main` + nightly?
+  → Today it runs on both; keeping the current trigger set. Revisit if PR
+  runtime becomes a bottleneck.
+- **Q:** Coverage ratchet cadence — monthly or per-PR high-watermark?
+  → Monthly for now; high-watermark has a well-known sawtooth problem on
+  refactors.
diff --git a/openspec/changes/res-177-p0-quality-hardening/proposal.md b/openspec/changes/res-177-p0-quality-hardening/proposal.md
new file mode 100644
index 00000000..ccaa05d9
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/proposal.md
@@ -0,0 +1,115 @@
+## Why
+
+The `RES-177` security/quality audit produced eight P0 sub-issues. Six went out
+in PR #42 (H1/H2/H3/H4/M1/M6). The remaining **seven P0s** are still open on
+`main` — and two have regressed since the audit (`bridge.ts` grew from 1,758 →
+2,391 LoC; `tests/contracts/test_graph_store_contract.py` grew from 3 → 4
+unguarded `nebula3` imports). Shipping v1.0 OSS with these open leaves us with
+floating Docker tags, caret-ranged supply-chain vectors, a CodeQL workflow that
+never fails or reports, 9 red frontend tests on `main`, a flaky Python suite,
+`.env.example` missing 30+ settings, and a single-file bot bridge that makes
+every route edit a shotgun-surgery hazard.
+
+## What Changes
+
+Seven tightly-scoped hardening changes bundled into one delta because they all
+feed into the same OSS-launch gate (Linear RES-177 umbrella):
+
+- **RES-213 (Q8)** — Docs, `.env.example` drift, release / repo hygiene. Drop
+  `openspec/` from `.gitignore` so new OpenSpec changes are reviewable; rewrite
+  `.env.example` to cover every chat-platform token + web + advanced-tuning
+  setting consumed in code; fix `CHANGELOG.md` compare link + backfill
+  `[Unreleased]`; replace the Vite-scaffold `web/README.md`; move orphaned
+  binaries out of repo root; align lint target with tsconfig.
+- **RES-206 (Q1)** — CI quality gates. Drop `continue-on-error: true` and
+  `upload: never` from CodeQL (stop the security-theatre); add `pyright` loose
+  typecheck; add `pytest --cov --cov-fail-under=50`; add `vitest --coverage`;
+  add `npm run lint` for bot (with `@typescript-eslint` rules); add
+  `ruff format --check`; align `Makefile` with CI once bot lint exists.
+- **RES-194 (H5)** — Digest-pin every base image. `Dockerfile`, `web/Dockerfile`,
+  `bot/Dockerfile`, `docker-compose.yml`, `docker-compose.nebula.yml`, the
+  `COPY --from=ghcr.io/astral-sh/uv:latest` stage. All pinned by
+  `@sha256:<digest>`; add Renovate/Dependabot config for digest refresh.
+- **RES-195 (H6)** — Pin `chat` SDK family to exact versions (no carets); add
+  `npm ci --audit-signatures` to `bot/Dockerfile`; enable Dependabot grouped
+  updates so churn in this family is visible.
+- **RES-214 (Q9)** — Fix 9 failing web tests. Add a localStorage shim in
+  `web/src/test-setup.ts` (covering the jsdom 27 prototype-inheritance
+  regression); align `ChatInputBar.tools.test.tsx` expectation with the current
+  `(N/M)` label format (or restore the label).
+- **RES-208 (Q3)** — Make `uv run pytest` green on a fresh checkout with no
+  services running. Gate `nebula3` imports via `pytest.importorskip`; make
+  `/api/health` return `degraded` instead of raising when Redis/Mongo probes
+  fail; skip service-dependent tests when the service is unreachable; replace
+  the bundle of silent `except Exception: pass` sites with typed +
+  DEBUG-logged excepts at the locations the ticket enumerates.
+- **RES-209 (Q4)** — Decompose `bot/src/bridge.ts` (now 2,391 LoC). Dedupe
+  `classifyPlatformError` (canonical implementation lives in a single module;
+  both test files import the same source); extract a `withPlatformError`
+  wrapper to collapse the 10+ identical `try/catch` envelopes; extract
+  `jsonResponse` + logger to shared modules; split route handlers into route
+  modules so no bot source file exceeds 1,000 LoC.
+
+## Capabilities
+
+### New Capabilities
+
+- `docs-env-hygiene`: `.env.example` coverage contract, `CHANGELOG` discipline,
+  top-level READMEs, repo-root binary policy, `openspec/` tracked-vs-ignored
+  contract.
+- `ci-quality-gates`: CodeQL hard-fail + SARIF upload, Python typecheck,
+  coverage threshold gate, bot lint parity, `ruff format --check`, deploy
+  approval + smoke contract.
+- `container-supply-chain`: Digest-pinned base images across every Dockerfile
+  + compose file + build stage; automated digest-refresh PRs.
+- `bot-dependency-pinning`: Exact-version pins for the `chat` SDK family;
+  `npm ci --audit-signatures` in `bot/Dockerfile`; Dependabot grouped updates.
+- `web-test-harness`: `test-setup.ts` provides a portable `localStorage`
+  (and any other web-platform API jsdom strips); every test file inherits it
+  without per-file shims.
+- `backend-test-baseline`: `uv run pytest` returns green on a fresh checkout
+  with **no** live services; optional-extra imports gated by
+  `importorskip`; health endpoints never raise; silent-except sites log at
+  DEBUG with the exception class.
+- `bot-bridge-decomposition`: One canonical `classifyPlatformError`; one
+  `withPlatformError` error wrapper; one shared `jsonResponse` + logger; no
+  bot source file > 1,000 LoC.
+
+### Modified Capabilities
+
+_None — none of the new capability areas are covered by existing specs under
+`openspec/specs/` (only `ask-chat-ui` is present)._
+
+## Impact
+
+- **Code.**
+  - `.gitignore`, `.env.example`, `CHANGELOG.md`, `web/README.md`, root READMEs,
+    `Beever_Atlas_Feature_Spec.docx` (move), `daily_update.md` (untrack).
+  - `.github/workflows/{codeql,ci,audit,deploy,nightly}.yml`, `pyproject.toml`
+    ([tool.pyright] + [tool.coverage] if needed), `Makefile`,
+    `bot/package.json`, `bot/.eslintrc.*` (new), `bot/tsconfig.json`.
+  - `Dockerfile`, `web/Dockerfile`, `bot/Dockerfile`, `docker-compose.yml`,
+    `docker-compose.nebula.yml`, `.github/dependabot.yml` or `renovate.json`.
+  - `bot/package.json`, `bot/package-lock.json`, `bot/Dockerfile`.
+  - `web/src/test-setup.ts`, `web/src/components/channel/__tests__/ChatInputBar.tools.test.tsx`
+    (or `ChatInputBar.tsx`).
+  - `tests/contracts/test_graph_store_contract.py`, `src/beever_atlas/infra/health.py`,
+    `tests/test_health.py`, `tests/test_ask_share.py`, `tests/test_ask_disabled_tools.py`,
+    20+ `except Exception: pass` sites enumerated in the Q3 ticket.
+  - `bot/src/bridge.ts` → split into `bot/src/bridge/app.ts`,
+    `bot/src/bridge/routes/*.ts`, `bot/src/bridge/platformError.ts`,
+    `bot/src/bridge/withPlatformError.ts`, `bot/src/http-utils.ts`,
+    `bot/src/logger.ts`. Delete `bot/src/bridge-classifier.ts`; update both
+    test files to import from the canonical module.
+- **APIs.** No user-facing surface breaks. `/api/health` behaviour
+  changes from "raise on service error" to "return `{status: degraded, ...}`"
+  — **BREAKING** only for clients that currently treat 5xx as "healthcheck
+  failed"; ops docs updated.
+- **Data migration.** None.
+- **Env vars.** None added. `.env.example` now documents settings already
+  consumed in code (no new runtime reads).
+- **Dependencies.**
+  - Bot: pin `chat` family to exact versions (no runtime behaviour change).
+  - Backend: add `pyright` and `pytest-cov` as dev deps (already available via
+    `uv` for the latter; confirm).
+  - CI: introduce Renovate or extend Dependabot config.
diff --git a/openspec/changes/res-177-p0-quality-hardening/specs/backend-test-baseline/spec.md b/openspec/changes/res-177-p0-quality-hardening/specs/backend-test-baseline/spec.md
new file mode 100644
index 00000000..4ff37118
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/specs/backend-test-baseline/spec.md
@@ -0,0 +1,66 @@
+## ADDED Requirements
+
+### Requirement: `uv run pytest` is green on a fresh checkout
+After `make install`, `uv run pytest` SHALL exit 0 without any live
+service running (no Redis, no Mongo, no Weaviate, no Neo4j, no Nebula).
+Existing pre-existing-fail tests that were documented as the regression
+floor before this change SHALL either be fixed or explicitly marked with
+`xfail`/`skip` and an upstream ticket reference.
+
+#### Scenario: Fresh clone passes tests
+- **WHEN** a contributor clones the repo, runs `make install`, and runs
+  `uv run pytest` with no services running
+- **THEN** the command exits 0
+
+### Requirement: Optional-extra imports are guarded
+Any Python test module that imports a package declared under an optional
+extra (e.g., `nebula3` under `--extra nebula`) SHALL guard the import
+with `pytest.importorskip(...)` or a module-level `pytest.mark.skipif`
+so that the test file does not fail to collect when the extra is not
+installed.
+
+#### Scenario: Nebula tests skip when extra is absent
+- **WHEN** `nebula3` is not installed
+- **THEN** every test in
+  `tests/contracts/test_graph_store_contract.py` that imports
+  `beever_atlas.stores.nebula_store` reports as skipped, not errored
+
+### Requirement: `/api/health` never raises
+The health endpoint SHALL return HTTP 200 with a JSON body
+`{status, failing, …}` regardless of upstream-service reachability. The
+`status` field SHALL be `"healthy"`, `"degraded"`, or `"unhealthy"`.
+Each probe SHALL be wrapped in its own `try/except`; a failing probe
+MUST NOT cause the handler to raise.
+
+#### Scenario: Redis is down
+- **WHEN** Redis is unreachable and `GET /api/health` is called
+- **THEN** the response is HTTP 200 with
+  `status == "degraded"` and `"redis"` listed in `failing`
+
+#### Scenario: All stores are down
+- **WHEN** Redis, Mongo, Weaviate, and Neo4j are all unreachable
+- **THEN** the response is HTTP 200 with
+  `status == "unhealthy"` and every failing probe listed in `failing`
+
+### Requirement: Silent `except Exception: pass` sites log at DEBUG
+Enumerated suppression sites in `api/dev.py`, `server/app.py`,
+`services/batch_processor.py`, `stores/nebula_store.py`, and
+`wiki/compiler.py` (per RES-208 Q3.5) SHALL log the suppressed exception
+class and message at DEBUG level before the suppression. Silent bare
+excepts without any log SHALL NOT land on `main`.
+
+#### Scenario: A previously-silent failure is now observable
+- **WHEN** one of the enumerated suppression sites catches an exception
+- **THEN** the suppressed exception class and message are emitted at
+  DEBUG level to the configured logger
+
+### Requirement: Coverage for share-store and quality-gates reaches 80%
+Unit tests SHALL bring `services/share_store.py` and
+`agents/callbacks/quality_gates.py` to ≥ 80% line coverage. These are
+two of the nine security-sensitive modules the Q3 ticket flagged at
+0–15%; the remaining seven land in a follow-up ticket.
+
+#### Scenario: Coverage floor on share-store
+- **WHEN** `uv run pytest --cov=src/beever_atlas/services/share_store
+  --cov-fail-under=80` runs
+- **THEN** the command exits 0
diff --git a/openspec/changes/res-177-p0-quality-hardening/specs/bot-bridge-decomposition/spec.md b/openspec/changes/res-177-p0-quality-hardening/specs/bot-bridge-decomposition/spec.md
new file mode 100644
index 00000000..c24b8f7c
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/specs/bot-bridge-decomposition/spec.md
@@ -0,0 +1,79 @@
+## ADDED Requirements
+
+### Requirement: One canonical `classifyPlatformError`
+`classifyPlatformError` SHALL exist in exactly one source module. Every
+import site (tests and production) SHALL resolve to the same symbol.
+
+#### Scenario: Repo has one definition
+- **WHEN** a developer greps for
+  `export function classifyPlatformError` in the bot source tree
+- **THEN** exactly one match is found
+
+#### Scenario: Bug fix does not drift between copies
+- **WHEN** a behaviour change is made to `classifyPlatformError`
+- **THEN** no further action is required to keep multiple copies in sync,
+  because only one copy exists
+
+### Requirement: Every route handler is wrapped by `withPlatformError`
+Every bot route handler SHALL return its final JSON response through a
+shared `withPlatformError(handler)` wrapper that applies
+`classifyPlatformError` on any thrown error and emits a normalised
+`{ error, code }` envelope. Inline `try/catch` that duplicates the
+classification-plus-envelope pattern SHALL NOT appear in handler bodies.
+
+#### Scenario: New handler inherits error envelope automatically
+- **WHEN** a developer adds a new bot route handler and wraps it with
+  `withPlatformError`
+- **THEN** thrown errors produce the standard `{ status, code, error }`
+  envelope without per-handler `try/catch`
+
+### Requirement: Bot source files are under 1,000 LoC
+No bot source file in `bot/src/` (excluding generated `dist/` and
+`node_modules/`) SHALL exceed 1,000 lines of code. The current
+`bot/src/bridge.ts` (2,391 LoC as of the RES-209 audit) SHALL be split
+into route modules, a shared app bootstrap, and dedicated utility
+modules (`http-utils`, `logger`, `platformError`, `withPlatformError`).
+
+#### Scenario: `bridge.ts` file size drops below the cap
+- **WHEN** `wc -l bot/src/bridge.ts` runs after this change
+- **THEN** the result is < 1,000
+
+#### Scenario: No other bot source file exceeds the cap
+- **WHEN** `find bot/src -name '*.ts' -not -path '*/node_modules/*' | xargs wc -l`
+  runs
+- **THEN** every non-test file is under 1,000 lines
+
+### Requirement: Route registration uses a table, not regex cascades
+`registerBridgeRoutes` SHALL dispatch by a route table
+`{ method, pattern, handler }` rather than a flat sequential `if/else`
+chain. The table SHALL collapse legacy + connection-scoped +
+platform-prefixed variants via a single resolver.
+
+#### Scenario: Adding a route touches one file
+- **WHEN** a developer adds a new `/bridge/<endpoint>` route
+- **THEN** the change is additive in one route module and one entry in
+  the route table — no cascade of `else if` branches is edited
+
+### Requirement: `jsonResponse` is a shared utility
+`jsonResponse` (and any sibling HTTP helpers) SHALL live in
+`bot/src/http-utils.ts` (or equivalent shared module) and be imported by
+both `bot/src/bridge/*` and `bot/src/index.ts`. Inline
+`writeHead(…, { "Content-Type": "application/json" })` sites in
+`bot/src/index.ts` SHALL be migrated to the shared helper.
+
+#### Scenario: Inline writeHead is gone from index.ts
+- **WHEN** a reviewer greps for `writeHead.*application\/json` in
+  `bot/src/index.ts`
+- **THEN** zero matches remain
+
+### Requirement: Bot uses a level-gated logger
+`bot/src/logger.ts` (or equivalent) SHALL provide a minimal level-gated
+logging interface (at least `debug`, `info`, `warn`, `error`). The
+`LOG_LEVEL` environment variable SHALL control visibility. Bare
+`console.*` calls in `bot/src/{bridge/*, index, chat-manager, webhook-buffer}.ts`
+SHALL be migrated to the logger (at minimum, debug-only lines SHALL be
+gated so prod does not emit them).
+
+#### Scenario: `LOG_LEVEL=info` silences debug lines
+- **WHEN** the bot runs with `LOG_LEVEL=info`
+- **THEN** `logger.debug(...)` calls produce no output
diff --git a/openspec/changes/res-177-p0-quality-hardening/specs/bot-dependency-pinning/spec.md b/openspec/changes/res-177-p0-quality-hardening/specs/bot-dependency-pinning/spec.md
new file mode 100644
index 00000000..a52c0a6a
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/specs/bot-dependency-pinning/spec.md
@@ -0,0 +1,33 @@
+## ADDED Requirements
+
+### Requirement: `chat` SDK family is pinned to exact versions
+`bot/package.json` SHALL declare the `chat` SDK family — `chat`,
+`@chat-adapter/slack`, `@chat-adapter/discord`, `@chat-adapter/teams`,
+`@chat-adapter/telegram`, `@chat-adapter/state-redis`, and
+`chat-adapter-mattermost` — at exact versions (no leading `^` or `~`).
+
+#### Scenario: Caret range on a chat-family package fails CI
+- **WHEN** a contributor edits `bot/package.json` to set
+  `"chat": "^4.26.0"` (or any non-exact range)
+- **THEN** CI fails with a lint step pointing at the non-exact pin
+
+### Requirement: `bot/Dockerfile` verifies npm registry signatures
+`bot/Dockerfile` SHALL invoke `npm ci --audit-signatures` when installing
+production dependencies. If signature verification fails, the image build
+SHALL fail.
+
+#### Scenario: Tampered lockfile entry fails the image build
+- **WHEN** an attacker mutates a package tarball signature in the
+  registry and a CI build runs
+- **THEN** `npm ci --audit-signatures` exits non-zero and the image does
+  not build
+
+### Requirement: Dependabot groups updates for the chat SDK family
+The `.github/dependabot.yml` config SHALL include a grouped-updates rule
+for the chat SDK family so that version bumps across the family land in
+a single PR, preventing partial upgrades.
+
+#### Scenario: Upstream chat 4.27.0 release produces one grouped PR
+- **WHEN** the chat SDK family publishes a new minor release
+- **THEN** Dependabot opens a single PR covering every chat-family
+  package, not one PR per package
diff --git a/openspec/changes/res-177-p0-quality-hardening/specs/ci-quality-gates/spec.md b/openspec/changes/res-177-p0-quality-hardening/specs/ci-quality-gates/spec.md
new file mode 100644
index 00000000..c2450aef
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/specs/ci-quality-gates/spec.md
@@ -0,0 +1,61 @@
+## ADDED Requirements
+
+### Requirement: CodeQL fails the job on findings and uploads SARIF to Code Scanning
+The `.github/workflows/codeql.yml` workflow SHALL NOT set
+`continue-on-error: true` on the `analyze` step and SHALL NOT pass
+`upload: never`. Vulnerabilities found by CodeQL SHALL fail the CI run
+and SHALL surface in the GitHub Security → Code Scanning tab.
+
+#### Scenario: A new CodeQL finding blocks the PR
+- **WHEN** a contributor introduces a pattern CodeQL flags
+- **THEN** the `Analyze (python)` or `Analyze (javascript-typescript)` job
+  fails and the finding appears in the repo's Code Scanning tab
+
+### Requirement: CI enforces a Python coverage floor
+The backend CI job SHALL run
+`uv run pytest --cov=src/beever_atlas --cov-fail-under=50` (or an equivalent
+invocation that fails when coverage drops below the floor). The floor
+SHALL be ratcheted upward via follow-up PRs, not lowered.
+
+#### Scenario: A PR that drops coverage below the floor fails CI
+- **WHEN** a PR removes test coverage that pushes overall line coverage
+  below the configured floor
+- **THEN** the backend CI job fails with a coverage-below-threshold error
+
+### Requirement: CI runs a Python typechecker
+The backend CI job SHALL run `pyright` (or an equivalent typechecker) with
+a documented loose configuration. The configuration SHALL be tightened
+over time via follow-up PRs.
+
+#### Scenario: Typechecker failure blocks the PR
+- **WHEN** a PR introduces a type error (e.g. calling a function with the
+  wrong number of args)
+- **THEN** the backend CI job fails at the typecheck step
+
+### Requirement: Bot CI job has lint parity with web
+The `bot/` package SHALL declare an `npm run lint` script backed by
+`@typescript-eslint` with at minimum the rules `no-explicit-any`,
+`no-unused-vars`, and `no-floating-promises`. The `ci.yml` bot job SHALL
+invoke `npm run lint` alongside `npm run build` and `npm test`.
+
+#### Scenario: Lint failure blocks the PR
+- **WHEN** a PR to the `bot/` package introduces an explicit `any` or a
+  floating promise
+- **THEN** the bot CI job fails at the lint step
+
+### Requirement: CI enforces `ruff format --check`
+The backend CI job SHALL run `uv run ruff format --check src/ tests/`.
+Formatting drift SHALL fail the job.
+
+#### Scenario: Unformatted code blocks the PR
+- **WHEN** a PR lands Python source with non-canonical formatting
+- **THEN** the backend CI job fails at the format step
+
+### Requirement: Coverage floor is discoverable in CI config
+The coverage floor value SHALL live in a single source-of-truth (e.g.
+`pyproject.toml [tool.coverage.report.fail_under]` or a CI env var) so
+future ratchet PRs touch one location.
+
+#### Scenario: Raising the floor is a single-line change
+- **WHEN** a maintainer wants to raise the floor from 50 to 55
+- **THEN** they can do so by editing one file
diff --git a/openspec/changes/res-177-p0-quality-hardening/specs/container-supply-chain/spec.md b/openspec/changes/res-177-p0-quality-hardening/specs/container-supply-chain/spec.md
new file mode 100644
index 00000000..1ba40acc
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/specs/container-supply-chain/spec.md
@@ -0,0 +1,41 @@
+## ADDED Requirements
+
+### Requirement: Every base image is pinned by `@sha256:<digest>`
+Every `FROM` directive in every Dockerfile and every `image:` entry in
+every compose file SHALL pin the base image by its `@sha256:<digest>`.
+Floating tags (e.g. `python:3.12-slim` without a digest) SHALL NOT appear
+in committed images. `COPY --from=<external-image>` stages SHALL likewise
+pin by digest.
+
+#### Scenario: Floating tag in a Dockerfile fails CI
+- **WHEN** a contributor adds `FROM python:3.12-slim` (no digest)
+- **THEN** CI fails with a lint step pointing at the unpinned image
+
+#### Scenario: Every base image in the monorepo resolves to a digest
+- **WHEN** an auditor greps every `FROM ` and `image:` line in
+  `Dockerfile`, `web/Dockerfile`, `bot/Dockerfile`, `docker-compose.yml`,
+  `docker-compose.nebula.yml`
+- **THEN** every match includes `@sha256:<64-hex>`
+
+### Requirement: Digest updates are automated via Dependabot
+The `.github/dependabot.yml` config SHALL declare `package-ecosystem:
+docker` for every path that contains a Dockerfile or compose file, so
+Dependabot opens PRs when an upstream image publishes a new digest for
+the pinned tag.
+
+#### Scenario: Upstream digest change produces a Dependabot PR
+- **WHEN** an upstream registry publishes a new digest for one of the
+  pinned `<tag>` values
+- **THEN** Dependabot opens a PR updating the digest, referencing the
+  new upstream release notes
+
+### Requirement: Deploy bootstrap uses only pinned images
+`scripts/deploy/bootstrap.sh` (and any other deploy entry point) SHALL
+invoke `docker compose build` against compose files whose images are all
+digest-pinned. The bootstrap SHALL NOT mutate or strip digests at build
+time.
+
+#### Scenario: Deploy pipeline builds from pinned images only
+- **WHEN** the deploy workflow runs `docker compose build`
+- **THEN** every image pulled is identified by its digest, not a floating
+  tag
diff --git a/openspec/changes/res-177-p0-quality-hardening/specs/docs-env-hygiene/spec.md b/openspec/changes/res-177-p0-quality-hardening/specs/docs-env-hygiene/spec.md
new file mode 100644
index 00000000..c64fea98
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/specs/docs-env-hygiene/spec.md
@@ -0,0 +1,68 @@
+## ADDED Requirements
+
+### Requirement: `.env.example` documents every runtime-consumed setting
+The `.env.example` file at the repo root SHALL contain an entry for every
+environment variable that backend, bot, or web code reads at runtime — including
+chat-platform credentials (`SLACK_BOT_TOKEN`, `SLACK_SIGNING_SECRET`,
+`DISCORD_BOT_TOKEN`, `DISCORD_PUBLIC_KEY`, `DISCORD_APPLICATION_ID`,
+`TEAMS_APP_ID`, `TEAMS_APP_PASSWORD`, `TELEGRAM_BOT_TOKEN`), web build-time
+vars (`VITE_BEEVER_API_KEY`, `VITE_BEEVER_ADMIN_TOKEN`), and every
+`Settings` field declared in `src/beever_atlas/infra/config.py`. Each
+entry SHALL carry a one-line comment and a safe default (or the literal
+string `__REQUIRED__` for secrets that must be operator-supplied).
+
+#### Scenario: Fresh contributor can bring the stack up from `.env.example`
+- **WHEN** a contributor runs `cp .env.example .env` and starts the dev stack
+- **THEN** every environment variable read by source code resolves to a
+  value documented in `.env.example`, and no `KeyError`/`undefined env var`
+  is raised during import of the backend, bot, or web build
+
+#### Scenario: A new `Settings` field requires an `.env.example` entry
+- **WHEN** a developer adds a new `Settings` field in
+  `src/beever_atlas/infra/config.py`
+- **THEN** CI enforces that the new field name appears in `.env.example`
+  (via a smoke test or lint)
+
+### Requirement: `openspec/` directory is tracked, not ignored
+The `.gitignore` file SHALL NOT ignore the `openspec/` directory. New
+OpenSpec changes SHALL appear in `git status` so PR reviewers can see them.
+
+#### Scenario: Creating a new OpenSpec change makes it visible in `git status`
+- **WHEN** a developer runs `openspec new change <name>`
+- **THEN** `git status` shows the new
+  `openspec/changes/<name>/proposal.md` as an untracked file
+
+### Requirement: `CHANGELOG.md` reflects shipped work
+The `[Unreleased]` section of `CHANGELOG.md` SHALL list every user-visible
+change on `main` that post-dates the most recent git tag. The compare link
+at the bottom of the file SHALL point from the most recent tag to `HEAD`,
+not `HEAD...HEAD`.
+
+#### Scenario: New PR updates CHANGELOG
+- **WHEN** a PR with user-visible changes lands on `main`
+- **THEN** `CHANGELOG.md [Unreleased]` lists it (or the repo has a CI step
+  that blocks the PR until one is added)
+
+### Requirement: `web/README.md` is Beever-specific, not the Vite scaffold
+The `web/README.md` file SHALL describe the Beever Atlas web app, its dev
+workflow (`npm run dev`, `npm test`, `npm run lint`), and the environment
+variables it consumes (`VITE_API_URL`, `VITE_BEEVER_API_KEY`,
+`VITE_BEEVER_ADMIN_TOKEN`). It SHALL NOT be the verbatim Vite scaffold
+template.
+
+#### Scenario: Web README answers "what lives here" on first read
+- **WHEN** a contributor opens `web/README.md`
+- **THEN** it identifies the app as part of Beever Atlas, lists the dev
+  scripts, and names the env vars required to run it locally
+
+### Requirement: Root-level binaries and orphan docs are housed correctly
+`Beever_Atlas_Feature_Spec.docx` SHALL be moved out of the repo root (into
+`docs/` or a Release attachment). `daily_update.md` SHALL be untracked
+(it is already present in `.gitignore`). Security review outputs
+SHALL live under `docs/security-reviews/` or be excluded from `main`.
+
+#### Scenario: Repo root contains no stray binaries
+- **WHEN** a contributor lists the repo root
+- **THEN** the only binary-format files at root are those explicitly
+  needed for top-level tooling (e.g., `pyproject.toml`, `package.json`,
+  not `.docx` spec documents)
diff --git a/openspec/changes/res-177-p0-quality-hardening/specs/web-test-harness/spec.md b/openspec/changes/res-177-p0-quality-hardening/specs/web-test-harness/spec.md
new file mode 100644
index 00000000..06126f7a
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/specs/web-test-harness/spec.md
@@ -0,0 +1,38 @@
+## ADDED Requirements
+
+### Requirement: `web/src/test-setup.ts` provides a working `localStorage`
+The web test setup file SHALL ensure `window.localStorage` has working
+`getItem`, `setItem`, `removeItem`, `clear`, `key`, and `length` members
+before any test runs. The shim SHALL be installed idempotently so that if
+jsdom provides a working `Storage`, the shim does not clobber it.
+
+#### Scenario: `localStorage.clear()` works in every test file
+- **WHEN** a test file calls `localStorage.clear()` in `beforeEach`
+- **THEN** no `localStorage.clear is not a function` error is raised,
+  regardless of jsdom version
+
+#### Scenario: `localStorage.setItem`/`getItem` roundtrip works
+- **WHEN** a test calls `localStorage.setItem("k", "v")` and immediately
+  reads it back
+- **THEN** `localStorage.getItem("k") === "v"`
+
+### Requirement: `web npm test -- --run` is green on `main`
+The web test suite SHALL complete with zero failing specs on a fresh
+`main` checkout.
+
+#### Scenario: Fresh checkout runs green
+- **WHEN** a contributor runs `cd web && npm install && npm test -- --run`
+  on `main`
+- **THEN** the test command exits 0 with zero failing specs
+
+### Requirement: `ChatInputBar` tool-count label matches its test
+The `ChatInputBar` component's tool-count label SHALL match the format
+asserted by its tests (either restore `(N/M)` rendering or update the
+test to match the current label). Drift between the component and test
+SHALL NOT land on `main`.
+
+#### Scenario: Tools label assertion matches rendered DOM
+- **WHEN** `web/src/components/channel/__tests__/ChatInputBar.tools.test.tsx`
+  runs
+- **THEN** every assertion targeting the tool-count label finds a matching
+  string in the rendered output
diff --git a/openspec/changes/res-177-p0-quality-hardening/tasks.md b/openspec/changes/res-177-p0-quality-hardening/tasks.md
new file mode 100644
index 00000000..9fe09b4d
--- /dev/null
+++ b/openspec/changes/res-177-p0-quality-hardening/tasks.md
@@ -0,0 +1,106 @@
+## 0. Baseline
+
+- [ ] 0.1 Confirm branch `feature/res-177-p0-quality-hardening` is current and tracks `main`.
+- [ ] 0.2 Capture baselines: `uv run pytest` pass/fail counts, `cd bot && npm test` counts, `cd web && npm test -- --run` pass/fail counts, `uv run ruff check .` count, `wc -l bot/src/bridge.ts`.
+- [ ] 0.3 Move `RES-213, RES-206, RES-194, RES-195, RES-214, RES-208, RES-209` to **In Progress** on Linear with a comment linking this OpenSpec change.
+
+## 1. Phase 1 — RES-213 (Q8) Docs, .env.example, repo hygiene
+
+- [ ] 1.1 Drop `openspec/` entry from `.gitignore`; verify `git status` then surfaces `openspec/changes/res-177-p0-quality-hardening/` as untracked.
+- [ ] 1.2 Rewrite `.env.example`: grouped sections (Core, Chat Platform Credentials, Web Build, Advanced Tuning). Every env var read by backend (`Settings` fields in `infra/config.py`), bot (`bot/src/index.ts`), or web (`web/src/lib/api.ts`) must appear with a 1-line comment and a safe default or `__REQUIRED__`.
+- [ ] 1.3 Replace `web/README.md` with a Beever-specific overview covering: purpose, `npm install`, `npm run dev`, `npm test`, `npm run lint`, env vars (`VITE_API_URL`, `VITE_BEEVER_API_KEY`, `VITE_BEEVER_ADMIN_TOKEN`).
+- [ ] 1.4 Add `src/beever_atlas/README.md` with one line per top-level package.
+- [ ] 1.5 Add `bot/README.md` mirroring `web/README.md`'s pattern; add `scripts/README.md` with one line per script.
+- [ ] 1.6 Fix `CHANGELOG.md`: backfill `[Unreleased]` with commits since `v0.1.1` (mobile responsiveness, consolidation streaming fix, wiki Key Facts, glossary fix, attachments, brand refresh, security fixes); fix the `compare/HEAD...HEAD` link to `compare/v0.1.1...HEAD`.
+- [ ] 1.7 Move `Beever_Atlas_Feature_Spec.docx` out of repo root (to `docs/` or delete); `git rm --cached daily_update.md` if tracked.
+- [ ] 1.8 Bump `web/eslint.config.js` `ecmaVersion` to `2022`; delete or fill the commented-out scaffolding in `openspec/config.yaml` lines 3–21.
+- [ ] 1.9 Verify: `cp .env.example .env` followed by importing `beever_atlas.infra.config` raises no `MissingEnv`/`KeyError` for documented fields.
+- [ ] 1.10 Commit `docs(hygiene): close RES-213 (Q8) — .env.example coverage, READMEs, openspec visibility, CHANGELOG`; move RES-213 → Done with SHA.
+
+## 2. Phase 2 — RES-206 (Q1) CI quality gates
+
+- [ ] 2.1 `.github/workflows/codeql.yml`: delete `continue-on-error: true` and `upload: never` on the `analyze` step.
+- [ ] 2.2 Add `pyright` loose config to `pyproject.toml` ([tool.pyright] with `typeCheckingMode: "basic"` and an explicit include list) and install as a dev dep.
+- [ ] 2.3 Add `pyright` step to `.github/workflows/ci.yml` backend job.
+- [ ] 2.4 Wire `uv run pytest --cov=src/beever_atlas --cov-report=term-missing --cov-fail-under=50` into `ci.yml` backend job; add `[tool.coverage]` config to `pyproject.toml`.
+- [ ] 2.5 Add `uv run ruff format --check src/ tests/` step to `ci.yml`.
+- [ ] 2.6 Add bot ESLint: `bot/.eslintrc.json` with `@typescript-eslint` rules (`no-explicit-any: error`, `no-unused-vars: error`, `no-floating-promises: error`); `bot/package.json` script `"lint": "eslint src --ext .ts"`; add lint step to `ci.yml` bot job.
+- [ ] 2.7 Fix any lint/typecheck/format/coverage findings surfaced by the new gates. If CodeQL finds existing issues, either fix them or file follow-up tickets and add `// codeql[suppress]` with the ticket link.
+- [ ] 2.8 Verify: `uv run pyright`, `uv run ruff format --check`, `uv run pytest --cov --cov-fail-under=50`, `cd bot && npm run lint` all exit 0 locally.
+- [ ] 2.9 Commit `chore(ci): close RES-206 (Q1) — hard-fail CodeQL + pyright + coverage + ruff format + bot lint`; move RES-206 → Done.
+
+## 3. Phase 3 — RES-194 (H5) Docker digest pins
+
+- [ ] 3.1 Resolve current manifest digests for: `python:3.12-slim`, `node:22-alpine`, `nginx:alpine`, `ghcr.io/astral-sh/uv:<specific-version>` (replace `:latest` with a specific release).
+- [ ] 3.2 Resolve digests for compose images: `cr.weaviate.io/semitechnologies/weaviate:1.28.0`, `neo4j:5.26-community`, `mongo:7.0`, `redis:7-alpine`.
+- [ ] 3.3 Resolve digests for `docker-compose.nebula.yml`: `vesoft/nebula-graphd:v3.8.0`, `vesoft/nebula-metad:v3.8.0`, `vesoft/nebula-storaged:v3.8.0`, `vesoft/nebula-console:v3.8.0` (as applicable).
+- [ ] 3.4 Pin every `FROM` and `image:` by `@sha256:<digest>` while keeping the human-readable `:<tag>` for context (`image: foo:1.2.3@sha256:…`).
+- [ ] 3.5 Pin the `COPY --from=ghcr.io/astral-sh/uv:latest` stage — switch to a dated release tag + digest.
+- [ ] 3.6 Add a `docker` entry to `.github/dependabot.yml` for `/`, `/web`, `/bot` (so all three Dockerfiles get digest-update PRs); add entries for the compose files if Dependabot supports them in the current version.
+- [ ] 3.7 Add a CI lint step (simple grep) that fails on any `FROM` / `image:` without `@sha256:`.
+- [ ] 3.8 Verify: `docker compose build` succeeds locally with the pinned digests (or confirm on a host with Docker available).
+- [ ] 3.9 Commit `chore(supply-chain): close RES-194 (H5) — digest-pin all Docker base images + Dependabot`; move RES-194 → Done.
+
+## 4. Phase 4 — RES-195 (H6) Chat SDK pinning
+
+- [ ] 4.1 Edit `bot/package.json`: strip `^` from `chat`, `@chat-adapter/{slack,discord,teams,telegram,state-redis}`, and `chat-adapter-mattermost`. Pin to the exact version currently in `bot/package-lock.json`.
+- [ ] 4.2 `cd bot && npm install` to regenerate `bot/package-lock.json`; confirm no version drift.
+- [ ] 4.3 Edit `bot/Dockerfile` to use `npm ci --audit-signatures` for the prod install stage.
+- [ ] 4.4 Add grouped Dependabot rule in `.github/dependabot.yml` for the chat family (`/bot/` ecosystem, group name `chat-sdk-family`, patterns matching the above packages).
+- [ ] 4.5 Add a CI lint step that fails on a caret or tilde range in the chat family of `bot/package.json` (simple grep).
+- [ ] 4.6 Verify: `cd bot && npm ci --audit-signatures` succeeds; `cd bot && npm test` stays at its current pass count.
+- [ ] 4.7 Commit `chore(supply-chain): close RES-195 (H6) — pin chat SDK family + audit-signatures + Dependabot group`; move RES-195 → Done.
+
+## 5. Phase 5 — RES-214 (Q9) Web test harness
+
+- [ ] 5.1 Add idempotent `localStorage` shim to `web/src/test-setup.ts` (install only when `window.localStorage` is absent or its `clear`/`setItem` are not functions) — covers the jsdom 27 prototype-inheritance regression.
+- [ ] 5.2 Run `cd web && npm test -- --run` and confirm the 8 `localStorage`-related failures are green.
+- [ ] 5.3 Inspect `ChatInputBar` component + the current DOM the test renders; either restore `(N/M)` label format in the component or update `ChatInputBar.tools.test.tsx:46` assertion to match the new label.
+- [ ] 5.4 Run `cd web && npm test -- --run` and confirm the `ChatInputBar` failure is green.
+- [ ] 5.5 Verify the total: **0 failing specs**, pre-change pass count preserved.
+- [ ] 5.6 Commit `fix(web-tests): close RES-214 (Q9) — localStorage shim + ChatInputBar label alignment`; move RES-214 → Done.
+
+## 6. Phase 6 — RES-208 (Q3) Backend test baseline
+
+- [ ] 6.1 In `tests/contracts/test_graph_store_contract.py`, replace each top-level `from beever_atlas.stores.nebula_store import NebulaStore` (lines 74, 237, 256, 277) with a `pytest.importorskip("nebula3")` guarded import (or module-level `pytestmark = pytest.mark.skipif(importlib.util.find_spec("nebula3") is None, reason="nebula extra not installed")`).
+- [ ] 6.2 Rewrite `src/beever_atlas/infra/health.py`: wrap each probe in its own typed `try/except`; aggregate into `{status: "healthy"|"degraded"|"unhealthy", failing: [...], checks: {...}}`; handler must never raise; return HTTP 200 always.
+- [ ] 6.3 Update `tests/test_health.py` to expect the new contract.
+- [ ] 6.4 Skip or service-gate `tests/test_ask_share.py` (11 errors when Mongo/Redis absent) and `tests/test_ask_disabled_tools.py` (4 failures) using `pytest.importorskip` on the service client or a `pytest.mark.integration` marker registered in `pyproject.toml`.
+- [ ] 6.5 Replace the enumerated silent `except Exception: pass` sites with `except Exception as exc: logger.debug(...)` at: `api/dev.py:46,53,75,127`; `server/app.py:149`; `services/batch_processor.py:886`; `stores/nebula_store.py:278,489,1613`; `wiki/compiler.py:1899,1912,1922,1924,1934,2318,2405`.
+- [ ] 6.6 Add unit tests raising `services/share_store.py` coverage to ≥ 80% using `mongomock`.
+- [ ] 6.7 Add unit tests raising `agents/callbacks/quality_gates.py` coverage to ≥ 80%.
+- [ ] 6.8 Delete obsolete skipped tests in `tests/test_pdf_chunking.py:74-89` if the functions they reference are truly gone.
+- [ ] 6.9 Verify: `uv run pytest` exits 0 with services OFF; coverage floor 50% still passes.
+- [ ] 6.10 Commit `fix(backend-tests): close RES-208 (Q3) — importorskip nebula3, health never raises, silent-except bundle, share_store/quality_gates coverage`; move RES-208 → Done.
+
+## 7. Phase 7 — RES-209 (Q4) bridge.ts decomposition
+
+- [ ] 7.1 Create `bot/src/bridge/platformError.ts` containing the canonical hardened `classifyPlatformError` + `PlatformErrorShape` (move from `bot/src/bridge.ts:233–250`). Delete `bot/src/bridge-classifier.ts`.
+- [ ] 7.2 Update both test files (`bot/src/bridge-error-classifier.test.ts`, `bot/src/bridge-classifier.test.ts`) to import from `./bridge/platformError.js`. If the two test files cover the same surface, merge into one.
+- [ ] 7.3 Create `bot/src/bridge/withPlatformError.ts`: higher-order wrapper `(handler) => async (req, res) => { try { await handler(req, res); } catch (err) { const { status, code } = classifyPlatformError(err); jsonResponse(res, status, { error: String(err), code }); } }`.
+- [ ] 7.4 Create `bot/src/http-utils.ts`: move `jsonResponse` out of `bridge.ts`; export it; update `bot/src/index.ts:342,439,447,470,498,506` to import from `http-utils`.
+- [ ] 7.5 Create `bot/src/logger.ts`: minimal level-gated logger honoring `LOG_LEVEL`; gate `bridge.ts:1744-1748` debug lines through `logger.debug(...)`.
+- [ ] 7.6 Create `bot/src/bridge/app.ts`: owns the Node HTTP server + the route table `ROUTE_TABLE: Array<{ method, pattern, handler }>`. Provide `registerBridgeRoutes` as a thin re-export from here.
+- [ ] 7.7 Split route handlers into route modules: `bot/src/bridge/routes/send.ts`, `bot/src/bridge/routes/history.ts`, `bot/src/bridge/routes/files.ts`, `bot/src/bridge/routes/validate.ts`, etc. — one file per route group, each handler wrapped in `withPlatformError`.
+- [ ] 7.8 Replace the 201-line `if/else` cascade in `registerBridgeRoutes` with a table-driven dispatcher. Collapse legacy + connection-scoped + platform-prefixed variants via a single resolver.
+- [ ] 7.9 Extract magic numbers: `DEFAULT_MESSAGE_LIMIT = 100`, `MAX_MESSAGE_LIMIT = 500` — both in the relevant route module.
+- [ ] 7.10 Keep `bot/src/bridge.ts` as a backward-compat re-export shim (`export * from "./bridge/app.js"`) so `bot/src/index.ts` keeps compiling unchanged.
+- [ ] 7.11 Verify: `wc -l bot/src/bridge.ts` < 50 (shim); every file under `bot/src/` is < 1,000 lines.
+- [ ] 7.12 Verify: `cd bot && npm run build && npm test && npm run lint` all pass; current 116-test baseline preserved (or raised).
+- [ ] 7.13 Smoke-test: start backend + bot, create a connection, fetch via `/bridge/channels`, fetch a file via `/bridge/files` — all three endpoints respond with the same shape as before.
+- [ ] 7.14 Commit `refactor(bot): close RES-209 (Q4) — decompose bridge.ts, dedupe classifyPlatformError, withPlatformError wrapper`; move RES-209 → Done.
+
+## 8. Cross-cutting verification + PR
+
+- [ ] 8.1 Run the full backend suite (`uv run pytest --cov --cov-fail-under=50`) on the final commit — must be green with zero regressions vs. the 1,140-pass baseline.
+- [ ] 8.2 Run `cd bot && npm test && npm run lint && npm run build` — must be green.
+- [ ] 8.3 Run `cd web && npm test -- --run && npm run lint && npm run build` — zero failing specs, zero lint errors.
+- [ ] 8.4 Run `uv run ruff check .` and `uv run ruff format --check src/ tests/` — clean.
+- [ ] 8.5 Run `uv run pyright` — clean (given loose config).
+- [ ] 8.6 Run CodeQL locally (or in a draft PR) — no blocking findings, or each one has a tracking ticket.
+- [ ] 8.7 Post a close-out comment on Linear RES-177 listing each sub-issue, its commit SHA, and the final test counts.
+- [ ] 8.8 Open PR from `feature/res-177-p0-quality-hardening` → `main` with a summary block per closed sub-issue and a test-plan checklist.
+
+## 9. Archive
+
+- [ ] 9.1 After PR merge, run `/opsx:archive res-177-p0-quality-hardening` to fold the seven new capability specs into `openspec/specs/`.
diff --git a/openspec/config.yaml b/openspec/config.yaml
new file mode 100644
index 00000000..b4bbeb94
--- /dev/null
+++ b/openspec/config.yaml
@@ -0,0 +1 @@
+schema: spec-driven
diff --git a/scripts/deploy/.gitignore b/scripts/deploy/.gitignore
new file mode 100644
index 00000000..95f7491e
--- /dev/null
+++ b/scripts/deploy/.gitignore
@@ -0,0 +1 @@
+.state/
diff --git a/scripts/deploy/README.md b/scripts/deploy/README.md
new file mode 100644
index 00000000..161e8b50
--- /dev/null
+++ b/scripts/deploy/README.md
@@ -0,0 +1,53 @@
+# Beever Atlas — AWS Deploy Automation
+
+Single-instance deploy of the full stack (backend, web, bot, Weaviate, Neo4j, MongoDB, Redis) to a single EC2 host via `docker compose`. Intended for internal testing.
+
+## Prerequisites
+- AWS CLI configured (`aws sts get-caller-identity` works)
+- `rsync`, `ssh`, `jq` installed locally
+- Region: `us-east-2`
+
+## Usage
+
+```bash
+# One-shot deploy (provisions AWS infra + uploads code + starts services)
+./scripts/deploy/deploy.sh
+
+# Re-deploy after code changes (reuses existing instance)
+./scripts/deploy/deploy.sh
+
+# SSH into the box
+./scripts/deploy/ssh.sh
+
+# Tear everything down
+./scripts/deploy/destroy.sh
+```
+
+## Files
+
+- `deploy.sh` — end-to-end entrypoint
+- `provision.sh` — creates AWS infra (keypair, SG, EC2, EIP)
+- `bootstrap.sh` — runs on the instance; installs Docker + starts compose
+- `ssh.sh` — convenience SSH wrapper
+- `destroy.sh` — deletes all created AWS resources
+- `env.template` — .env generator template
+- `.state/` — gitignored; stores resource IDs, SSH key, generated secrets
+
+## After first deploy
+
+The app boots with **placeholder** API keys. To make it actually work:
+
+```bash
+./scripts/deploy/ssh.sh
+cd /opt/beever-atlas-v2
+sudo nano .env            # fill GOOGLE_API_KEY, JINA_API_KEY, TAVILY_API_KEY
+sudo docker compose up -d --build
+```
+
+## Access
+
+After deploy completes, URLs are printed:
+- Web UI: `http://<EIP>/`
+- API:    `http://<EIP>:8000/api/health`
+
+Ports 22, 80, 8000 are restricted to **your current public IP** only. Re-run `./scripts/deploy/update-ip.sh` if your IP changes.
diff --git a/scripts/deploy/bootstrap.sh b/scripts/deploy/bootstrap.sh
new file mode 100755
index 00000000..2ae4c09f
--- /dev/null
+++ b/scripts/deploy/bootstrap.sh
@@ -0,0 +1,58 @@
+#!/usr/bin/env bash
+# Runs ON the EC2 instance. Installs Docker + Caddy + starts compose stack.
+# Caddy listens on 80/443 with auto Let's Encrypt; reverse-proxies to web (:3000).
+set -euo pipefail
+
+HOSTNAME_ARG="${1:?hostname required}"
+cd /opt/beever-atlas-v2
+
+if ! command -v docker >/dev/null 2>&1; then
+  echo "[bootstrap] installing Docker"
+  sudo apt-get update -y || sudo apt-get update -y
+  sudo apt-get install -y ca-certificates curl gnupg
+  sudo install -m 0755 -d /etc/apt/keyrings
+  curl -fsSL https://download.docker.com/linux/ubuntu/gpg | \
+    sudo gpg --dearmor -o /etc/apt/keyrings/docker.gpg
+  sudo chmod a+r /etc/apt/keyrings/docker.gpg
+  echo "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.gpg] \
+https://download.docker.com/linux/ubuntu $(. /etc/os-release && echo "$VERSION_CODENAME") stable" | \
+    sudo tee /etc/apt/sources.list.d/docker.list >/dev/null
+  sudo apt-get update -y
+  sudo apt-get install -y docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin
+  sudo usermod -aG docker ubuntu
+fi
+
+if ! command -v caddy >/dev/null 2>&1; then
+  echo "[bootstrap] installing Caddy"
+  sudo apt-get install -y debian-keyring debian-archive-keyring apt-transport-https curl
+  curl -1sLf 'https://dl.cloudsmith.io/public/caddy/stable/gpg.key' | \
+    sudo gpg --dearmor -o /usr/share/keyrings/caddy-stable-archive-keyring.gpg
+  curl -1sLf 'https://dl.cloudsmith.io/public/caddy/stable/debian.deb.txt' | \
+    sudo tee /etc/apt/sources.list.d/caddy-stable.list >/dev/null
+  sudo apt-get update -y
+  sudo apt-get install -y caddy
+fi
+
+echo "[bootstrap] writing Caddyfile for $HOSTNAME_ARG"
+sudo tee /etc/caddy/Caddyfile >/dev/null <<EOF
+$HOSTNAME_ARG {
+    encode gzip
+    reverse_proxy localhost:3000
+}
+EOF
+sudo systemctl reload caddy || sudo systemctl restart caddy
+
+echo "[bootstrap] starting compose stack"
+sudo docker compose pull 2>/dev/null || true
+sudo docker compose up -d --build
+
+echo "[bootstrap] waiting for backend health (up to 5 min)..."
+for i in $(seq 1 60); do
+  if curl -fsS http://localhost:8000/api/health >/dev/null 2>&1; then
+    echo "[bootstrap] backend healthy"
+    break
+  fi
+  sleep 5
+done
+
+sudo docker compose ps
diff --git a/scripts/deploy/deploy.sh b/scripts/deploy/deploy.sh
new file mode 100755
index 00000000..59b0d04e
--- /dev/null
+++ b/scripts/deploy/deploy.sh
@@ -0,0 +1,112 @@
+#!/usr/bin/env bash
+# End-to-end deploy: provision AWS → rsync code → generate .env → bootstrap (Docker + Caddy + HTTPS).
+set -euo pipefail
+
+HERE="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+REPO="$(cd "$HERE/../.." && pwd)"
+STATE="$HERE/.state"
+
+log() { echo -e "\033[1;34m[deploy]\033[0m $*"; }
+
+log "1/5  provisioning AWS infra"
+bash "$HERE/provision.sh"
+
+PUBLIC_IP="$(cat "$STATE/public_ip")"
+HOSTNAME="$(cat "$STATE/hostname")"
+NAME="${NAME:-beever-atlas}"
+KEY_FILE="$STATE/${NAME}-key.pem"
+SSH_OPTS=(-i "$KEY_FILE" -o StrictHostKeyChecking=no -o UserKnownHostsFile=/dev/null -o ConnectTimeout=10)
+
+log "2/5  waiting for SSH on $PUBLIC_IP"
+for i in $(seq 1 40); do
+  if ssh "${SSH_OPTS[@]}" ubuntu@"$PUBLIC_IP" 'echo ok' >/dev/null 2>&1; then
+    log "      SSH ready"; break
+  fi
+  sleep 5
+  [[ $i -eq 40 ]] && { echo "SSH never came up" >&2; exit 1; }
+done
+
+log "3/5  generating .env from .env.example (host=$HOSTNAME)"
+gen_secret() { openssl rand -hex 32; }
+gen_password() { openssl rand -base64 24 | tr -d '=+/' | head -c 24; }
+
+for f in master_key api_key admin_token weaviate_key bridge_key; do
+  [[ -f "$STATE/$f" ]] || gen_secret > "$STATE/$f"
+done
+[[ -f "$STATE/neo4j_password" ]]  || gen_password > "$STATE/neo4j_password"
+[[ -f "$STATE/nebula_password" ]] || gen_password > "$STATE/nebula_password"
+
+ENV_OUT="$STATE/.env.generated"
+cp "$REPO/.env.example" "$ENV_OUT"
+
+patch_env() {
+  python3 - "$ENV_OUT" "$1" "$2" <<'PY'
+import re, sys
+p, k, v = sys.argv[1], sys.argv[2], sys.argv[3]
+s = open(p).read()
+new, n = re.subn(rf'^{re.escape(k)}=.*$', f'{k}={v}', s, count=1, flags=re.M)
+if n == 0:
+    new = s.rstrip() + f'\n{k}={v}\n'
+open(p, 'w').write(new)
+PY
+}
+
+NEO4J_PW="$(cat "$STATE/neo4j_password")"
+patch_env BEEVER_ENV            "production"
+patch_env BEEVER_API_URL        "https://$HOSTNAME"
+patch_env CORS_ORIGINS          "https://$HOSTNAME"
+patch_env VITE_API_URL          "https://$HOSTNAME"
+patch_env BEEVER_API_KEYS       "$(cat "$STATE/api_key")"
+patch_env VITE_BEEVER_API_KEY   "$(cat "$STATE/api_key")"
+patch_env BEEVER_ADMIN_TOKEN    "$(cat "$STATE/admin_token")"
+patch_env WEAVIATE_API_KEY      "$(cat "$STATE/weaviate_key")"
+patch_env NEO4J_AUTH            "neo4j/$NEO4J_PW"
+patch_env NEO4J_PASSWORD        "$NEO4J_PW"
+patch_env NEBULA_PASSWORD       "$(cat "$STATE/nebula_password")"
+patch_env BRIDGE_API_KEY        "$(cat "$STATE/bridge_key")"
+patch_env CREDENTIAL_MASTER_KEY "$(cat "$STATE/master_key")"
+patch_env ADAPTER_MOCK          "false"
+
+log "4/5  syncing code to instance"
+ssh "${SSH_OPTS[@]}" ubuntu@"$PUBLIC_IP" \
+  'sudo mkdir -p /opt/beever-atlas-v2 && sudo chown -R ubuntu:ubuntu /opt/beever-atlas-v2'
+
+rsync -az --delete \
+  --exclude='.git' --exclude='node_modules' --exclude='web/node_modules' \
+  --exclude='web/dist' --exclude='.venv' --exclude='__pycache__' --exclude='*.pyc' \
+  --exclude='scripts/deploy/.state' --exclude='.omc' --exclude='memory' \
+  -e "ssh ${SSH_OPTS[*]}" \
+  "$REPO/" ubuntu@"$PUBLIC_IP":/opt/beever-atlas-v2/
+
+scp "${SSH_OPTS[@]}" "$ENV_OUT" ubuntu@"$PUBLIC_IP":/opt/beever-atlas-v2/.env
+
+log "5/5  running bootstrap on instance"
+ssh "${SSH_OPTS[@]}" ubuntu@"$PUBLIC_IP" \
+  "cd /opt/beever-atlas-v2 && bash scripts/deploy/bootstrap.sh '$HOSTNAME'"
+
+cat <<EOF
+
+\033[1;32m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\033[0m
+\033[1;32m  DEPLOY COMPLETE\033[0m
+\033[1;32m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\033[0m
+
+  URL:          https://$HOSTNAME
+  IP (raw):     $PUBLIC_IP
+  API key:      $(cat "$STATE/api_key")
+  Admin token:  $(cat "$STATE/admin_token")
+
+  SSH:          ./scripts/deploy/ssh.sh
+  Stop billing: ./scripts/deploy/stop.sh
+  Destroy:      ./scripts/deploy/destroy.sh
+
+  \033[1;33mAUTO-DEPLOY ON GIT PUSH:\033[0m
+    1. Add these GitHub repo secrets (Settings → Secrets → Actions):
+         EC2_HOST     = $PUBLIC_IP
+         EC2_SSH_KEY  = (contents of $KEY_FILE)
+    2. Push to main → .github/workflows/deploy.yml updates the server.
+
+  \033[1;33mLLM keys still placeholders:\033[0m
+    ssh in, edit /opt/beever-atlas-v2/.env (GOOGLE_API_KEY, JINA_API_KEY),
+    then sudo docker compose up -d --build
+
+EOF
diff --git a/scripts/deploy/destroy.sh b/scripts/deploy/destroy.sh
new file mode 100755
index 00000000..2adfb9fe
--- /dev/null
+++ b/scripts/deploy/destroy.sh
@@ -0,0 +1,41 @@
+#!/usr/bin/env bash
+# Tears down all AWS resources created by provision.sh
+set -euo pipefail
+
+HERE="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+STATE="$HERE/.state"
+REGION="${AWS_REGION:-us-east-2}"
+NAME="${NAME:-beever-atlas}"
+
+log() { echo "[destroy] $*"; }
+
+read -r -p "Destroy AWS resources for NAME=$NAME? [y/N] " ans
+[[ "$ans" == "y" || "$ans" == "Y" ]] || { echo "aborted"; exit 0; }
+
+if [[ -f "$STATE/instance_id" ]]; then
+  INSTANCE_ID="$(cat "$STATE/instance_id")"
+  log "terminating instance $INSTANCE_ID"
+  aws ec2 terminate-instances --region "$REGION" --instance-ids "$INSTANCE_ID" >/dev/null || true
+  aws ec2 wait instance-terminated --region "$REGION" --instance-ids "$INSTANCE_ID" || true
+  rm -f "$STATE/instance_id"
+fi
+
+if [[ -f "$STATE/eip_alloc" ]]; then
+  EIP_ALLOC="$(cat "$STATE/eip_alloc")"
+  log "releasing EIP $EIP_ALLOC"
+  aws ec2 release-address --region "$REGION" --allocation-id "$EIP_ALLOC" || true
+  rm -f "$STATE/eip_alloc"
+fi
+
+if [[ -f "$STATE/sg_id" ]]; then
+  SG_ID="$(cat "$STATE/sg_id")"
+  log "deleting SG $SG_ID"
+  aws ec2 delete-security-group --region "$REGION" --group-id "$SG_ID" || true
+  rm -f "$STATE/sg_id"
+fi
+
+log "deleting keypair"
+aws ec2 delete-key-pair --region "$REGION" --key-name "${NAME}-key" || true
+
+rm -f "$STATE/public_ip" "$STATE/.env.generated"
+log "done. (Kept SSH key + secrets in $STATE for reuse; delete manually if you want)"
diff --git a/scripts/deploy/provision.sh b/scripts/deploy/provision.sh
new file mode 100755
index 00000000..9711496f
--- /dev/null
+++ b/scripts/deploy/provision.sh
@@ -0,0 +1,133 @@
+#!/usr/bin/env bash
+# Provisions AWS resources for a single-instance Beever Atlas deploy.
+# Idempotent: re-running reuses existing resources.
+set -euo pipefail
+
+HERE="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+STATE="$HERE/.state"
+mkdir -p "$STATE"
+
+REGION="${AWS_REGION:-us-east-2}"
+# Set NAME to scope AWS resources (keypair, security group, EIP tag).
+# Defaults to `beever-atlas`; use NAME=beever-atlas-ee for the EE side-by-side deploy.
+NAME="${NAME:-beever-atlas}"
+KEY_NAME="${NAME}-key"
+SG_NAME="${NAME}-sg"
+INSTANCE_TYPE="${INSTANCE_TYPE:-t4g.large}"
+DISK_GB="${DISK_GB:-30}"
+
+log() { echo "[provision] $*" >&2; }
+
+KEY_FILE="$STATE/${KEY_NAME}.pem"
+if [[ ! -f "$KEY_FILE" ]]; then
+  log "generating SSH keypair → $KEY_FILE"
+  ssh-keygen -t ed25519 -N "" -f "$KEY_FILE" -q
+  chmod 600 "$KEY_FILE"
+fi
+
+if ! aws ec2 describe-key-pairs --region "$REGION" --key-names "$KEY_NAME" >/dev/null 2>&1; then
+  log "importing keypair to AWS as $KEY_NAME"
+  aws ec2 import-key-pair --region "$REGION" \
+    --key-name "$KEY_NAME" \
+    --public-key-material "fileb://${KEY_FILE}.pub" >/dev/null
+fi
+
+VPC_ID=$(aws ec2 describe-vpcs --region "$REGION" \
+  --filters Name=is-default,Values=true \
+  --query 'Vpcs[0].VpcId' --output text)
+SUBNET_ID=$(aws ec2 describe-subnets --region "$REGION" \
+  --filters "Name=vpc-id,Values=$VPC_ID" "Name=default-for-az,Values=true" \
+  --query 'Subnets[0].SubnetId' --output text)
+log "using VPC=$VPC_ID subnet=$SUBNET_ID"
+
+MY_IP="$(curl -s https://checkip.amazonaws.com | tr -d '[:space:]')/32"
+log "your public IP: $MY_IP"
+
+SG_ID=$(aws ec2 describe-security-groups --region "$REGION" \
+  --filters "Name=group-name,Values=$SG_NAME" "Name=vpc-id,Values=$VPC_ID" \
+  --query 'SecurityGroups[0].GroupId' --output text 2>/dev/null || echo "None")
+
+if [[ "$SG_ID" == "None" || -z "$SG_ID" ]]; then
+  log "creating security group $SG_NAME"
+  SG_ID=$(aws ec2 create-security-group --region "$REGION" \
+    --group-name "$SG_NAME" --description "Beever Atlas internal testing" \
+    --vpc-id "$VPC_ID" --query 'GroupId' --output text)
+fi
+
+# SSH restricted to your IP; HTTP/HTTPS open to world (HTTPS via Caddy/Let's Encrypt)
+aws ec2 revoke-security-group-ingress --region "$REGION" \
+  --group-id "$SG_ID" --protocol tcp --port 22 --cidr "$MY_IP" 2>/dev/null || true
+aws ec2 authorize-security-group-ingress --region "$REGION" \
+  --group-id "$SG_ID" --protocol tcp --port 22 --cidr "$MY_IP" >/dev/null 2>&1 || true
+for port in 80 443; do
+  aws ec2 authorize-security-group-ingress --region "$REGION" \
+    --group-id "$SG_ID" --protocol tcp --port "$port" --cidr 0.0.0.0/0 >/dev/null 2>&1 || true
+done
+# Open SSH to world too — GitHub Actions runners need it for push-to-deploy.
+# Still gated by ed25519 key auth (no passwords).
+aws ec2 authorize-security-group-ingress --region "$REGION" \
+  --group-id "$SG_ID" --protocol tcp --port 22 --cidr 0.0.0.0/0 >/dev/null 2>&1 || true
+log "SG $SG_ID: ssh from world (key-only), 80/443 from world"
+
+AMI_ID=$(aws ec2 describe-images --region "$REGION" \
+  --owners 099720109477 \
+  --filters "Name=name,Values=ubuntu/images/hvm-ssd-gp3/ubuntu-noble-24.04-arm64-server-*" \
+            "Name=state,Values=available" \
+  --query 'sort_by(Images, &CreationDate)[-1].ImageId' --output text)
+log "AMI=$AMI_ID"
+
+INSTANCE_ID=""
+if [[ -f "$STATE/instance_id" ]]; then
+  INSTANCE_ID="$(cat "$STATE/instance_id")"
+  STATE_NAME=$(aws ec2 describe-instances --region "$REGION" \
+    --instance-ids "$INSTANCE_ID" \
+    --query 'Reservations[0].Instances[0].State.Name' --output text 2>/dev/null || echo "missing")
+  if [[ "$STATE_NAME" != "running" && "$STATE_NAME" != "pending" ]]; then
+    INSTANCE_ID=""
+  fi
+fi
+
+if [[ -z "$INSTANCE_ID" ]]; then
+  log "launching EC2 instance ($INSTANCE_TYPE, ${DISK_GB}GB)"
+  INSTANCE_ID=$(aws ec2 run-instances --region "$REGION" \
+    --image-id "$AMI_ID" \
+    --instance-type "$INSTANCE_TYPE" \
+    --key-name "$KEY_NAME" \
+    --security-group-ids "$SG_ID" \
+    --subnet-id "$SUBNET_ID" \
+    --associate-public-ip-address \
+    --block-device-mappings "DeviceName=/dev/sda1,Ebs={VolumeSize=$DISK_GB,VolumeType=gp3}" \
+    --tag-specifications "ResourceType=instance,Tags=[{Key=Name,Value=$NAME}]" \
+    --query 'Instances[0].InstanceId' --output text)
+  echo "$INSTANCE_ID" > "$STATE/instance_id"
+  log "instance $INSTANCE_ID launched; waiting for running state"
+  aws ec2 wait instance-running --region "$REGION" --instance-ids "$INSTANCE_ID"
+fi
+
+EIP_ALLOC=""
+if [[ -f "$STATE/eip_alloc" ]]; then
+  EIP_ALLOC="$(cat "$STATE/eip_alloc")"
+  aws ec2 describe-addresses --region "$REGION" --allocation-ids "$EIP_ALLOC" >/dev/null 2>&1 || EIP_ALLOC=""
+fi
+if [[ -z "$EIP_ALLOC" ]]; then
+  log "allocating Elastic IP"
+  EIP_ALLOC=$(aws ec2 allocate-address --region "$REGION" --domain vpc \
+    --query 'AllocationId' --output text)
+  echo "$EIP_ALLOC" > "$STATE/eip_alloc"
+fi
+aws ec2 associate-address --region "$REGION" \
+  --instance-id "$INSTANCE_ID" --allocation-id "$EIP_ALLOC" >/dev/null
+
+PUBLIC_IP=$(aws ec2 describe-addresses --region "$REGION" \
+  --allocation-ids "$EIP_ALLOC" --query 'Addresses[0].PublicIp' --output text)
+
+# Hostname for HTTPS via Let's Encrypt — nip.io resolves <dashed-ip>.nip.io → <ip>
+HOSTNAME="${PUBLIC_IP//./-}.nip.io"
+
+echo "$PUBLIC_IP" > "$STATE/public_ip"
+echo "$SG_ID"    > "$STATE/sg_id"
+echo "$HOSTNAME" > "$STATE/hostname"
+
+log "public IP: $PUBLIC_IP"
+log "hostname:  $HOSTNAME"
+log "provision complete."
diff --git a/scripts/deploy/ssh.sh b/scripts/deploy/ssh.sh
new file mode 100755
index 00000000..c19cbce5
--- /dev/null
+++ b/scripts/deploy/ssh.sh
@@ -0,0 +1,9 @@
+#!/usr/bin/env bash
+set -euo pipefail
+HERE="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+STATE="$HERE/.state"
+NAME="${NAME:-beever-atlas}"
+PUBLIC_IP="$(cat "$STATE/public_ip")"
+KEY="$STATE/${NAME}-key.pem"
+exec ssh -i "$KEY" -o StrictHostKeyChecking=no -o UserKnownHostsFile=/dev/null \
+  ubuntu@"$PUBLIC_IP" "$@"
diff --git a/scripts/deploy/start.sh b/scripts/deploy/start.sh
new file mode 100755
index 00000000..a193f618
--- /dev/null
+++ b/scripts/deploy/start.sh
@@ -0,0 +1,10 @@
+#!/usr/bin/env bash
+# Start a previously stopped instance. EIP is retained so URL stays the same.
+set -euo pipefail
+HERE="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+REGION="${AWS_REGION:-us-east-2}"
+ID="$(cat "$HERE/.state/instance_id")"
+aws ec2 start-instances --region "$REGION" --instance-ids "$ID" >/dev/null
+aws ec2 wait instance-running --region "$REGION" --instance-ids "$ID"
+IP="$(cat "$HERE/.state/public_ip")"
+echo "[start] running. URL: http://$IP/"
diff --git a/scripts/deploy/stop.sh b/scripts/deploy/stop.sh
new file mode 100755
index 00000000..bf3eddf6
--- /dev/null
+++ b/scripts/deploy/stop.sh
@@ -0,0 +1,10 @@
+#!/usr/bin/env bash
+# Stop the EC2 instance (keeps disk + EIP; pay only storage ~$5/mo).
+set -euo pipefail
+HERE="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+REGION="${AWS_REGION:-us-east-2}"
+ID="$(cat "$HERE/.state/instance_id")"
+aws ec2 stop-instances --region "$REGION" --instance-ids "$ID" >/dev/null
+echo "[stop] stopping $ID — billing paused for compute"
+aws ec2 wait instance-stopped --region "$REGION" --instance-ids "$ID"
+echo "[stop] stopped."
diff --git a/web/src/components/settings/__tests__/AgentModelsTab.test.tsx b/web/src/components/settings/__tests__/AgentModelsTab.test.tsx
index 4fb517fb..30cf1fb4 100644
--- a/web/src/components/settings/__tests__/AgentModelsTab.test.tsx
+++ b/web/src/components/settings/__tests__/AgentModelsTab.test.tsx
@@ -188,12 +188,18 @@ describe("AgentModelsTab", () => {
     await waitFor(() => expect(screen.getByText("Gemini balanced")).toBeTruthy());
     fireEvent.click(screen.getByText("Gemini balanced"));
 
-    // CI runners are slower than local — the toast lands after the POST
-    // returns + a setState flush. The inner waitFor needs 5000ms headroom,
-    // so the outer test budget must exceed it (default vitest is 5000ms).
+    // CI runners are slower than local — and the toast self-dismisses after
+    // INFO_TTL_MS=2500ms (useToast), so a string-text waitFor can race the
+    // auto-dismiss on a slow runner. Query by role="status" + match
+    // textContent — tolerates whitespace/em-dash variation, polls quickly,
+    // and survives the dismiss-flicker.
     await waitFor(
-      () => expect(screen.getByText(/Applied 'Gemini balanced' — 1 updated/)).toBeTruthy(),
-      { timeout: 5000 },
+      () => {
+        const status = screen.queryByRole("status");
+        expect(status).not.toBeNull();
+        expect(status!.textContent ?? "").toMatch(/Applied 'Gemini balanced'.*1 updated/);
+      },
+      { timeout: 5000, interval: 50 },
     );
   }, 15000);
 
diff --git a/web/src/pages/ChannelWorkspace.tsx b/web/src/pages/ChannelWorkspace.tsx
index c484de5e..c8485a08 100644
--- a/web/src/pages/ChannelWorkspace.tsx
+++ b/web/src/pages/ChannelWorkspace.tsx
@@ -1,4 +1,4 @@
-import { useEffect, useState, type ComponentType } from "react";
+import { useCallback, useEffect, useMemo, useState, type ComponentType } from "react";
 import { useParams, Outlet, useNavigate, useLocation, Link, Navigate } from "react-router-dom";
 import { api } from "@/lib/api";
 import {
@@ -11,6 +11,7 @@ import {
   FileText,
   History,
   Settings,
+  X,
 } from "lucide-react";
 import { cn } from "@/lib/utils";
 import { useConnectionMap } from "@/hooks/useConnectionMap";
@@ -297,6 +298,53 @@ export function ChannelWorkspace() {
   const syncCompletedWithNoNew =
     syncState.state === "idle" && !!syncState.job_id && (syncState.total_messages ?? 0) === 0;
 
+  // Stale-failure dismiss UX. After a sync fails, the backend keeps
+  // returning the error state on `/sync/status` until a newer sync
+  // succeeds — leaving the red banner visible forever for channels
+  // where the user doesn't want to retry. We let the user dismiss the
+  // current failure, persisted per-channel in localStorage. The
+  // signature is `{job_id}|{first 200 chars of message}` so:
+  //   * A NEW failure (different job_id) shows the banner again.
+  //   * A re-render with the SAME failure stays dismissed.
+  // Cooldown messages are intentionally NOT dismissable — they're
+  // time-bounded and informative, not noise.
+  const failureSignature = useMemo(() => {
+    if (!syncFailureMessage) return null;
+    return `${syncState.job_id ?? "?"}|${syncFailureMessage.slice(0, 200)}`;
+  }, [syncFailureMessage, syncState.job_id]);
+  const dismissStorageKey = id ? `beever.sync-failure-dismissed.${id}` : null;
+  const [dismissedFailureSig, setDismissedFailureSig] = useState<string | null>(() => {
+    if (typeof window === "undefined" || !dismissStorageKey) return null;
+    try {
+      return window.localStorage.getItem(dismissStorageKey);
+    } catch {
+      return null;
+    }
+  });
+  // If the user navigates between channels, re-hydrate from storage so
+  // each channel's dismissal state is correct.
+  useEffect(() => {
+    if (typeof window === "undefined" || !dismissStorageKey) return;
+    try {
+      setDismissedFailureSig(window.localStorage.getItem(dismissStorageKey));
+    } catch {
+      setDismissedFailureSig(null);
+    }
+  }, [dismissStorageKey]);
+  const failureDismissed =
+    !isCoolingDown &&
+    failureSignature != null &&
+    failureSignature === dismissedFailureSig;
+  const dismissFailureBanner = useCallback(() => {
+    if (!failureSignature || !dismissStorageKey) return;
+    setDismissedFailureSig(failureSignature);
+    try {
+      window.localStorage.setItem(dismissStorageKey, failureSignature);
+    } catch {
+      /* localStorage quota — fine, dismissal is in-memory only */
+    }
+  }, [failureSignature, dismissStorageKey]);
+
   function handleRefreshStatus() {
     if (!id) return;
     setRefreshing(true);
@@ -397,16 +445,33 @@ export function ChannelWorkspace() {
           </div>
           {isMember && (
             <>
-              {displayFailureMessage && (
+              {displayFailureMessage && !failureDismissed && (
                 <div
                   className={cn(
-                    "rounded-lg border px-3 py-2 text-xs",
+                    "rounded-lg border px-3 py-2 text-xs flex items-start gap-2",
                     isCoolingDown
                       ? "border-amber-200 dark:border-amber-900 bg-amber-50 dark:bg-amber-950/30 text-amber-700 dark:text-amber-300"
                       : "border-rose-200 dark:border-rose-900 bg-rose-50 dark:bg-rose-950/30 text-rose-700 dark:text-rose-300",
                   )}
                 >
-                  {isCoolingDown ? displayFailureMessage : `Sync failed: ${displayFailureMessage}`}
+                  <span className="flex-1 min-w-0 break-words">
+                    {isCoolingDown ? displayFailureMessage : `Sync failed: ${displayFailureMessage}`}
+                  </span>
+                  {/* Dismiss is only offered for the failure banner — cooldown
+                      timers are time-bounded informational state and shouldn't
+                      hide. Per-channel localStorage means a NEW failure (new
+                      job_id) brings the banner back. */}
+                  {!isCoolingDown && failureSignature && (
+                    <button
+                      type="button"
+                      onClick={dismissFailureBanner}
+                      aria-label="Dismiss sync failure"
+                      title="Dismiss this failure (will reappear if a newer sync also fails)"
+                      className="shrink-0 -mr-1 -mt-0.5 p-0.5 rounded hover:bg-rose-200/40 dark:hover:bg-rose-900/40 transition-colors"
+                    >
+                      <X size={14} />
+                    </button>
+                  )}
                 </div>
               )}
               {syncCompletedWithNoNew && (

From e8ea324e0600257f337eb6409819c430642db3d8 Mon Sep 17 00:00:00 2001
From: "dependabot[bot]" <49699333+dependabot[bot]@users.noreply.github.com>
Date: Mon, 18 May 2026 20:16:36 +0000
Subject: [PATCH 2/2] chore(deps): bump brace-expansion from 5.0.5 to 5.0.6 in
 /web

Bumps [brace-expansion](https://github.com/juliangruber/brace-expansion) from 5.0.5 to 5.0.6.
- [Release notes](https://github.com/juliangruber/brace-expansion/releases)
- [Commits](https://github.com/juliangruber/brace-expansion/compare/v5.0.5...v5.0.6)

---
updated-dependencies:
- dependency-name: brace-expansion
  dependency-version: 5.0.6
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
---
 web/package-lock.json | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/web/package-lock.json b/web/package-lock.json
index 5f701788..20077110 100644
--- a/web/package-lock.json
+++ b/web/package-lock.json
@@ -3470,9 +3470,9 @@
       }
     },
     "node_modules/brace-expansion": {
-      "version": "5.0.5",
-      "resolved": "https://registry.npmjs.org/brace-expansion/-/brace-expansion-5.0.5.tgz",
-      "integrity": "sha512-VZznLgtwhn+Mact9tfiwx64fA9erHH/MCXEUfB/0bX/6Fz6ny5EGTXYltMocqg4xFAQZtnO3DHWWXi8RiuN7cQ==",
+      "version": "5.0.6",
+      "resolved": "https://registry.npmjs.org/brace-expansion/-/brace-expansion-5.0.6.tgz",
+      "integrity": "sha512-kLpxurY4Z4r9sgMsyG0Z9uzsBlgiU/EFKhj/h91/8yHu0edo7XuixOIH3VcJ8kkxs6/jPzoI6U9Vj3WqbMQ94g==",
       "license": "MIT",
       "dependencies": {
         "balanced-match": "^4.0.2"