docs(governance): rewrite Token budgets and limits page for ADP GA#6
Merged
docs(governance): rewrite Token budgets and limits page for ADP GA#6
Conversation
Replaces the 3-line placeholder at modules/governance/pages/budgets.adoc with a 5-section concept page aligned to the Governance V0 PRD V2 framing for cap-management. Page structure: * Intro (overview pattern, ~70 words) — what ADP records automatically and what cap-management arrives in a later release. * What ADP records automatically — spending event fields (input/output/ cached tokens, total_cost_microcents, request count, provider/model/ user/org context); microcent unit explainer (1 cent = 100 microcents, $1 = 10,000 microcents). * Where to view your spend — Governance dashboard, Transcripts, Breakdown queries via GetSpendingBreakdown; SpendingFilter dimension combinations. * Guardrail evaluator cost — same SpendingService stream as user-facing LLM calls, separate provider attribution. * Multi-tenant patterns at GA — viewing only (not cap-setting); per-org per-user breakdowns via SpendingFilter dimensions. * Coming in a later release — configurable caps, halt-vs-notify, per-agent caps, alert hooks, multi-tenant cap-setting. Verdict shifted from U to C: master plan source page ai-agents/ai-gateway/admin/setup-guide.adoc was deleted in cloud-docs#562; current adp-docs file is a placeholder being rewritten. Master plan §3 #6 was realigned to PRD V2 framing in master plan v20: original pre-beta items (per-agent caps, halt vs. notify, alerts, multi-tenant cap-setting) deferred to post-GA. No source spend_limit.proto or ratelimit.proto on cloudv2 main; AI Gateway RFC 0002 Phase 4 (Virtual Keys + budgets) is Not started; partial work was removed in Kafka-kvstore migration. SpendingService (read-only) is the GA surface. TODO markers track 5 of the 9 open design questions in the companion plan: persona vocabulary, default-on capture behavior, user_id/org_id population, guardrail-cost-stream confirmation, public-API exposure of organization_id, V2 surface forward link. Cross-links to xref:governance:dashboard/index.adoc point at the planned post-PR-#5 path; those resolve once Governance Dashboard PR #5 merges. No nav restructure — flat governance:budgets.adoc entry stays under Trust & Governance per master plan §2. Sub-folder split layered post-GA when V2 cap-management lands. Refs: DOC-2114 RFC: apps/aigw/docs/rfcs/0010-cost-reporting.md (cloudv2 main) Proto: proto/public/cloud/redpanda/api/adp/v1alpha1/spending_service.proto (cloudv2 main) Plan: https://redpandadata.atlassian.net/wiki/spaces/DOC/pages/1884717057 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
✅ Deploy Preview for redpanda-agentic-data-plane ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
7 tasks
micheleRP
added a commit
that referenced
this pull request
Apr 29, 2026
Beta deliverable (master Week 3, May 4-8): single concept page covering INPUT/OUTPUT phase model, the three evaluator types (PII / Toxicity / Custom webhook), violation framing, and attachment scope. Anchors on RFC 0002 Phase 5; resource-model details left as TODO markers pending team-ai's post-pivot answer. GA deliverables (master Weeks 4-6): four pages stubbed with full outlines and TODO markers keyed to specific Open Questions in the Confluence companion plan, so live walkthroughs on adp-production can resolve them in place: - create-guardrail.adoc — how-to for configuring + attaching - types-reference.adoc — config schema reference per evaluator - violations.adoc — read & troubleshoot violations - cost-tracking.adoc — per-evaluator cost shape + capping Nav: replaces the flat governance:guardrails.adoc placeholder under Trust & Governance with a Guardrails parent + 5 children. The 3-line governance/pages/guardrails.adoc placeholder is deleted in this commit. Cross-link to governance:dashboard/index.adoc is transient-broken until adp-docs PR #5 (Governance Dashboard) merges; same pattern as Budgets PR #6. Refs: DOC-2113. Plan: https://redpandadata.atlassian.net/wiki/spaces/DOC/pages/1881702438 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> # Conflicts: # modules/ROOT/nav.adoc
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Replaces the 3-line
governance/pages/budgets.adocplaceholder with a 5-section concept page aligned to the Governance V0 PRD's V2 framing for cap-management. Implements workflow #6 (DOC-2114) per the Token Budgets and Limits companion plan.Preview pages
What this PR does
modules/governance/pages/budgets.adocwith a 5-section concept page covering:total_cost_microcents, request count, provider / model / user / org context). Microcent unit explainer (1 cent = 100 microcents, $1 = 10,000 microcents) lifted from RFC 0010 + the SpendingService proto.dashboard/index.adoc), Transcripts (xrefobservability/transcripts.adoc), Breakdown queries viaGetSpendingBreakdown. SpendingFilter dimension combinations.SpendingFilter.organization_idanduser_id. Explicitly notes "see, not cap.":page-topic-type: overview. Personas:platform_admin, evaluator(with// TODO:flagging the persona-vocabulary mismatch with PRD names — Open Q D2).governance:budgets.adocentry stays under Trust & Governance per the master plan §2 nav. Sub-folder split layers post-GA when V2 cap-management lands.Status
Open questions / known issues
ai-agents/ai-gateway/admin/setup-guide.adoc) was deleted in cloud-docs#562 (merged 2026-04-28); current adp-docs file is the 3-line placeholder being rewritten. Effective verdict is greenfield create. Reflected in master plan v20.spend_limit.protoorratelimit.protoon cloudv2 main. AI Gateway RFC 0002 Phase 4 (Virtual Keys + budgets) is "Not started"; the partial work that landed was removed in a Kafka-kvstore migration. So at GA there is no user-facing cap-setting feature; the page's "Coming in a later release" section sketches V2 capabilities per the PRD without committing to specifics.governance:dashboard/index.adocfrom this PR. The Dashboard pages live in adp-docs PR #5 (also draft); the xrefs resolve once PR docs(governance): add Governance Dashboard pages for ADP GA #5 merges.user_id/organization_idpopulation (A2), guardrail-cost-stream confirmation (A3), public-API exposure oforganization_id(B1), and the V2 forward-link section (C1).Test plan
npm run build && npm run serve) renders the Trust & Governance > Token budgets and limits page; cross-links to Transcripts and Guardrails resolve. (Cross-link to Dashboard resolves once PR docs(governance): add Governance Dashboard pages for ADP GA #5 merges.)adp-productiononce spending data flows: walkGetSpendingBreakdownagainst each dimension (provider / model / user / organization / provider type) and confirm prose matches actual response shapes.// TODO:markers as eng confirms (default-on, OIDC claim mapping, guardrail-cost stream, public-APIorganization_idexposure).docs-team-standards:reviewon the page.adp-la.adocpartial include at GA.🤖 Generated with Claude Code