WORKBOOK v4 — 24/7 Autonomous Dev Fleet (F0–F6)#28
Open
CTlanston wants to merge 1 commit into
Open
Conversation
Next-stage workbook (operator chose to go straight to the 24/7 fleet). Supersedes WORKBOOK_v3.1/PR #27 — v3.1's gap-closure becomes v4 F0. Pushes the system from a single-operator, human-in-the-loop, one-mission cockpit to a continuous, concurrent, multi-agent autonomous dev team. F0 first nails the two audit-verified gaps (real Gemini verdict, real E2E) because an auto-merging fleet on an unexercised Gemini gate is dangerous, not autonomous. F1-F6 then revive the parked packages (agent-mesh/cli-robust/moves/interrupt-bus/sentinel/chaos), each ADR-gated, to add autonomous intake, concurrency, crash recovery, risk-gated auto-merge, fleet observability, and a continuous soak. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What
Adds
WORKBOOK_v4.md— the next-stage execution plan for Codex: take the system from a single-operator, human-in-the-loop, one-mission cockpit to a continuous, concurrent, multi-agent 24/7 autonomous dev team (Anthropic-internal style).Supersedes #27 / WORKBOOK_v3.1 — v3.1's gap-closure (real Gemini, real E2E) is absorbed as v4 F0. Recommend closing #27.
Why F0 comes first (engineering honesty)
The operator chose to go straight to the fleet. But an independent audit of
a7d400fverified two open gaps: the isolated Gemini hard gate has only ever been mock-tested (a real Gemini has never judged a real diff), and the P7 E2E ran on a/tmpsandbox with no real Gemini verdict. An auto-merging 24/7 fleet built on an unexercised merge gate is dangerous, not autonomous — so F0 nails those gaps before any fleet behavior is added.Stages (F0–F6, each = one commit + one PR, actually pushed; each revived package is ADR-gated)
hermus-agent), and an operator-absent fail-closed merge test. (absorbs v3.1 H1–H2)roadmap-agent→ approval → mission queue; one mission runs hands-off; sub-95% clarification escalates to phone + HOLDs that mission without blocking the fleet.agent-mesh(fan-out/fan-in) +cli-robust(session pool + quota oracle); ≥3 concurrent missions across repos; quota exhaustion → HOLD, never a paid-API fallback.moves/saga;kill -9mid-mission → resume exactly-once; idempotent side effects.interrupt-bus.sentinelfor tool-call interception.chaos: a real soak window proving missions complete, holds auto-resolve, 0 safety violations, recovery from fault injection.Standing safety rules (new fleet ground rules GF1–GF5)
Unattended merge is fail-closed (Gemini PASS ∧ risk≤threshold ∧
allow_remote_writes∧repo.enabled∧ no forbidden path); reviving any parked package requires an ADR; concurrency/budget caps with quota-HOLD (never paid-API fallback); idempotent + resumable (exactly-once merge); gates route to phone, no self-approve, one mission's HOLD never blocks the fleet. Inherits all WORKBOOK_v3/v3.1 ground rules.See
WORKBOOK_v4.md§3 for full per-stage L1 acceptance.🤖 Generated with Claude Code