AI Sidekicks

      o
     .-.
  .--┴-┴--.
  | O   O |   >> A collaborative agent operating system for software work.
  | ||||| |   >> Humans + agents, one session.
  '--___--'

AI Sidekicks is a desktop runtime where humans and AI agents share live sessions, co-edit code through proper git flow, and collaborate in real time. Think of it as a shared workspace where you can invite teammates, attach multiple AI agents from different providers, and steer their work as it happens — all inside one unified session.

Why AI Sidekicks

Today's AI coding tools are single-user, single-agent, and disconnected. You run one agent at a time, on your machine, against your checkout. There's no way to:

Invite a colleague into a live agent session so they can see what's happening and contribute
Run Claude and Codex side-by-side on the same task with coordinated git flow
Pause an agent mid-run, steer its direction, then resume — without losing state
Get real approval gates before agents install packages, run migrations, or push code
See a unified timeline of every message, tool call, diff, and decision across all agents

AI Sidekicks exists to solve these problems. It treats the session — not the agent — as the first-class primitive, and builds real collaboration into the runtime from day one.

Core Concept

The first-class object is not agent. It is session.

A session contains participants, agents, runs, channels, repo mounts, approvals, artifacts, invites, and presence. "Two agents talking," "one user chatting with one agent," and "workflow orchestration" are all different views over the same session and event model.

                    ┌──────────────────────────────────┐
                    │           SESSION                │
                    │                                  │
                    │   Participants  ←  Humans        │
                    │   Agents       ←  Claude, Codex  │
                    │   Channels     ←  Chat, Workflow │
                    │   Runs         ←  Active work    │
                    │   Repo Mounts  ←  Git repos      │
                    │   Approvals    ←  Safety gates   │
                    │   Artifacts    ←  Diffs, files   │
                    │   Events       ←  Full timeline  │
                    │                                  │
                    └──────────────────────────────────┘

A participant can join a live session, chat directly in it, and attach one or more agents from their own local machine. People and machines are both first-class participants.

Key Features

Multi-User, Multi-Agent Sessions

Start a session, invite a teammate via shareable link, and both bring your own agents. Everyone sees the same live timeline. Agents from different participants run on their respective machines while sharing session state.

Queue, Steer, Pause, Resume

Real runtime control — not UI illusions. The queue is daemon-backed. Steer is modeled as an intervention against an active run. Pause is a runtime state with persisted context. Resume continues from where the agent left off.

Approval Gates

9 categories of approval gates (tool execution, file write, network access, destructive git, user input, plan approval, MCP elicitation, workflow gate, and human phase contribution) ensure agents never take unsupervised action on anything that matters. Approve, deny, or set remembered rules.

Worktree-First Git Flow

Every coding run binds to a repo mount and execution mode: read-only, branch, worktree, or ephemeral clone. The default is worktree — agents work on isolated branches, produce attributed diffs, and prepare PRs without touching your main checkout.

Live Timeline and Visibility

A unified event stream shows every message, tool call, approval, diff, state transition, and handoff across all agents and participants. Replay any session from its event log.

Provider Drivers

AI agents run behind explicit driver adapters — claude-driver and codex-driver ship in V1. The product is not a wrapper around a single provider CLI; it's a runtime that normalizes agent behavior across providers.

Local-First with Collaboration

Agent execution stays on your machine. The collaboration control plane handles auth, invites, presence, relay, and shared metadata — it never executes code. Single-user mode works fully offline.

Architecture

┌──────────────────┐     ┌──────────────────┐
│   Desktop Shell  │     │       CLI        │
│   (Electron)     │     │   (sidekicks)    │
└────────┬─────────┘     └────────┬─────────┘
         │         Typed SDK      │
         └────────────┬───────────┘
                      │ IPC (Unix socket / named pipe)
              ┌───────┴────────┐
              │  Local Runtime │
              │    Daemon      │
              │                │
              │  ┌──────────┐  │
              │  │ Session  │  │
              │  │ Engine   │  │
              │  ├──────────┤  │
              │  │ Provider │  │    ┌────────────────────┐
              │  │ Drivers  │──┼───►│ Claude / Codex API │
              │  ├──────────┤  │    └────────────────────┘
              │  │   Git    │  │
              │  │ Engine   │  │
              │  ├──────────┤  │
              │  │ SQLite   │  │
              │  └──────────┘  │
              └───────┬────────┘
                      │ tRPC + WebSocket
           ┌──────────┴──────────┐
           │  Collaboration      │
           │  Control Plane      │
           │  (Auth, Invites,    │
           │   Presence, Relay)  │
           │                     │
           │  ┌──────────────┐   │
           │  │  Postgres    │   │
           │  └──────────────┘   │
           └─────────────────────┘

Desktop Shell — Electron main process. Thin layer for windowing, native dialogs, notifications, and daemon supervision.

CLI — First client delivery track. Proves the typed SDK and IPC contract before the desktop UI ships.

Local Runtime Daemon — Machine-local execution authority. Owns provider processes, git worktrees, terminal sessions, tool execution, and local persistence (SQLite).

Collaboration Control Plane — Hosted or self-hosted service for auth (PASETO v4 + WebAuthn), invites, presence (Yjs Awareness CRDT), relay (E2E encrypted), and shared metadata (Postgres).

CLI-First

The CLI (sidekicks) is the first client delivery track — it proves the typed SDK and IPC contract before the desktop UI ships.

Technology Stack

Layer	Technology
Language	TypeScript (daemon, CLI, desktop, contracts)
Desktop Shell	Electron
Desktop UI	React + Vite
Local Database	SQLite (WAL mode, 41 tables)
Shared Database	Postgres (18 tables)
Auth	PASETO v4 (access + refresh), WebAuthn, DPoP
Relay Encryption	X25519 + XChaCha20-Poly1305 (V1), MLS RFC 9420 (V2)
State Machines	XState v5
API Framework	tRPC v11
IPC	Unix socket (macOS/Linux), named pipe (Windows)
Validation	Zod
Authorization	Cedar (policy-based) per ADR-012
Presence	Yjs Awareness protocol
Observability	OpenTelemetry

V1 Scope

V1 ships 17 core features across CLI and Desktop GUI per ADR-015: V1 Feature Scope Definition.

#	Feature	Description
1	Session creation and join	Foundational session primitive
2	Mid-session invites	Shareable link with PASETO token
3	Membership roles	Owner, admin, collaborator, viewer
4	Runtime node attach	Participants contribute local compute
5	Single-agent runs	Claude and Codex via provider drivers
6	Queue, steer, pause, resume	Real runtime control and interventions
7	Approval gates	9 categories of human-in-the-loop safety
8	Repo attach	Bind sessions to git repositories
9	Worktree execution	Isolated branches per agent run
10	Session timeline	Unified event stream with replay
11	Local daemon + CLI	First client over the typed SDK
12	Presence	Online / idle / offline awareness
13	Event audit log	Event-sourced persistence backbone
14	Local artifacts	Diffs, files, and attachments
15	Desktop GUI	Electron shell + React/Vite renderer over the same typed SDK
16	Multi-agent channels	Cross-agent coordination primitives per Spec-016
17	Workflow authoring and execution	Full workflow engine per Spec-017

V1.1 additions: MLS relay E2EE, email invite delivery, cross-node shared artifacts, plus the criterion-gated workflow subfeatures named in ADR-015.

Build Order

Implementation follows the tiered dependency graph defined in docs/architecture/cross-plan-dependencies.md. V1 spans 27 implementation plans; Plan-017 Workflow Authoring remains in review while the other V1 plans are approved.

Tier 1  ► Plan-001  Shared Session Core
         Plan-024  Rust PTY Sidecar
Tier 2  ► Plan-002  Invite, Membership, Presence
Tier 3  ► Plan-003  Runtime Node Attach
Tier 4  ► Plan-005  Provider Driver Contract
         Plan-006  Event Taxonomy and Audit Log
         Plan-007  Local IPC and Daemon Control
Tier 5  ► Plan-004  Queue, Steer, Pause, Resume
         Plan-008  Control Plane Relay
         Plan-018  Identity and Participant State
         Plan-022  Data Retention and GDPR
         Plan-025  Self-Hostable Node Relay (crypto package only)
Tier 6  ► Plan-009  Repo Attachment and Workspace Binding
         Plan-010  Worktree Lifecycle
         Plan-012  Approvals and Permissions
         Plan-016  Multi-Agent Channels
         Plan-021  Rate Limiting Policy
Tier 7  ► Plan-011  Git Flow, PR, Diff Attribution
         Plan-014  Artifacts, Files, Attachments
         Plan-015  Persistence, Recovery, Replay
         Plan-025  Self-Hostable Node Relay (remaining steps)
Tier 8  ► Plan-013  Live Timeline and Visibility
         Plan-017  Workflow Authoring and Execution
         Plan-019  Notifications and Attention
         Plan-020  Observability and Failure Recovery
         Plan-023  Desktop Shell and Renderer
Tier 9  ► Plan-026  First-Run Three-Way-Choice Onboarding
         Plan-027  Cross-Node Dispatch and Approval

Each tier's prerequisites are the prior tier's completion. See docs/architecture/cross-plan-dependencies.md for the full dependency graph and table ownership map.

Project Status

Phase: Documentation validation remediation complete. Code implementation not yet started.

Current documentation corpus:

27 V1 implementation plans with step-by-step build instructions; 26 are approved and Plan-017 is in review
26 approved specifications covering every feature and cross-cutting concern, plus Spec-027 (Self-Host Secure Defaults) currently in draft
12 domain models (run state machine, intervention model, participant model, workflow model, etc.)
16 architecture documents (schemas, contracts, security, deployment, dependencies)
11 operations runbooks (CLI commands, SLOs, on-call routing, self-host secure defaults)
21 accepted ADRs recording key design decisions (ADR-013 reserved-skipped)

Documentation

Area	Path	Description
Vision	`docs/vision.md`	Product thesis and architectural position
Specs	`docs/specs/`	Feature specifications (001-027)
Plans	`docs/plans/`	Implementation plans (001-027)
Architecture	`docs/architecture/`	Schemas, contracts, security, deployment
Domain Models	`docs/domain/`	State machines, glossary, entity models
ADRs	`docs/decisions/`	Architectural decision records
Operations	`docs/operations/`	Runbooks, SLOs, on-call routing
V1 Scope	`docs/architecture/v1-feature-scope.md`	What ships in V1 vs V2
Build Order	`docs/architecture/cross-plan-dependencies.md`	9-tier implementation sequence
Contributing	`CONTRIBUTING.md`	Branch naming, commit format, PR workflow

License

AI Sidekicks is licensed under the Apache License, Version 2.0 — see ADR-020 for the deployment-model and license commitment.

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
.claude		.claude
.codex		.codex
assets/hero		assets/hero
docs		docs
.gitignore		.gitignore
.markdownlint-cli2.yaml		.markdownlint-cli2.yaml
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Sidekicks

Table of Contents

Why AI Sidekicks

Core Concept

Key Features

Multi-User, Multi-Agent Sessions

Queue, Steer, Pause, Resume

Approval Gates

Worktree-First Git Flow

Live Timeline and Visibility

Provider Drivers

Local-First with Collaboration

Architecture

CLI-First

Technology Stack

V1 Scope

Build Order

Project Status

Documentation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Sidekicks

Table of Contents

Why AI Sidekicks

Core Concept

Key Features

Multi-User, Multi-Agent Sessions

Queue, Steer, Pause, Resume

Approval Gates

Worktree-First Git Flow

Live Timeline and Visibility

Provider Drivers

Local-First with Collaboration

Architecture

CLI-First

Technology Stack

V1 Scope

Build Order

Project Status

Documentation

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages