GitHub - aws-samples/sample-multi-tenancy-openclaw-on-eks

Multi-tenancy OpenClaw on EKS

Background

OpenClaw is an open-source AI agent framework designed for single-user operation — one instance serves one user. This works well for individual use, but presents a challenge when you want to offer OpenClaw as a service to many users simultaneously.

What This Project Does

This sample project provides the orchestration layer that turns single-user OpenClaw instances into a multi-tenant service on Amazon EKS. Rather than modifying OpenClaw itself, it manages the full lifecycle of many isolated OpenClaw instances — provisioning, routing, state persistence, and teardown — so each tenant gets a dedicated agent with VM-level isolation via Kata Containers.

Note: This is a reference implementation for learning and experimentation. It uses Telegram as the sole messaging channel for demonstration purposes. Production deployments would need to adapt the webhook routing and channel integration to your specific requirements.

Architecture

flowchart TB
    TG["☁ Telegram"] -->|webhook| CF["CloudFront"]
    CF --> ALB["Internal ALB"]

    subgraph CP ["Control Plane"]
        ALB --> R["Router"]
        R -->|cache hit| Pod
        R -->|cache miss| O["Orchestrator"]
        R <--> Redis[("Redis")]
        O <--> DDB[("DynamoDB")]
    end

    O -->|warm pool| Pod
    O -->|cold start| KP["Karpenter"] --> Pod

    subgraph DP ["Data Plane · Bare Metal Nodes"]
        Pod["🧠 Kata VM Pod\n(OpenClaw Agent)"]
    end

    Pod <-->|state sync| S3[("S3")]

How It Works

Webhook arrives — Telegram sends a message to CloudFront → Internal ALB → Router
Routing — Router checks Redis for the tenant's pod IP. On cache miss, it calls the Orchestrator to wake the tenant
Pod lifecycle — Orchestrator creates a Kata VM pod (or claims one from the warm pool), restores state from S3, and starts the OpenClaw agent
Isolation — Each tenant runs in a separate microVM (Kata Containers), with VPC CNI NetworkPolicy enforcing network boundaries and S3 ABAC restricting data access
Idle management — After a configurable timeout, the Informer-based reconciler tears down idle pods, syncing state to S3 before termination

Key Features

VM-level tenant isolation — Kata Containers microVM per tenant on bare metal nodes
Network isolation — VPC CNI NetworkPolicy (eBPF) enforces cross-tenant boundaries
Data isolation — EKS Pod Identity ABAC session tags restrict S3 access per tenant
Event-driven reconciler — K8s Informer detects pod failures in ~1s
Warm pool — Pre-provisioned pods with PriorityClass preemption for faster cold start
Graceful persistence — PreStop hook guarantees final S3 state sync
CLI tooling — otm CLI for tenant CRUD, config management, log streaming

Components

Component	Role
Router	Webhook ingress, async pod wake + message forwarding
Orchestrator	Tenant lifecycle, DynamoDB registry, Informer reconciler, warm pool
Redis	Pod IP cache, distributed lock, shared config
DynamoDB	Tenant registry with GSI for status queries
S3	Per-tenant state persistence
Karpenter	Bare metal node autoscaling for Kata workloads

Documentation

Document	Description
Architecture	System design, Mermaid diagrams, decision rationale
Setup Guide	End-to-end deployment with real-world lessons learned
Operations	Day-to-day operations, `otm` CLI reference, troubleshooting
Configuration	Environment variables and config options
Cold Start Analysis	Kata vs runc, cold vs warm benchmark
Kata Containers	Resource overhead and pod density analysis
Network Policy	VPC CNI + Kata compatibility validation

Getting Started

See the Setup Guide for step-by-step deployment instructions.

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
cmd		cmd
deploy		deploy
docs		docs
internal		internal
scripts		scripts
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile.openclaw		Dockerfile.openclaw
Dockerfile.orchestrator		Dockerfile.orchestrator
Dockerfile.router		Dockerfile.router
LICENSE		LICENSE
Makefile		Makefile
NOTICE		NOTICE
README.md		README.md
docker-compose.yml		docker-compose.yml
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-tenancy OpenClaw on EKS

Background

What This Project Does

Architecture

How It Works

Key Features

Components

Documentation

Getting Started

Security

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Multi-tenancy OpenClaw on EKS

Background

What This Project Does

Architecture

How It Works

Key Features

Components

Documentation

Getting Started

Security

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

Packages