k6-fargate-api-loadtest

Run repeatable k6 API load tests from AWS Fargate (not your laptop). Each run is an ephemeral Fargate task; results are uploaded to S3 and logs go to CloudWatch.

What this is (and is not)

This is:

A repeatable way to generate load from a consistent AWS environment.
A learning-friendly reference for ECS/Fargate + k6 + Terraform.

This is not:

A full load-testing platform (no UI, no scheduler, no long-running control plane).
A production "performance testing service".

Why this exists (vs running k6 locally)

Running load tests from a laptop is often misleading:

ISP / Wi‑Fi variability adds noise
Local CPU throttling affects throughput
NAT / routing changes impact latency
Harder to reproduce across teammates

This repo provisions a small, consistent load generator in AWS:

k6 runs inside a Fargate task
Results uploaded to S3 (summary.json)
Logs stored in CloudWatch Logs
Everything triggered via simple Python tools (no long-running services)

Architecture (what runs where)

On your laptop

infra/terraform/ provisions AWS resources (Terraform)
tools/build_push.py builds + pushes the Docker image to ECR
tools/run_task.py starts a Fargate task, prints RUN_ID, optional --tail

In AWS

ECR: container registry
ECS/Fargate: executes k6
CloudWatch Logs: container logs
S3: stores summary.json

Why run k6 on Fargate

Running load tests from laptops or shared CI agents often introduces noise: limited CPU, background processes, VPN routing, and network throttling.

Using Fargate provides an isolated and reproducible execution environment with predictable CPU, memory, and networking, making results easier to compare across runs.

Operating Model

What “scaling” means here

One ECS Fargate task = one k6 load generator.

If you need more load than a single task can produce, scale by sharding across multiple tasks (multiple independent runs), not by endlessly increasing VUs inside one task. This keeps runs simpler, more reproducible, and avoids a single generator becoming the bottleneck.

When the generator becomes the bottleneck

If throughput plateaus or latency inflates, it’s often the load generator, not the target system. Common causes:

CPU limits (TLS, request generation, JS execution)
Network throughput limits
Connection churn / poor keep-alive behavior

Rule of thumb:

Increase task size (vCPU/memory) to remove local bottlenecks
If you still can’t reach target rate, shard across tasks

Rate vs concurrency (common source of confusion)

k6 has two separate “levers”:

Arrival rate / pacing (how many iterations/requests per second you try to generate)
Concurrency (VUs) (how many parallel workers can execute requests at the same time)

If the target system responds quickly but the achieved request rate is below the configured arrival rate, increase VUs or task size. If the target is slow, raising VUs can increase in-flight requests and amplify tail latency — shard across multiple tasks instead of pushing a single generator to unrealistic VU levels.

Reproducibility expectations

This repo is built for repeatable experiments:

consistent AWS execution environment
pinned container image tags for “baseline vs change”
results stored per RUN_ID in S3

For comparisons, keep the image tag stable as your baseline and change one variable at a time (target version, task size, VUs, script/thresholds).

Quick Start

Prerequisites

AWS credentials configured locally (via SSO, ~/.aws/credentials, env vars, etc.)
Terraform >= 1.5
AWS CLI v2 (used for aws ecr get-login-password)
Docker + buildx
Python 3.11+ (venv recommended)

Notes:

Default region is eu-west-1 (Terraform var.region, tools default to eu-west-1 unless AWS_REGION/AWS_DEFAULT_REGION is set).
This repo intentionally defaults to public subnets + public IP for cost/simplicity.

1) Create virtual environment

python3 -m venv .venv
source .venv/bin/activate
pip install -r tools/requirements.txt

# Optional (contributors): linting
pip install -r tools/requirements-dev.txt

Windows PowerShell:

python -m venv .venv
.venv\Scripts\Activate.ps1
pip install -r tools\requirements.txt

# Optional (contributors): linting
pip install -r tools\requirements-dev.txt

2) Provision infrastructure

cd infra/terraform
terraform init
terraform apply

Remote state (recommended for shared use): By default Terraform stores state locally in terraform.tfstate. If you lose that file you lose the ability to manage or destroy the infrastructure. A ready-to-use S3 + DynamoDB backend template is provided at infra/terraform/backend.tf.example — copy it to backend.tf, fill in your bucket/table names, and re-run terraform init to migrate.

3) Build & push image

python tools/build_push.py

By default this pushes a stable tag (defaults to IMAGE_TAG=stable) and also pushes an immutable build tag like build-YYYYMMDDHHMMSS.

4) Run tests

Create a local request config (recommended)

The committed files under loadtest/utils/ are intentionally safe templates (they point at example.com). For real targets, copy one to a local-only file (ignored by git):

cp loadtest/utils/request.json loadtest/utils/request.local.json

Then edit loadtest/utils/request.local.json to point at your target.

Start a run:

python tools/run_task.py \
  --vus 50 \
  --duration 1m \
  --warmup-vus 10 \
  --warmup-duration 15s \
  --request-file loadtest/utils/request.local.json \
  --tail

Threshold flags (optional): k6 defaults to failing (exit 99) if error rate > 1% or p95 latency > 1000ms. Results are uploaded even on a threshold breach. To adjust:

# Relax thresholds for a slow or degraded API
python tools/run_task.py ... --threshold-error-rate 0.05 --threshold-p95-ms 2000

# Disable thresholds entirely — always upload results regardless of performance
python tools/run_task.py ... --threshold-error-rate off --threshold-p95-ms off

If you add --fetch-and-append, the tool will automatically download the result and append it to the local run history when the task finishes (so you can skip step 3):

python tools/run_task.py \
  --vus 50 \
  --duration 1m \
  --warmup-vus 10 \
  --warmup-duration 15s \
  --sleep-ms 10 \
  --request-file loadtest/utils/request.local.json \
  --tail \
  --fetch-and-append

The tool prints a RUN_ID and the S3 location where results will be uploaded.

Download and/or extract the result:

If you ran with --fetch-and-append, skip this step.

tools/fetch_and_append.py is the "one command" path: it downloads the run (tools/fetch_run.py) and then extracts metrics (tools/extract_run_metrics.py) and appends them to test-results/runs.jsonl.

python tools/fetch_and_append.py <RUN_ID>
python tools/fetch_run.py <RUN_ID>
python tools/extract_run_metrics.py test-results/<RUN_ID>/summary.json

See Graphs of history runs:

python tools/plot_runs.py

Plots 5 panels (avg latency, p90, p95, throughput, error rate) from test-results/runs.jsonl. Useful flags:

Flag	Default	Description
`--runs <path>`	`test-results/runs.jsonl`	Path to the JSONL ledger
`--url <url>`	(all)	Filter to a specific target URL (exact match)
`--metrics avg,rps,p90,p95,err`	all	Show only a subset of panels
`--group-by none\|url\|scenario`	`none`	Split series by URL or scenario
`--save <file.png>`	(none)	Save chart to an image file
`--show`	(auto)	Force the interactive window even when `--save` is set

Examples:

# Save chart to a file
python tools/plot_runs.py --save report.png

# Show only latency panels, grouped by URL
python tools/plot_runs.py --metrics avg,p90,p95 --group-by url

# Filter to one endpoint and save
python tools/plot_runs.py --url https://api.example.com/endpoint --save endpoint.png

CI

A lightweight CI pipeline enforces Terraform formatting/validation and basic Python static checks.

It runs:

Terraform fmt + validate
Python compileall
Ruff (basic F checks)

Execution Flow

tools/run_task.py calls ECS RunTask
ECS launches a Fargate task with a command override: run /tests/scenarios/<scenario>.js
Container ENTRYPOINT executes entrypoint.sh
entrypoint.sh runs k6
k6 handleSummary() writes /tmp/summary.json
After k6 exits, entrypoint.sh uploads summary.json to S3 when k6 exits 0 (success) or 99 (threshold breach — run completed but thresholds failed). Other non-zero exits (script errors, bad args) skip the upload.
Task stops

Dockerfile CMD is only a fallback and normally overridden by ECS.

Results

Each run generates a unique RUN_ID.

S3 location:

s3://<results-bucket>/runs/<RUN_ID>/summary.json

Local downloads are stored under test-results/ by default (this folder is ignored and should not be committed).

Logs

CloudWatch Log Group:

/ecs/k6-fargate-loadtest

Streams:

run/<container>/<task_id>

Networking Model

Current setup:

Public subnets
assignPublicIp = ENABLED
Security group egress:
- HTTPS (TCP 443) to 0.0.0.0/0 — for the target API
- DNS (UDP 53 + TCP 53) to 0.0.0.0/0 — for hostname resolution (typically to the VPC resolver; 0.0.0.0/0 is used here for simplicity)

Chosen for cost efficiency (no NAT, no endpoints). DNS egress is required for k6 to resolve hostnames; without it, tasks fail with cryptic connection errors in environments with custom resolvers or tightened NACLs.

If your organization requires private networking, you can adapt this to private subnets + NAT or VPC endpoints (not the default here).

Image Tagging (reproducibility)

Terraform config uses var.image_tag (default stable) for the ECS task definition.

To run an immutable build tag without re-applying Terraform, tools/run_task.py supports:

python tools/run_task.py --image-tag build-YYYYMMDDHHMMSS --tail

This registers a one-off task definition revision for the run.

Troubleshooting

No logs while tailing: CloudWatch streams may appear after a few seconds.

Task failed: Check CloudWatch logs using printed stream name.

Many untagged ECR images: Normal with multi-arch builds. Lifecycle cleanup is asynchronous.

Security Notes

No inbound rules on task SG
Task role limited to s3:PutObject on results prefix
Target API auth can be passed via env vars if needed (TARGET_API_KEY / TARGET_BEARER_TOKEN)

Important:

ECS task overrides are logged in CloudTrail. Every RunTask call records all container override environment variables — including TARGET_API_KEY, TARGET_BEARER_TOKEN, and the full REQUEST_JSON payload (URL, headers, body) — in CloudTrail and in ecs:DescribeTasks responses. Any principal with CloudTrail read access or ecs:DescribeTasks can see these values.
Never put auth credentials in request.json headers. Use TARGET_API_KEY / TARGET_BEARER_TOKEN env vars exclusively for credentials. Be aware those are also visible in CloudTrail.
Avoid putting secrets inside request JSON files. Never commit secrets.
In shared AWS accounts, restrict CloudTrail read access and ecs:DescribeTasks permissions to limit exposure of task environment variables.

Cost & Cleanup

Costs come mainly from CloudWatch Logs, S3 storage, and ECR storage (plus Fargate runtime while a test runs).

Cleanup:

cd infra/terraform
terraform destroy

If you want to purge images immediately, delete them from ECR (the repo also has a lifecycle policy, but it's not instant).

Publishing checklist

Before making this repository public:

Ensure no *.tfstate files, .terraform/, .venv/, or test-results/ artifacts are committed.
Keep request templates sanitized; store real targets in loadtest/utils/*.local.json.
Confirm the results bucket policy enforces TLS-only access.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
docker		docker
docs/images		docs/images
infra/terraform		infra/terraform
loadtest		loadtest
tools		tools
uploader		uploader
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md

Folders and files

Latest commit

History

Repository files navigation

k6-fargate-api-loadtest

Contents

What this is (and is not)

Why this exists (vs running k6 locally)

Architecture (what runs where)

On your laptop

In AWS

Why run k6 on Fargate

Operating Model

What “scaling” means here

When the generator becomes the bottleneck

Rate vs concurrency (common source of confusion)

Reproducibility expectations

Quick Start

Prerequisites

1) Create virtual environment

2) Provision infrastructure

3) Build & push image

4) Run tests

CI

Execution Flow

Results

Logs

Networking Model

Image Tagging (reproducibility)

Troubleshooting

Security Notes

Cost & Cleanup

Publishing checklist

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages