feat(orchestrator): improve concurrent benchmark tracing and enable huge pages by arkamar · Pull Request #2327 · e2b-dev/infra

arkamar · 2026-04-08T14:38:34Z

While investigating concurrent sandbox creation performance, I found
that traces were missing key information — there was no way to filter
spans by concurrency level, and the biggest bottleneck inside resume-fc
was invisible in traces. I also enabled huge pages by default in the
benchmark to match production.

To fix the tracing gaps, I added concurrency and sandbox.index
attributes to benchmark spans for filtering in Grafana, and two new
spans in resume-fc (wait-uffd-socket and wait-rootfs-path) that
make the parallel waits before snapshot loading visible.
wait-rootfs-path turned out to be the primary bottleneck, growing
proportionally with concurrency due to kernel-level serialization in
nbdnl.Connect().

…rk spans Add concurrency level, sandbox index, and sandbox ID attributes to the bench-resume span so traces can be filtered by concurrency level in Grafana/Tempo (e.g. {span.concurrency=5}).

Production uses huge pages, so the benchmark should too. Disable with DISABLE_HUGE_PAGES=true for comparison. Uses a separate build ID per mode to avoid cache collisions.

…ume-fc During concurrent sandbox creation, resume-fc blocks on several parallel waits before it can load the snapshot. These waits were previously invisible — only covered by point-in-time ReportEvent calls that do not capture duration. Adding duration spans makes them visible as bars in the Grafana waterfall view. This is important because wait-rootfs-path turned out to be the primary bottleneck, growing significantly as more sandboxes are created simultaneously.

cursor · 2026-04-08T14:38:43Z

PR Summary

Low Risk
Primarily adds OpenTelemetry spans/attributes and tweaks benchmark configuration; no production control flow changes beyond passing derived contexts into existing waits, so functional risk is low.

Overview
Improves observability of concurrent sandbox resume by adding per-goroutine bench-resume spans tagged with concurrency, sandbox.index, and sandbox.id, and by instrumenting resume-fc with new spans around waiting for the UFFD socket and resolving/symlinking the rootfs path. The concurrent resume benchmark now defaults to huge pages via a separate build ID, with an opt-out via DISABLE_HUGE_PAGES.

^{Reviewed by Cursor Bugbot for commit 621d261. Bugbot is set up for automated code reviews on this repo. Configure here.}

claude

(This was a test - please disregard)

linear · 2026-04-08T14:44:45Z

ENG-3728 Experiment: How many sandboxes can be spawned concurrently (yet effectively) on a single node

packages/orchestrator/benchmarks/concurrent_benchmark_test.go

packages/orchestrator/pkg/sandbox/fc/process.go

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f1d48b0bc1

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

packages/orchestrator/benchmarks/concurrent_benchmark_test.go

packages/orchestrator/pkg/sandbox/fc/process.go

Use strconv.ParseBool so that common boolean env values like 1, TRUE, or True are accepted, not just the exact string "true".

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b82e6e5f88

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

packages/orchestrator/benchmarks/concurrent_benchmark_test.go

packages/orchestrator/pkg/sandbox/fc/process.go

Co-authored-by: Jakub Novák <jakub@e2b.dev>

arkamar added 3 commits April 8, 2026 12:11

feat(orchestrator): add concurrency and sandbox attributes to benchma…

4236103

…rk spans Add concurrency level, sandbox index, and sandbox ID attributes to the bench-resume span so traces can be filtered by concurrency level in Grafana/Tempo (e.g. {span.concurrency=5}).

feat(orchestrator): enable huge pages by default in concurrent benchmark

f84e030

Production uses huge pages, so the benchmark should too. Disable with DISABLE_HUGE_PAGES=true for comparison. Uses a separate build ID per mode to avoid cache collisions.

e2b-request-same-site-reviewers bot assigned sitole Apr 8, 2026

claude bot reviewed Apr 8, 2026

View reviewed changes

packages/orchestrator/benchmarks/concurrent_benchmark_test.go Show resolved Hide resolved

packages/orchestrator/pkg/sandbox/fc/process.go Show resolved Hide resolved

arkamar marked this pull request as ready for review April 8, 2026 14:58

arkamar requested review from ValentaTomas, dobrac and jakubno as code owners April 8, 2026 14:58

chatgpt-codex-connector bot reviewed Apr 8, 2026

View reviewed changes

packages/orchestrator/benchmarks/concurrent_benchmark_test.go Outdated Show resolved Hide resolved

arkamar marked this pull request as draft April 8, 2026 15:03

claude bot reviewed Apr 8, 2026

View reviewed changes

packages/orchestrator/pkg/sandbox/fc/process.go Show resolved Hide resolved

fix(orchestrator): parse DISABLE_HUGE_PAGES as a boolean flag

b82e6e5

Use strconv.ParseBool so that common boolean env values like 1, TRUE, or True are accepted, not just the exact string "true".

arkamar marked this pull request as ready for review April 8, 2026 15:32

chatgpt-codex-connector bot reviewed Apr 8, 2026

View reviewed changes

packages/orchestrator/benchmarks/concurrent_benchmark_test.go Show resolved Hide resolved

claude bot reviewed Apr 8, 2026

View reviewed changes

packages/orchestrator/pkg/sandbox/fc/process.go Show resolved Hide resolved

jakubno assigned jakubno and unassigned sitole Apr 10, 2026

jakubno approved these changes Apr 10, 2026

View reviewed changes

packages/orchestrator/pkg/sandbox/fc/process.go Outdated Show resolved Hide resolved

Update packages/orchestrator/pkg/sandbox/fc/process.go

621d261

Co-authored-by: Jakub Novák <jakub@e2b.dev>

arkamar merged commit 49b8f16 into main Apr 10, 2026
81 of 82 checks passed

arkamar deleted the experiment/sbx-concurency branch April 10, 2026 09:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(orchestrator): improve concurrent benchmark tracing and enable huge pages#2327

feat(orchestrator): improve concurrent benchmark tracing and enable huge pages#2327
arkamar merged 5 commits intomainfrom
experiment/sbx-concurency

arkamar commented Apr 8, 2026

Uh oh!

cursor bot commented Apr 8, 2026 •

edited

Loading

Uh oh!

claude bot left a comment •

edited

Loading

Uh oh!

linear bot commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

arkamar commented Apr 8, 2026

Uh oh!

cursor bot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Summary

Uh oh!

claude bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

linear bot commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cursor bot commented Apr 8, 2026 •

edited

Loading

claude bot left a comment •

edited

Loading