test(opencode): stabilize runner cancel tests by Astro-Han · Pull Request #876 · Astro-Han/pawwork

Astro-Han · 2026-05-23T14:19:39Z

Summary

Add bounded runner-state and Deferred synchronization helpers for runner tests.
Replace fixed sleep / yieldNow synchronization in cancel, shell, and queued-caller tests with explicit state and waiter handshakes.
Wait for cancellation tests' blocking work fiber to start before calling cancel, closing the CI-only timeouts exposed by follow-up runs.
Narrow the Deferred runtime waiter-count check to only the shared-run caller tests that have no public Effect hook for "second caller is waiting on the existing run".
No related issue; this is a CI flake follow-up from recent post-merge failures.

Why

Recent post-merge CI runs timed out in Runner > cancel with onInterrupt resolves callers gracefully. The original test forked ensureRunning(...), slept for 10ms, then cancelled. On a slow runner, cancel can run before the runner has entered Running, making cancel a no-op and leaving the Effect.never waiter to time out.

Review follow-up also pointed out that two queued-caller tests still used Effect.yieldNow as a scheduling hint. Those now wait for the shared run Deferred to have both waiters attached before the test advances.

Later PR CI runs exposed the same class of race one layer deeper. runner.state === "Running" proves the runner state was installed, but it does not prove the work fiber has actually started. runner.state === "Shell" has the same boundary for shell masking assertions: it proves shell state was installed, but not that the shell effect and interrupt finalizers have started. The affected tests now use a local blocked-work helper with a public Deferred start handshake before cancelling.

The remaining Deferred runtime waiter-count check is intentionally private to this test file and only used for the two shared-run caller assertions. Effect does not expose a public signal for "this second caller is now awaiting the same run Deferred", and replacing that proof with sleep/yield would weaken the tests again.

Related Issue

None.

Human Review Status

Pending

Review Focus

Please check that the test helpers stay local to runner test synchronization and that the queued-caller, cancel, and shell masking assertions now wait for the relevant caller/work fiber to attach before cancel/release.

Risk Notes

No product behavior risk; this is test-only. Skipped conditional checklist items: visible UI or copy check because no visible UI or copy changed; platform/packaging impact because no runtime platform surface changed; docs/release/dependencies/permissions/generated-content checks because none of those surfaces were touched.

How To Verify

RED proof: bun --cwd packages/opencode -e '...' confirmed cancel-before-running leaves the waiter timing out.
Focused runner tests: cd packages/opencode && bun test test/effect/runner.test.ts --timeout 30000 -> 30 pass, 0 fail.
Focused runner repeat with CI Bun: for i in {1..100}; do /tmp/pawwork-bun-1.3.13/bun-darwin-aarch64/bun test test/effect/runner.test.ts --timeout 30000; done -> 100/100 runs passed.
Typecheck: cd packages/opencode && bun run typecheck -> passed.
Opencode CI subset with CI Bun: PATH=/tmp/pawwork-bun-1.3.13/bun-darwin-aarch64:$PATH bun turbo test:ci --filter=opencode -> 3071 pass, 9 skip, 1 todo, 0 fail.
Diff check: git diff --check -> no whitespace errors.

Screenshots or Recordings

Not applicable; no visible UI changes.

Checklist

How to use this checklist:

Tick a box by replacing [ ] with [x]. Do not edit, add, or remove items.

The bot-applied label items can only be honestly ticked AFTER the PR is opened and the labeler / priority-triage bots have run — return to the PR description and tick them then.

Most items are required. The few that are conditional are explicitly marked (conditional); for those, leave unticked if they truly do not apply and explain why in Risk Notes. All other items must be ticked before requesting human review.

coderabbitai · 2026-05-23T14:19:45Z

Warning

Review limit reached

@Astro-Han, we couldn't start this review because you've used your available PR reviews for now.

Your plan currently allows 1 review/hour. Refill in 25 minutes and 6 seconds.

Your organization has run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After more review capacity refills, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than trial, open-source, and free plans. In all cases, review capacity refills continuously over time.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 9318ced1-c24e-4b80-8c11-071284b4cbcd

📥 Commits

Reviewing files that changed from the base of the PR and between d9d052e and c9fd387.

📒 Files selected for processing (1)

packages/opencode/test/effect/runner.test.ts

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch codex/fix-runner-cancel-test-race

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions

Suggested priority: P3 (only low-risk paths changed (packages/opencode/test/effect/runner.test.ts)).

P1/P0 are reserved for maintainer confirmation. Please relabel manually if this is a release blocker, security issue, data-loss risk, or updater/runtime failure.

gemini-code-assist

Code Review

This pull request improves the reliability of the Runner tests by replacing fixed sleep durations with a polling helper function, waitForRunnerState, which waits for the runner to reach a specific state. Feedback suggests refactoring this helper to use idiomatic Effect services, specifically replacing Date.now() with Effect.currentTimeMillis and using Effect.dieMessage for timeout errors to ensure consistency and better testability.

test(opencode): stabilize runner cancel tests

d36644f

Astro-Han added bug Something isn't working flaky-test Non-deterministic test failure P3 Low priority harness Model harness, prompts, tool descriptions, and session mechanics labels May 23, 2026

github-actions Bot reviewed May 23, 2026

View reviewed changes

gemini-code-assist Bot reviewed May 23, 2026

View reviewed changes

Comment thread packages/opencode/test/effect/runner.test.ts Outdated

Astro-Han added 5 commits May 23, 2026 22:57

test(opencode): tighten runner queue synchronization

fe6ee9f

test(opencode): wait for runner work before cancel

a16438d

test(opencode): narrow runner waiter introspection

8c05337

test(opencode): wait for active run before shell rejection

ac398cf

test(opencode): wait for shell work before masking assertions

c9fd387

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(opencode): stabilize runner cancel tests#876

test(opencode): stabilize runner cancel tests#876
Astro-Han wants to merge 6 commits into
devfrom
codex/fix-runner-cancel-test-race

Astro-Han commented May 23, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot commented May 23, 2026 •

edited

Loading

Review limit reached

Uh oh!

github-actions Bot left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Astro-Han commented May 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Related Issue

Human Review Status

Review Focus

Risk Notes

How To Verify

Screenshots or Recordings

Checklist

Uh oh!

coderabbitai Bot commented May 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Astro-Han commented May 23, 2026 •

edited

Loading

coderabbitai Bot commented May 23, 2026 •

edited

Loading