Ask Agent to emit browser actions #64

Closed
giaphutran12 wants to merge 1 commit into codex/self-healing-stack-rollup from codex/producer-browser-actions

giaphutran12 (Collaborator) commented May 23, 2026

Summary

  • make the TinyFish Agent collection goal require agent_browser_actions alongside records
  • define the action contract for navigation/click/type/select/wait/extract steps with URL, selector/target text, safe value description, status, phase, and label
  • add a unit test so the producer-side browser-action contract cannot silently disappear
  • refresh docs/benchmark README to say browser replay evidence starts at the producer contract
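
To make the action contract in the summary concrete, here is a minimal sketch of what one entry of agent_browser_actions and a producer-side check might look like. The field names (kind, url, selector, targetText, safeValueDescription, status, phase, label), the validateBrowserActions helper, and the rule that click/type/select steps need a selector or target text are all assumptions for illustration; the actual schema lives in the backend code this PR touches and may differ.

```typescript
// Hypothetical contract sketch. Names are illustrative, not the repository's schema.
const ACTION_KINDS = ["navigation", "click", "type", "select", "wait", "extract"] as const;
type ActionKind = (typeof ACTION_KINDS)[number];

interface BrowserAction {
  kind: ActionKind;
  url: string;
  selector?: string;
  targetText?: string;
  // Describes a typed or selected value without exposing the raw value.
  safeValueDescription?: string;
  status: string;
  phase: string;
  label: string;
}

// Returns a list of human-readable problems; an empty list means the input passed.
function validateBrowserActions(input: unknown): string[] {
  if (!Array.isArray(input)) return ["agent_browser_actions must be an array"];
  const errors: string[] = [];
  input.forEach((raw: unknown, i: number) => {
    if (typeof raw !== "object" || raw === null) {
      errors.push(`action ${i}: must be an object`);
      return;
    }
    const action = raw as Record<string, unknown>;
    const kind = action["kind"];
    if (typeof kind !== "string" || !(ACTION_KINDS as readonly string[]).includes(kind)) {
      errors.push(`action ${i}: unknown kind`);
    }
    for (const key of ["url", "status", "phase", "label"]) {
      const value = action[key];
      if (typeof value !== "string" || value === "") {
        errors.push(`action ${i}: missing ${key}`);
      }
    }
    // Assumed rule: steps that act on an element need a selector or target text.
    const needsTarget = kind === "click" || kind === "type" || kind === "select";
    const hasTarget =
      typeof action["selector"] === "string" || typeof action["targetText"] === "string";
    if (needsTarget && !hasTarget) {
      errors.push(`action ${i}: needs selector or targetText`);
    }
  });
  return errors;
}
```

A unit test along the lines described in the summary could feed a known-good action list through a check like this and assert that the error list is empty, so that a dropped field shows up as a test failure rather than as missing replay evidence later.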

Evidence

Local verification:

  • node --import ./backend/node_modules/tsx/dist/esm/index.mjs --test backend/test/agent-goal-contract.test.ts backend/test/collection-browser-actions.test.ts backend/test/collection-agent-runner.test.ts
  • npm --prefix backend run build
  • npm --prefix backend test
  • git diff --check

GitHub checks currently cover secret/review automation only; rerun backend gates after restacking.

Restack simulation

Disposable worktree /private/tmp/bigset-restack-sim-20260523-0912 cherry-picked this commit (d818ba38b8785e07f7ec983717502645aa1f7171) on top of PR #63 head without conflicts, then cherry-picked PR #65.

Verification on the simulated final stack:

  • targeted tests: 23/23 pass
  • npm --prefix backend run build: pass
  • git diff --check: pass
  • make verify-self-healing: pass, including backend tests 94/94

Notes

  • No auto-merge.
  • This does not compile Playwright yet. It makes the upstream Agent responsible for emitting ordered browser actions that the existing trace/readiness layer can preserve.
  • After PR #63 (Roll up self-healing data collection stack) lands, replay this as commit d818ba38b8785e07f7ec983717502645aa1f7171 onto fresh origin/main, then rerun build/tests/make verify-self-healing.
  • Promotion gate: run a live Agent-enabled collection-self-heal canary and require a result other than candidate_rejected plus playwrightCandidateStatus: "ready" before treating this as production-ready.
  • Benchmark canary needs OPENROUTER_API_KEY and TINYFISH_API_KEY. App-path commit smoke also needs Convex/admin/local-stack prerequisites.
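
The promotion gate and key prerequisites in the notes above could be expressed as a small check. This is only a sketch under stated assumptions: the CanaryResult shape, the status field name, and both helper names are hypothetical. Only the candidate_rejected value, playwrightCandidateStatus: "ready", and the two environment variable names come from the notes.

```typescript
// Hypothetical canary result shape; `status` is an assumed field name.
interface CanaryResult {
  status: string;
  playwrightCandidateStatus?: string;
}

// True only when the canary was not rejected and the Playwright candidate is ready.
function passesPromotionGate(result: CanaryResult): boolean {
  return (
    result.status !== "candidate_rejected" &&
    result.playwrightCandidateStatus === "ready"
  );
}

// Lists which of the keys named in the notes are absent or empty in the given environment.
function missingCanaryKeys(env: Record<string, string | undefined>): string[] {
  return ["OPENROUTER_API_KEY", "TINYFISH_API_KEY"].filter((key) => !env[key]);
}
```

Running missingCanaryKeys against the process environment before starting the canary would surface a missing key up front instead of partway through a live run.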

coderabbitai Bot (Contributor) commented May 23, 2026

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: bd80d8d0-af87-489e-a123-3e7de5ba082c

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

giaphutran12 (Collaborator, Author) commented

Closing stale draft cleanup PR; superseded by later BigSet work.
