5H99L1Y2BrcB2if9N92NXzfrzyYDXc82dsayqgdK2MyCs4RX by luckysymb · Pull Request #1596 · unarbos/ninja

luckysymb · 2026-05-14T22:36:11Z

No description provided.

github-actions · 2026-05-14T22:37:05Z

OpenRouter PR Judge

Verdict: WARN
Model: anthropic/claude-opus-4.7
Threshold: 70

Score	Value
Overall	74
Real edit	80
Safety	92
Scope	85
Contract	95

Summary

Large but substantive harness rewrite. Adds several new mechanical components: IDF-style term-frequency weighting in file ranking (_term_doc_freq / _rank_term_weight), relevance-scored region selection for large preloaded files (_select_relevant_regions instead of head-truncation), per-language directory-companion expansion (_augment_with_directory_companions), a signature-change audit detecting caller-breaking parameter shifts, an import-resolution preflight, a caller audit on removed/renamed symbols, a creation-intent issue detector mirroring the existing relocation detector, a confident early-exit that skips the self-check turn when all structural gates are clean, and a dual-pass mid-loop rescue replacing the previous single soft-nudge + hail-mary pair. Also adds lockfile and build-cache exclusions, and removes the prior JS/TS comment-import-concat repair helper.

Static Checks

agent.py has very few substantive changed lines.
Large patch with 2235 changed lines; judge should inspect for churn.

Judge Reasons

Diff is large (~1800 add / ~400 del) but adds real new helpers with new control flow, not just renames or reorderings.
solve(...) signature, return dict, DANGEROUS_PATTERNS, _EDGECASE_GUARDRAIL, and validator-owned boundaries are preserved.
No third-party imports; remains stdlib-only. No new endpoints, hardcoded secrets, or hostnames.
MINER-EDITABLE / VALIDATOR CONTRACT markers and docstrings are intact; identifiers remain readable.
Removes _split_comment_import_concat (a JS/TS patch-repair helper) — a capability regression the miner is implicitly betting against, worth a human glance but not disqualifying.
Removes the 'Scope discipline' line from build_initial_user_prompt while moving similar guidance into the self-check; net effect is prompt reshaping rather than removal.
Self-check prompt is reframed from 'adversarial review' to a structured correctness/completeness/scope-discipline pass — wording change, not a Goodhart-style judge-flattering payload.

Risks

scope-drift (mild): touches many distinct areas of agent.py in one PR (ranking, preload, sanitizer, multiple new audit gates, rescue loop), making it harder for the next forker to bisect a regression.
goodhart (low): self-check prompt explicitly tells the model to avoid whitespace-only / type-annotation-only / unrelated-refactor hunks; this is plausible scope hygiene rather than judge persuasion, but the framing leans toward shaping what the judge will see.
cosmetic-copy (low/none): the renames (_augment_with_directory_siblings → _companions, _check_inplace_intent → _inplace_intent_advisories, _patch_removed_definitions → _patch_removed_symbols) are paired with materially expanded behavior, not identity refactors.

Required Changes

No required changes returned.

Update print statement from 'Hello' to 'Goodbye'

1b0ce75

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

5H99L1Y2BrcB2if9N92NXzfrzyYDXc82dsayqgdK2MyCs4RX#1596

5H99L1Y2BrcB2if9N92NXzfrzyYDXc82dsayqgdK2MyCs4RX#1596
luckysymb wants to merge 1 commit into
unarbos:mainfrom
luckysymb:grit11

luckysymb commented May 14, 2026

Uh oh!

github-actions Bot commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

luckysymb commented May 14, 2026

Uh oh!

github-actions Bot commented May 14, 2026

OpenRouter PR Judge

Summary

Static Checks

Judge Reasons

Risks

Required Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant