task-proof

Independent verification framework for OpenCode - closes the self-certification gap by spawning a fresh LLM session that has never seen the build context.

The Problem

The agent that wrote the code physically cannot judge it cleanly: it has the same context, assumptions, and blind spots it had while building. Self-certification bias is real, and on tasks with multiple acceptance criteria it shows up as "everything looks good" reports for incomplete work.

task-proof closes that gap with two pieces working together:

fresh-verify guard - every git push and git commit is held for review by an LLM that sees only the last task description and the diff.
task-proof skill - a 7-step proof loop: spec freeze, build, evidence pack, fresh verify, fix loop, complete, self-improve.

A proof-recommend nudge fires once per session to suggest the skill when the user's request looks complex.

Quickstart

task-proof is now pure TypeScript. Install from a checkout with Node.js and tsx:

npm install
npx tsx install.ts --target /path/to/your/repo

For this repo's dogfood install:

npm run install:local

That installs the core under .uplift/task-proof/, installs the project-local OpenCode plugin under .opencode/plugins/, and installs the repo-scoped skill under .opencode/skills/task-proof/.

Commit .uplift/task-proof/ and .opencode/ so the proof loop is available in worktrees. OpenCode auto-loads project plugins from .opencode/plugins/ unless it is started with --pure.

How It Works

OpenCode hook event
        |
        v
.opencode/plugins/task-proof.ts              <- OpenCode adapter
        |
        v
.uplift/task-proof/core/cmd/task-proof-run.ts pre-commit
        |
        v
.uplift/task-proof/core/guards/fresh-verify.ts
        |
        v
.uplift/task-proof/core/lib/llm-client.ts
        |
        v
TASK_PROOF_LLM_CMD -> opencode run --pure

Two layers, on purpose:

core/ is host-agnostic and speaks plain text tags: BLOCK:, ASK:, WARN:, or empty output.
adapters/opencode/ translates those tags to OpenCode's plugin hook surface.

Runtime proof artifacts are separate from installed code and live under .task-proof/runs/<TASK_ID>/.

Configuration

Everything is environment variables. See CONTRACT.md for the full list.

Variable	Purpose
`TASK_PROOF_DISABLED=1`	Kill switch for the whole product
`TASK_PROOF_DISABLE_FRESH_VERIFY=1`	Disable just the verifier
`TASK_PROOF_DISABLE_PROOF_RECOMMEND=1`	Disable the prompt nudge
`TASK_PROOF_LLM_CMD=...`	Plug in any LLM command or mock backend
`TASK_PROOF_LLM_BACKEND=opencode`	Force the built-in OpenCode backend
`TASK_PROOF_OPENCODE_MODEL=...`	Optional model override for `opencode run`
`TASK_PROOF_OPENCODE_ASK_BEHAVIOR=warn`	Let OpenCode log `ASK:` and allow instead of throwing
`TASK_PROOF_OPENCODE_PROMPT_TIMEOUT_MS=...`	OpenCode prompt nudge timeout
`TASK_PROOF_OPENCODE_TOOL_TIMEOUT_MS=...`	OpenCode tool guard timeout
`CI=true`	Skip everything in CI environments

The Skill

Once installed, invoke task-proof with a task description. The skill walks the seven steps in adapters/opencode/skills/task-proof/SKILL.md:

Spec freeze - write acceptance criteria, get user confirmation, then treat the spec as immutable.
Build - implement normally with existing repo patterns.
Evidence pack - record actual proof for each criterion in .task-proof/runs/<TASK_ID>/evidence.json.
Fresh verify - independent LLM checks spec vs evidence and writes a verdict.
Fix loop - up to three iterations on failures, then escalate.
Complete - report final verdict and residual risks.
Self-improve - reflect on what could have caught the issue sooner.

Layout

task-proof/
|-- core/
|   |-- cmd/task-proof-run.ts
|   |-- guards/{fresh-verify,proof-recommend}.ts
|   `-- lib/{decision,llm-client,payload,process}.ts
|-- adapters/
|   `-- opencode/
|       |-- plugins/task-proof.ts
|       `-- skills/task-proof/SKILL.md
|-- templates/{spec.md.tmpl,gitignore.snippet}
|-- tests/{run.ts,fixtures/...}
|-- install.ts
|-- remote-install.ts
|-- package.json
`-- CONTRACT.md

The committed .uplift/ and .opencode/ directories are dogfood output for this repo.

Tests

npm run check
npm test

The test runner sets up a throwaway git repo, mocks the LLM backend with TASK_PROOF_LLM_CMD, runs every fixture under tests/fixtures/, and checks the OpenCode adapter, OpenCode LLM backend selection, installer idempotency, and the absence of legacy Bash/JS source files.

True-positive fixtures (tp-*.json) must produce non-empty output; true-negative fixtures (tn-*.json) must stay silent.

Uninstall

rm -rf .uplift/task-proof .opencode/plugins/task-proof.ts .opencode/skills/task-proof

OpenCode auto-loads .opencode/plugins/*.{ts,js}; uninstalling the plugin file is enough unless you added explicit OpenCode config yourself.

License

MIT.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

task-proof

The Problem

Quickstart

How It Works

Configuration

The Skill

Layout

Tests

Uninstall

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.opencode		.opencode
.uplift/task-proof/core		.uplift/task-proof/core
adapters/opencode		adapters/opencode
core		core
templates		templates
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CONTRACT.md		CONTRACT.md
LICENSE		LICENSE
README.md		README.md
install.ts		install.ts
package-lock.json		package-lock.json
package.json		package.json
remote-install.ts		remote-install.ts
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

task-proof

The Problem

Quickstart

How It Works

Configuration

The Skill

Layout

Tests

Uninstall

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages