Chinese version: [README_zh.md](./README_zh.md)
An MCP server for Windows reverse engineering. It exposes PE triage, Ghidra-backed inspection, DLL/COM profiling, runtime evidence ingestion, Rust/.NET recovery, source-like reconstruction, and LLM-assisted review as reusable MCP tools for any tool-calling LLM.
- Universal Windows PE coverage: EXE, DLL, COM-oriented libraries, Rust-native samples, and .NET assemblies all have dedicated profiling or recovery paths.
- Recover-first design: when Ghidra function extraction is empty or degraded, the server can continue with `.pdata` parsing, boundary recovery, symbol recovery, and imported function definitions.
- Observable Ghidra runs: command logs, runtime logs, staged progress, project/log roots, and parsed Java exception summaries are surfaced through high-level outputs.
- Runtime-aware reconstruction: static evidence, trace imports, memory snapshots, and semantic review artifacts can all be correlated back into reconstruct and report workflows.
- LLM-assisted review layers: function naming, function explanation, and module reconstruction review are exposed as structured MCP flows instead of ad hoc prompts.
- Queue-friendly orchestration: long-running workflows return `job_id`, `progress`, and `polling_guidance` so MCP clients can wait efficiently instead of burning tokens on tight polling loops.
This iteration adds a stronger first-pass static analysis layer before deep reverse engineering:
- `static.capability.triage` uses capa-style behavior classification to answer what a sample appears capable of, not just what strings or imports it contains.
- `pe.structure.analyze` merges `pefile`- and `LIEF`-style structural parsing into one canonical PE summary with backend-specific detail blocks.
- `compiler.packer.detect` adds compiler, protector, and packer attribution with setup-aware degradation when Detect It Easy is unavailable.
- `workflow.triage`, `report.summarize`, and `report.generate` now consume these results directly, including artifact provenance, static scope selection, and compare/baseline support.
`sample.ingest` → `static.capability.triage` → `pe.structure.analyze` → `compiler.packer.detect` → `workflow.triage` → `report.summarize`
`ghidra.analyze` → `workflow.function_index_recover` → `workflow.reconstruct`
`workflow.reconstruct` → `workflow.semantic_name_review` → `workflow.function_explanation_review` → `workflow.module_reconstruction_review`
This project is meant to be a reusable reverse-engineering tool surface, not a pile of one-off local scripts.
It is designed to help MCP clients:
- triage Windows PE samples quickly
- inspect imports, exports, strings, packers, runtime hints, and binary role
- use Ghidra when available for decompile, CFG, search, and reconstruction
- recover usable function indexes when Ghidra function extraction fails
- surface actionable setup guidance when Java, Python extras, or Ghidra are missing
- expose richer Ghidra diagnostics, command logs, and stage/progress metadata when analysis fails
- correlate static evidence, runtime traces, memory snapshots, and semantic review artifacts
- export source-like reconstruction output with optional build and harness validation
`sample.ingest`, `sample.profile.get`, `static.capability.triage`, `pe.structure.analyze`, `compiler.packer.detect`, `pe.fingerprint`, `pe.imports.extract`, `pe.exports.extract`, `pe.pdata.extract`, `dll.export.profile`, `com.role.profile`, `strings.extract`, `strings.floss.decode`, `yara.scan`, `runtime.detect`, `packer.detect`, `binary.role.profile`, `system.setup.guide`

`ghidra.health`, `ghidra.analyze`, `code.functions.list`, `code.functions.rank`, `code.functions.search`, `code.function.decompile`, `code.function.disassemble`, `code.function.cfg`, `code.functions.reconstruct`

`code.functions.smart_recover`, `pe.symbols.recover`, `code.functions.define`, `rust_binary.analyze`, `workflow.function_index_recover`

`dotnet.metadata.extract`, `dotnet.types.list`, `dotnet.reconstruct.export`

`dynamic.dependencies`, `sandbox.execute`, `dynamic.trace.import`, `dynamic.memory.import`, `attack.map`, `ioc.export`, `report.summarize`, `report.generate`, `artifacts.list`, `artifact.read`, `artifacts.diff`, `tool.help`

`code.function.rename.prepare`, `code.function.rename.review`, `code.function.rename.apply`, `code.function.explain.prepare`, `code.function.explain.review`, `code.function.explain.apply`, `code.module.review.prepare`, `code.module.review`, `code.module.review.apply`, `code.reconstruct.plan`, `code.reconstruct.export`
These are the main orchestration entrypoints for MCP clients.
Fast first-pass triage for PE samples. Use this when you want a quick answer before deeper recovery.
`workflow.triage` now combines:
- fingerprint and runtime hint extraction
- import and string triage
- YARA matches
- static capability triage
- canonical PE structure summary
- compiler/packer attribution
Long-running static pipeline for deeper analysis and ranking. Supports async job mode.
The main high-level reconstruction workflow.
It can:
- run binary preflight
- detect Rust-oriented samples
- profile DLL lifecycle, export dispatch, callback surface, and COM activation hints
- auto-recover a function index when Ghidra function extraction is missing or degraded
- export native or .NET reconstruction output
- optionally validate build and run the generated harness
- tune export strategy based on role-aware preflight for native Rust, DLL, and COM-oriented samples
- return structured setup guidance when Java, Ghidra, or optional dependencies are not ready
- expose stage-oriented progress metadata for queued and foreground runs
- carry runtime and semantic provenance through the result
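A minimal invocation sketch is shown below. The argument names are illustrative only, not the server's actual schema; use `tool.help` to discover the real parameters for `workflow.reconstruct`.

```json
{
  "tool": "workflow.reconstruct",
  "arguments": {
    "sample_id": "<id returned by sample.ingest>"
  }
}
```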
High-level recovery chain for hard native binaries:
`code.functions.smart_recover` → `pe.symbols.recover` → `code.functions.define`
Use this when Ghidra analysis exists but function extraction is empty or degraded.
High-level semantic naming review workflow for external LLM clients. It can prepare evidence, request model review through MCP sampling when available, apply accepted names, and optionally refresh reconstruct/export output. When export refresh runs, the workflow now carries the same `ghidra_execution` summary used by `workflow.reconstruct`, including project root, log root, command/runtime log paths, progress stages, and parsed Java exception context.

High-level explanation workflow for external LLM clients. It can prepare evidence, request structured explanations, apply them, and optionally rerun reconstruct/export. Export refresh results also surface `ghidra_execution` so explanation-heavy review chains still expose Ghidra project/log context and progress metadata.

High-level module review workflow for external LLM clients. It can prepare reconstructed modules for review, request structured module refinements through MCP sampling when available, apply accepted module summaries and guidance, and optionally refresh reconstruct/export output. When export refresh runs, the workflow also carries `ghidra_execution` so module-level review chains expose Ghidra project/log context and progress metadata just like reconstruct and function-level review workflows.
This server does not assume Ghidra is always able to recover functions correctly.
For difficult native samples, especially Rust, Go, or heavily optimized binaries, the recovery path is:
1. `ghidra.analyze`
2. if Ghidra post-script extraction fails, use `pe.pdata.extract`
3. recover candidate function boundaries with `code.functions.smart_recover`
4. recover names with `pe.symbols.recover`
5. import the recovered boundaries with `code.functions.define`
6. continue with `code.functions.list`, `code.functions.rank`, `code.functions.reconstruct`, or `workflow.reconstruct`
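The recovery path above can be driven from the client side. This is a hypothetical sketch assuming a generic `call_tool(name, arguments)` helper from your MCP client library and an illustrative `function_index_ready` result field; neither is part of the server's documented schema.

```python
def recover_function_index(call_tool, sample_id):
    """Walk the fallback chain until a usable function index exists."""
    analysis = call_tool("ghidra.analyze", {"sample_id": sample_id})
    if analysis.get("function_index_ready"):
        # Ghidra post-script extraction succeeded; nothing to recover.
        return analysis

    # Extraction failed or degraded: fall back to .pdata-based recovery.
    call_tool("pe.pdata.extract", {"sample_id": sample_id})
    call_tool("code.functions.smart_recover", {"sample_id": sample_id})
    call_tool("pe.symbols.recover", {"sample_id": sample_id})

    # Import the recovered boundaries so list/rank/reconstruct can proceed.
    return call_tool("code.functions.define", {"sample_id": sample_id})
```

After this returns, the caller continues with `code.functions.list`, `code.functions.rank`, or `workflow.reconstruct` as usual.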
This means `function_index` readiness is tracked separately from decompile and CFG readiness.
Most high-level tools support explicit scope control so clients can choose between all history and the current session.
Runtime evidence selection:
- `evidence_scope=all`
- `evidence_scope=latest`
- `evidence_scope=session` with `evidence_session_tag`
Semantic naming / explanation / module-review selection:
- `semantic_scope=all`
- `semantic_scope=latest`
- `semantic_scope=session` with `semantic_session_tag`
Comparison-aware outputs are also supported through:
- `compare_evidence_scope`
- `compare_evidence_session_tag`
- `compare_semantic_scope`
- `compare_semantic_session_tag`
This allows MCP clients to ask not only "what is the current result?" but also "what changed compared with the previous evidence or semantic review session?"
Static-analysis artifact selection:
- `static_scope=all`
- `static_scope=latest`
- `static_scope=session` with `static_session_tag`
Static baseline comparison:
- `compare_static_scope`
- `compare_static_session_tag`
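Combining these, a scoped, comparison-aware report request might look like the following; the session tag value is illustrative, and the parameter names are the scope controls listed above.

```json
{
  "tool": "report.summarize",
  "arguments": {
    "evidence_scope": "session",
    "evidence_session_tag": "triage-run-1",
    "static_scope": "latest",
    "compare_static_scope": "all"
  }
}
```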
This server supports multiple structured review layers for MCP clients with tool calling and optional sampling:
- function naming review
- function explanation review
- module reconstruction review
Each layer follows the same pattern:
- prepare a structured evidence bundle
- optionally ask the connected MCP client to perform a constrained review through sampling
- apply accepted results as stable semantic artifacts
- rerun reconstruct/export/report workflows against explicit semantic scope
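Because all three layers share this shape, a client can drive them generically. In this sketch, `call_tool` and the result fields (`items`, `accepted`) are hypothetical, while the `prepare`/`review`/`apply` suffixes match tools such as `code.function.rename.prepare`:

```python
def run_review_layer(call_tool, layer, sample_id, semantic_session_tag):
    """Drive one review layer: prepare evidence, review it, apply accepted items."""
    # 1. Prepare a structured evidence bundle for this layer.
    bundle = call_tool(f"{layer}.prepare", {"sample_id": sample_id})

    # 2. Ask for a constrained review of that bundle.
    review = call_tool(f"{layer}.review", {"evidence": bundle})

    # 3. Apply only the accepted results, tagged with an explicit semantic session.
    accepted = [item for item in review["items"] if item.get("accepted")]
    return call_tool(f"{layer}.apply", {
        "items": accepted,
        "semantic_session_tag": semantic_session_tag,
    })
```

A later reconstruct/export/report call can then select the same `semantic_session_tag` via `semantic_scope=session`.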
Long-running workflows support queued execution and background completion:
- `workflow.deep_static`
- `workflow.reconstruct`
- `workflow.semantic_name_review`
- `workflow.function_explanation_review`
- `workflow.module_reconstruction_review`
Use these with:
`task.status`, `task.cancel`, `task.sweep`
Queued workflow responses and `task.status` now include `polling_guidance`. When a long-running Ghidra or reconstruct job is still queued or running, MCP clients should prefer a single client-side sleep/wait based on that recommendation instead of repeated immediate polling.
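For example, a client-side wait loop honoring that guidance might look like this; the `call_tool` helper and the `state`/`recommended_wait_seconds` field names are illustrative, not the server's documented schema.

```python
import time

def wait_for_job(call_tool, job_id, max_checks=60):
    """Poll task.status with server-recommended waits instead of a tight loop."""
    for _ in range(max_checks):
        status = call_tool("task.status", {"job_id": job_id})
        if status["state"] in ("completed", "failed", "cancelled"):
            return status
        # One sleep per check, sized by the server's recommendation,
        # so the client is not burning tokens on immediate retries.
        guidance = status.get("polling_guidance", {})
        time.sleep(guidance.get("recommended_wait_seconds", 5))
    raise TimeoutError(f"job {job_id} still running after {max_checks} checks")
```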
If a client starts using the server before Python, dynamic-analysis extras, or Ghidra are configured, use:
`system.health`, `dynamic.dependencies`, `ghidra.health`, `system.setup.guide`
These return structured setup actions and required user inputs so an MCP client can explicitly ask for:
- `python -m pip install ...`
- `JAVA_HOME`
- `GHIDRA_PATH` / `GHIDRA_INSTALL_DIR`
- `GHIDRA_PROJECT_ROOT` / `GHIDRA_LOG_ROOT`
- `CAPA_RULES_PATH`
- `DIE_PATH`
- optional dynamic-analysis extras such as Speakeasy/Frida dependencies
For runtime API tracing and behavioral analysis, install Frida:
`pip install frida frida-tools`

Environment variables (optional; auto-detected when `frida` is in `PATH`):

- `FRIDA_SERVER_PATH` - path to the Frida server binary for USB/remote device analysis
- `FRIDA_DEVICE` - device ID, or `"usb"` for USB device selection (default: local spawn)

Pre-built scripts are included in `frida_scripts/`:

- `api_trace.js` - Windows API tracing with argument logging
- `string_decoder.js` - runtime string decryption
- `anti_debug_bypass.js` - anti-debug detection neutralization
- `crypto_finder.js` - cryptographic API detection
- `file_registry_monitor.js` - file/registry operation tracking
See [docs/EXAMPLES.md](./docs/EXAMPLES.md#场景-9-frida-运行时-instrumentation) for usage examples.
Stable Features (Production Ready):
- PE triage and static analysis (`static.capability.triage`, `pe.structure.analyze`, `compiler.packer.detect`)
- Ghidra-backed inspection with full execution visibility
- DLL/COM profiling (`dll.export.profile`, `com.role.profile`)
- Rust and .NET recovery paths
- Source-like reconstruction with LLM-assisted review layers
- Runtime evidence ingestion and correlation
Frida Dynamic Instrumentation - implementation complete, awaiting release:

- `frida.runtime.instrument` - spawn- and attach-mode instrumentation
- `frida.script.inject` - pre-built and custom script injection
- `frida.trace.capture` - canonical trace schema with filtering/aggregation
- full integration with `dynamic.trace.import`, `report.generate`, `report.summarize`
- 101 unit tests + integration test coverage
- comprehensive documentation in `docs/EXAMPLES.md`
Test coverage: all 101 tests pass, including the Frida instrumentation suite.
For the new static triage foundation, the most common optional requirements are:
- `flare-capa`
- `pefile`
- `lief`
- a downloaded capa rules bundle referenced by `CAPA_RULES_PATH`
- the Detect It Easy CLI referenced by `DIE_PATH`
For Ghidra 12.0.4, the server expects Java 21+ and will report explicit Java compatibility hints through:
`ghidra.health`, `system.health`, `system.setup.guide`
When Ghidra commands fail, the server now persists command logs and, when available, Ghidra runtime logs. Normalized diagnostics include Java exception summaries and remediation hints instead of only returning exit code 1.
The bundled `ghidra_scripts/` directory is resolved from the installed package or repository root, not from the current working directory. This prevents `ExtractFunctions.py` / `ExtractFunctions.java` lookup failures when the server is launched from a different folder.
High-level outputs now expose a structured `ghidra_execution` block instead of hiding Ghidra details behind generic success/failure states.
You can now see:
- which analysis record was selected
- whether the result came from the best ready analysis or only the latest attempt
- project path, project root, and log root
- persisted command logs and runtime logs
- function extraction status and script name
- staged progress metadata
- parsed Java exception summaries when Ghidra fails
This summary is surfaced through:
- `workflow.reconstruct`
- `workflow.semantic_name_review` (when export refresh runs)
- `workflow.function_explanation_review` (when export refresh runs)
- `workflow.module_reconstruction_review` (when export refresh runs)
- `report.summarize`
- `report.generate`
```
bin/                          npm CLI entrypoint
dist/                         compiled TypeScript output
ghidra_scripts/               Ghidra helper scripts used by the server
helpers/DotNetMetadataProbe/  .NET metadata helper project
src/                          TypeScript MCP server source
tests/                        unit and integration tests
workers/                      Python worker, YARA rules, dynamic helpers
install-to-codex.ps1          local Codex MCP install helper
install-to-copilot.ps1        local GitHub Copilot MCP install helper
install-to-claude.ps1         local Claude Code MCP install helper
docs/QUALITY_EVALUATION.md    evaluation checklist for regression and release readiness
```
Required:
- Node.js 18+
- npm 9+
- Python 3.9+
Optional but strongly recommended:
- Ghidra for native decompile and CFG features
- .NET SDK for `dotnet.metadata.extract`
- Clang for reconstruct export validation
- Python packages from `requirements.txt`
- Python worker packages from `workers/requirements.txt`
Install JavaScript dependencies:

```shell
npm install
```

Install Python worker dependencies:

```shell
python -m pip install -r requirements.txt
python -m pip install -r workers/requirements.txt
python -m pip install -r workers/requirements-dynamic.txt
```

Build:

```shell
npm run build
```

Run tests:

```shell
npm test
```

Start locally:

```shell
npm start
```

```json
{
  "mcpServers": {
    "windows-exe-decompiler": {
      "command": "node",
      "args": ["/absolute/path/to/repo/dist/index.js"],
      "cwd": "/absolute/path/to/repo",
      "env": {
        "GHIDRA_PATH": "C:/path/to/ghidra",
        "GHIDRA_INSTALL_DIR": "C:/path/to/ghidra"
      }
    }
  }
}
```

- Codex: `install-to-codex.ps1`
- Claude Code: `install-to-claude.ps1`
- GitHub Copilot: `install-to-copilot.ps1`
Related docs:
By default, runtime state is stored under the user profile instead of the current working directory:
- Windows workspace root: `%USERPROFILE%/.windows-exe-decompiler-mcp-server/workspaces`
- SQLite database: `%USERPROFILE%/.windows-exe-decompiler-mcp-server/data/database.db`
- File cache: `%USERPROFILE%/.windows-exe-decompiler-mcp-server/cache`
- Audit log: `%USERPROFILE%/.windows-exe-decompiler-mcp-server/audit.log`
- Ghidra project root: `%ProgramData%/.windows-exe-decompiler-mcp-server/ghidra-projects`
- Ghidra log root: `%ProgramData%/.windows-exe-decompiler-mcp-server/ghidra-logs`
- Bundled Ghidra scripts: resolved from the installed package root
You can override these with environment variables or the user config file:
`%USERPROFILE%/.windows-exe-decompiler-mcp-server/config.json`

- `WORKSPACE_ROOT`
- `DB_PATH`
- `CACHE_ROOT`
- `AUDIT_LOG_PATH`
- `GHIDRA_PROJECT_ROOT`
- `GHIDRA_LOG_ROOT`
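A hypothetical `config.json` override is shown below; it assumes the config keys mirror the environment variable names above, and the paths are illustrative.

```json
{
  "WORKSPACE_ROOT": "D:/re-workspaces",
  "GHIDRA_PROJECT_ROOT": "D:/ghidra-projects",
  "GHIDRA_LOG_ROOT": "D:/ghidra-logs"
}
```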
For local IDE clients such as VS Code or Copilot, prefer local file paths:
```json
{
  "tool": "sample.ingest",
  "arguments": {
    "path": "E:/absolute/path/to/sample.exe"
  }
}
```

Use `bytes_b64` only when the client cannot access the same filesystem as the server.
The published package includes:
- compiled `dist/`
- the CLI entrypoint in `bin/`
- Python workers and YARA rules
- Ghidra helper scripts
- the .NET metadata helper source
- MCP client install scripts
It excludes:
- tests
- local workspaces
- caches
- generated reports
- scratch documents and internal progress notes
Pre-publish checklist:
- Update the version in `package.json`.
- Run `npm run release:check`.
- Inspect `npm run pack:dry-run`.
- Log in with `npm login`.
- Publish with `npm publish --access public`.
GitHub automation included in this repository:
For GitHub Actions publishing, configure the `NPM_TOKEN` repository secret.
This project is for analysis workflows, not live malware operations.
Current strengths:
- PE triage and classification support
- reverse-engineering evidence extraction
- IOC and ATT&CK export
- runtime evidence import and correlation
- source-like reconstruction and review
Current non-goals:
- original source recovery for complex native binaries
- guaranteed malware family attribution from static evidence alone
- fully automatic unpacking for every packer
- high-confidence semantic recovery of every function in heavily optimized code
- Contributor guide: `CONTRIBUTING.md`
- Quality evaluation notes: `docs/QUALITY_EVALUATION.md`
- Example benchmark corpus: `examples/benchmark-corpus.example.json`
- Security policy: `SECURITY.md`
```json
{
  "mcpServers": {
    "windows-exe-decompiler": {
      "command": "npx",
      "args": ["-y", "windows-exe-decompiler-mcp-server"],
      "env": {
        "GHIDRA_PATH": "C:/path/to/ghidra",
        "GHIDRA_INSTALL_DIR": "C:/path/to/ghidra"
      }
    }
  }
}
```

Released under the MIT license. See LICENSE.