inmydata PenTest

Automated penetration testing tool for web applications deployed by inmydata's agentic coding pipeline. Combines OWASP ZAP (DAST scanning) and Nuclei (template-driven vulnerability detection) into a single orchestrated pipeline with unified HTML reporting.

Architecture

┌─────────────────────────────────────────────────────────┐
│                    CI/CD Pipeline                        │
│  (GitHub Actions / AWS CodePipeline)                    │
└──────────────────────┬──────────────────────────────────┘
                       │
                       ▼
┌─────────────────────────────────────────────────────────┐
│              pentest.py (Orchestrator)                   │
│                                                         │
│  1. Load config.yaml                                    │
│  2. Authenticate via AWS Cognito                        │
│  3. Run ZAP spider + active scan (authenticated)        │
│  4. Run Nuclei with custom + community templates        │
│  5. Merge findings, deduplicate, rank by severity       │
│  6. Generate HTML report                                │
└───────┬────────────────────────┬────────────────────────┘
        │                        │
        ▼                        ▼
┌───────────────┐    ┌───────────────────────┐
│   OWASP ZAP   │    │       Nuclei          │
│  (Docker)     │    │  (Binary / Docker)    │
│               │    │                       │
│  - Spider     │    │  - Community temps    │
│  - Ajax Spider│    │  - Custom temps       │
│  - Passive    │    │    (inmydata-specific) │
│  - Active     │    │  - CVE detection      │
│  - Auth scan  │    │  - Misconfig checks   │
└───────────────┘    └───────────────────────┘
        │                        │
        └──────────┬─────────────┘
                   ▼
        ┌────────────────────┐
        │   HTML Report      │
        │                    │
        │  - Executive summ  │
        │  - Findings table  │
        │  - Risk breakdown  │
        │  - Remediation     │
        └────────────────────┘

Quick Start

Prerequisites

Docker and Docker Compose
Python 3.11+
AWS credentials configured (for Cognito authentication)

Installation

# Clone the repo
git clone <your-repo-url>
cd inmydata-pentest

# Install Python dependencies
pip install -r requirements.txt

# Pull ZAP Docker image
docker pull ghcr.io/zaproxy/zaproxy:stable

# Install Nuclei
go install -v github.com/projectdiscovery/nuclei/v3/cmd/nuclei@latest
# Or use Docker: docker pull projectdiscovery/nuclei:latest

Configuration

Copy and edit the config file:

cp config.example.yaml config.yaml

Edit config.yaml with your target URLs, Cognito settings, and scan preferences.

Running a Scan

# Full scan against a target
python pentest.py --config config.yaml --target https://app.example.com

# Quick scan (passive only, no active exploitation)
python pentest.py --config config.yaml --target https://app.example.com --mode quick

# Scan with custom Nuclei templates only
python pentest.py --config config.yaml --target https://app.example.com --nuclei-only

CI/CD Integration (GitHub Actions)

Add the workflow from .github/workflows/pentest.yml to your repo. Configure the following secrets:

PENTEST_TARGET_URL - the URL to scan
AWS_ACCESS_KEY_ID - for Cognito auth
AWS_SECRET_ACCESS_KEY - for Cognito auth
COGNITO_USER_POOL_ID - your Cognito user pool
COGNITO_CLIENT_ID - your Cognito app client ID
COGNITO_USERNAME - test user for authenticated scanning
COGNITO_PASSWORD - test user password

Custom Nuclei Templates

The templates/nuclei/ directory contains inmydata-specific templates targeting common patterns in agentic-built applications.

api-auth-bypass.yaml - Tests for authentication bypass on API routes
cors-misconfiguration.yaml - Detects overly permissive CORS policies
exposed-debug-endpoints.yaml - Finds debug/admin endpoints left exposed
nextjs-misconfig.yaml - Next.js specific misconfigurations
exposed-env-vars.yaml - Environment variable and secret exposure

Report Output

Reports are saved to the reports/ directory as timestamped HTML files. Each report includes an executive summary with risk scoring, a detailed findings table sorted by severity, remediation guidance for each finding, and scan metadata (duration, tools used, coverage).

Severity Levels

Level	Description
Critical	Actively exploitable, immediate data breach risk
High	Exploitable with moderate effort, significant impact
Medium	Potential vulnerability, requires specific conditions
Low	Informational, best-practice violations

License

Internal use only. Copyright DataPA Limited.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/workflows		.github/workflows
src		src
templates/nuclei		templates/nuclei
.gitignore		.gitignore
Dockerfile		Dockerfile
PLAN.md		PLAN.md
README.md		README.md
config.example.yaml		config.example.yaml
docker-compose.yml		docker-compose.yml
pentest.py		pentest.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

inmydata PenTest

Architecture

Quick Start

Prerequisites

Installation

Configuration

Running a Scan

CI/CD Integration (GitHub Actions)

Custom Nuclei Templates

Report Output

Severity Levels

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

inmydata PenTest

Architecture

Quick Start

Prerequisites

Installation

Configuration

Running a Scan

CI/CD Integration (GitHub Actions)

Custom Nuclei Templates

Report Output

Severity Levels

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages