Skip to content

kherrera6219/DataLogicEngine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

961 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

DataLogicEngine

Enterprise AI orchestration, governed LLM routing, and knowledge graph reasoning in one deployable platform.

CI Security Deploy Python Node License

DataLogicEngine (DLE) is a local-first Enterprise AI Platform built around the Universal Knowledge Graph (UKG).

Unlike traditional AI applications that operate as black boxes, DLE provides explainable reasoning, multi-agent orchestration, GraphRAG retrieval, governance controls, and complete audit traceability.

Designed for enterprise, government, compliance, cybersecurity, acquisition, and research environments, every AI decision can be traced through evidence sources, personas, reasoning stages, validation checkpoints, and immutable audit records compliant with EU AI Act Article 53.

Current Status: Feature Complete (Local-First Edition)

Major Systems Completed:

  • Universal Knowledge Graph (UKG)
  • 17-Axis Knowledge Framework
  • 10-Layer Truth Engine
  • 12-Step Refinement Workflow
  • Knowledge Algorithm Framework (120+ KAs)
  • Multi-Agent Orchestration
  • GraphRAG Integration
  • Knowledge Ingestion Pipeline
  • Trace Viewer
  • MCP Integration Framework
  • Local Database Lifecycle Management
  • Enterprise Audit & Governance Framework

Current Focus:

  • Accessibility validation (NVDA)
  • Production code signing
  • Release evidence package
  • Public architecture assets
  • Expanded integration benchmarks
  • Public release readiness

What Makes DataLogicEngine Different?

Most AI applications answer questions.

DataLogicEngine explains how answers were produced.

Core Differentiators

Universal Knowledge Graph (UKG)

A structured knowledge system designed to organize information, expertise, regulations, compliance requirements, risk factors, locations, time contexts, and reasoning workflows into a unified framework accessible to human operators and AI agents.

17-Axis Knowledge Framework

A multidimensional coordinate system that maps every request across knowledge domains, industries, regulatory frameworks, compliance requirements, expertise models, geography, temporal context, risk profiles, and governance policies.

10-Layer Truth Engine

A progressive reasoning architecture that combines retrieval, validation, simulation, planning, trust analysis, safety controls, and audit generation into a governed AI workflow.

12-Step Refinement Workflow

A structured reasoning improvement pipeline that continuously validates, refines, audits, and strengthens outputs before release.

Knowledge Algorithm Framework

More than 120 specialized Knowledge Algorithms (KAs) provide modular capabilities for planning, validation, compliance analysis, contradiction detection, risk assessment, reasoning control, governance policy enforcement, and audit trace generation.

Explainable AI

Every response can be traced through:

  • Evidence sources
  • Personas
  • Knowledge Algorithms
  • Validation checkpoints
  • Refinement stages
  • Governance policies
  • Audit records

Local-First Architecture

Supports disconnected, air-gapped, enterprise, government, and workstation deployments without requiring external cloud infrastructure.

Model Context Protocol (MCP)

Native MCP support enables integration with tools, resources, external agent systems, subscriptions, and dynamic plugin architectures.

Roadmap

Release Readiness:

  • Complete NVDA accessibility validation
  • Production code-signing pipeline
  • Final release evidence package
  • Public architecture diagrams and assets
  • Expanded integration and performance benchmarking

Enterprise Enhancements:

  • Advanced policy-as-code governance
  • Expanded multi-tenant controls
  • Enhanced cost and usage analytics
  • Human feedback and review workflows
  • Advanced persona orchestration strategies

Platform Evolution:

  • Local SLM optimization for lower reasoning tiers
  • Expanded GraphRAG retrieval capabilities
  • Enterprise deployment automation
  • Additional knowledge ingestion connectors
  • Advanced knowledge graph learning and adaptation

Recently Completed:

  • βœ… Local-first database lifecycle management
  • βœ… GraphRAG integration
  • βœ… Trace Viewer implementation
  • βœ… PDF and DOCX ingestion support
  • βœ… Async ingestion workflows
  • βœ… SQL to Neo4j synchronization
  • βœ… Advanced MCP integration framework
  • βœ… Real-time trace streaming
  • βœ… Automated documentation validation
  • βœ… Portable workstation deployment stack
  • βœ… Enterprise audit traceability framework
  • βœ… Knowledge Algorithm expansion and validation
  • βœ… Truth Engine release readiness improvements

See TODO.md for the canonical backlog and release-readiness work items.

Recommended architecture asset path: docs/assets/readme/architecture-overview.png. Add a dark-mode-safe PNG/SVG export when publishing visual docs.

Quick Links

Quickstart

Run the full local stack with Docker:

git clone https://github.com/kherrera6219/DataLogicEngine.git
cd DataLogicEngine
cp .env.template .env
docker compose up --build

Open:

Service URL
Web console http://localhost:3000
Backend API http://localhost:5000
Health probe http://localhost:5000/health
Metrics http://localhost:5000/metrics
Swagger UI http://localhost:5000/api/docs

Minimal API call:

curl http://localhost:5000/health

Contents

Why DataLogicEngine

DataLogicEngine is designed for teams that need AI workflows to be explainable, inspectable, and operable in regulated environments.

Capability What it provides
LLM gateway Multi-provider routing for OpenAI, Anthropic, Azure OpenAI, Google, and Gemini-style providers with retries, circuit-breaker behavior, cost tracking, and audit metadata.
Knowledge graph Structured graph model with sectors, domains, pillars, knowledge nodes, edges, and 17-axis reasoning support.
Traceable reasoning Runs, traces, stage timing, persona context, and evidence references for audit reconstruction.
Governance RBAC, MFA support, CSRF controls, CORS policy enforcement, prompt-injection checks, request limits, and immutable audit patterns.
Local-first distribution Browser deployment plus Electron/NSIS Windows packaging for workstation and constrained-network scenarios.
Production operations Docker Compose, cloud Dockerfile, health/readiness probes, metrics endpoint, Sentry integration, and CI/security workflows.

Architecture

flowchart LR
  Client["Web console / API client"] --> Frontend["Next.js frontend"]
  Frontend --> API["Flask API"]
  API --> Auth["Auth, RBAC, CSRF, rate limits"]
  API --> Gateway["LLM Gateway"]
  API --> Graph["Knowledge Graph APIs"]
  API --> Truth["Truth Engine and tracing"]
  Gateway --> Providers["OpenAI / Anthropic / Azure / Google"]
  Graph --> Postgres["PostgreSQL"]
  Graph --> Neo4j["Neo4j"]
  API --> Redis["Redis cache and rate limit storage"]
  API --> ObjectStore["S3-compatible object storage"]
  API --> Metrics["/health /ready /metrics"]
Loading

Runtime Components

Layer Components Notes
Frontend Next.js 16, React 18, Electron 40 Web console, desktop shell, graph visualization, admin surfaces.
Backend Flask 3.1, SQLAlchemy, Socket.IO API routing, auth, gateway orchestration, audit, tracing.
Data PostgreSQL 15+, Neo4j 5+, Redis 7+, MinIO Relational state, graph state, cache/rate limits, object storage.
AI OpenAI, Anthropic, Azure OpenAI, Google/Gemini clients Provider keys are resolved at runtime from environment or configured provider records.
Quality Pytest, Ruff, Vitest, Playwright, GitHub Actions CI includes backend, frontend, governance, security, deploy, and Windows packaging checks.

Installation

Prerequisites

Tool Version Purpose
Python 3.11+ Backend runtime and tests
Node.js 24+ Frontend and Electron tooling
Docker Current stable Local full-stack development
PostgreSQL 15+ Production relational store
Redis 7+ Cache, rate limiting, async support
Neo4j 5+ Knowledge graph storage

Backend Development

Windows:

python -m venv .venv
.venv\Scripts\activate
python -m pip install --upgrade pip
pip install -r requirements.txt
copy .env.template .env
python app.py

macOS/Linux:

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install -r requirements.txt
cp .env.template .env
python app.py

Frontend Development

cd frontend
npm ci
npm run dev

Local Mode (no Docker, no cloud databases)

For workstation development without Docker, the setup script downloads and installs portable PostgreSQL, Redis, and Neo4j binaries locally and the app manages their lifecycle automatically:

# Install portable database binaries (one-time)
python scripts/setup_local_databases.py --all

# Seed Neo4j with UKG pillar taxonomy
python scripts/seed_neo4j.py

# Run database migrations
flask db upgrade

# Start the backend (databases auto-start on app launch)
python app.py

Verify all services are reachable:

python scripts/setup_local_databases.py --verify

Desktop Build

npm --prefix frontend run electron:dist

Installer artifacts are copied to the repository root as a single canonical setup executable:

  • DataLogicEngine Setup Latest.exe
  • matching .sha256 and .blockmap files

Configuration

Copy .env.template to .env and set values for your deployment target.

Required Production Variables

Variable Required Description
FLASK_ENV Yes Use production for deployed environments.
SECRET_KEY Yes Flask session secret. Generate a unique 64+ character value.
JWT_SECRET_KEY Yes JWT signing secret. Generate a unique 64+ character value.
SESSION_SECRET Yes Session signing secret used by runtime checks.
DATABASE_URL Yes SQLAlchemy database URL. PostgreSQL is recommended for production.
CORS_ORIGINS Yes Comma-separated allowed browser origins. Do not use * in production.
ADMIN_USERNAME Initial setup Initial administrative username.
ADMIN_PASSWORD Initial setup Strong initial password. Rotate after first login.
ADMIN_EMAIL Initial setup Initial administrator email.

Provider and Integration Variables

Variable Description
OPENAI_API_KEY OpenAI provider key.
ANTHROPIC_API_KEY Anthropic provider key.
AZURE_OPENAI_ENDPOINT Azure OpenAI endpoint URL.
AZURE_OPENAI_API_KEY Azure OpenAI provider key.
GOOGLE_API_KEY / GEMINI_API_KEY Google/Gemini provider key.
SENTRY_DSN Enables crash reporting when configured.
SENTRY_TRACES_SAMPLE_RATE Distributed trace sampling rate. Default: 0.1.
SENTRY_PROFILES_SAMPLE_RATE Profiling sample rate. Default: 0.1.

Data Services

Variable Default / Example Description
REDIS_URL redis://localhost:6379/0 Cache and runtime coordination.
RATELIMIT_STORAGE_URI redis://localhost:6379 Flask-Limiter storage backend.
NEO4J_URI bolt://localhost:7687 Neo4j Bolt endpoint. Standard Neo4j Bolt port.
NEO4J_USER neo4j Neo4j username.
NEO4J_PASSWORD unset Neo4j password.
OBJECT_ENDPOINT_URL http://localhost:9000 S3-compatible object storage endpoint.
OBJECT_ACCESS_KEY unset Object storage access key.
OBJECT_SECRET_KEY unset Object storage secret key.
OBJECT_BUCKET datalogic Object storage bucket.

API Examples

Base URLs:

Environment Base URL
Local backend http://localhost:5000
Versioned API http://localhost:5000/api/v1
Production https://your-domain.example/api/v1

Health and Readiness

curl http://localhost:5000/health
curl http://localhost:5000/live
curl http://localhost:5000/ready

Authentication

curl -X POST http://localhost:5000/api/v1/auth/login \
  -H "Content-Type: application/json" \
  -d '{
    "username": "operator@example.com",
    "password": "replace-with-a-secret"
  }'

API key authentication is also supported for programmatic access. Generate a key via the admin interface and include it as X-API-Key:

export UKG_KEY="ukg_<prefix>_<secret>"
curl -H "X-API-Key: $UKG_KEY" http://localhost:5000/api/v1/gateway/chat ...

LLM Gateway Request

curl -X POST http://localhost:5000/api/v1/gateway/chat \
  -H "X-API-Key: $UKG_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {
        "role": "user",
        "content": "Summarize the compliance impact of this control change."
      }
    ],
    "model": "gpt-5.5",
    "tier": "2"
  }'

Tier 2+ responses include a verifiable audit footer:

[UKG Audit Trace]
Tier: 2
Active Axes: ...
Personas Invoked: ...
Confidence: 0.395
Refinement Steps Executed: ...
Compliance Flags: ...
Key Assumption to Verify: ...
What Changes if Wrong: ...

Every Tier 2+ run also writes a TruthAuditEvent row with a SHA-256 hash-chain receipt for EU AI Act Article 53 compliance.

Knowledge Graph Query

curl -H "X-API-Key: $UKG_KEY" \
  http://localhost:5000/api/v1/knowledge-nodes

Knowledge Algorithm Execution

curl -X POST http://localhost:5000/api/v1/ka/algorithms/KA-001/execute \
  -H "X-API-Key: $UKG_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "input": {
      "claim": "New customer data must remain in-region.",
      "jurisdiction": "US"
    }
  }'

Standard Response Shape

{
  "success": true,
  "data": {},
  "error": null,
  "timestamp": "2026-01-11T19:35:00Z"
}

Deployment

Docker Compose

Use Docker Compose for local integration testing or single-host evaluation:

cp .env.template .env
docker compose up --build -d
docker compose ps

Cloud Container

Dockerfile.cloud builds the frontend and backend into a single runtime image:

docker build -f Dockerfile.cloud -t datalogicengine:latest .
docker run --env-file .env -p 5000:5000 -p 3000:3000 datalogicengine:latest

Production Checklist

  • Set FLASK_ENV=production.
  • Use PostgreSQL, Redis, Neo4j, and S3-compatible object storage outside the app container.
  • Set unique secrets for SECRET_KEY, JWT_SECRET_KEY, and SESSION_SECRET.
  • Configure exact CORS_ORIGINS.
  • Run database migrations instead of enabling AUTO_CREATE_SCHEMA.
  • Terminate TLS at a trusted reverse proxy or platform load balancer.
  • Enable Sentry or equivalent crash reporting.
  • Confirm /health, /ready, and /metrics are monitored.
  • Review docs/DEPLOYMENT.md, docs/OPERATIONAL_RUNBOOKS.md, and deploy/DEPLOYMENT_CHECKLIST.md.

Security and Compliance

DataLogicEngine includes security controls intended for enterprise deployments, but each deployment must still be threat-modeled and configured for its environment.

Area Built-in Support
Authentication Session auth, JWT flows, MFA routes, SSO/OIDC integration hooks, desktop challenge flow.
Authorization RBAC utilities, admin route controls, tenant-aware patterns.
Request security CSRF, request size limits, CORS enforcement, rate limiting, SSRF allowlisting utilities.
Data protection Secret resolution controls, encryption manager, PII redaction utilities, audit logging.
AI governance Prompt-injection checks, provider usage tracking, trace IDs, policy/gateway hooks.
Supply chain GitHub Actions security workflow, Bandit, npm audit, pip-audit, SBOM-oriented workflow steps.
Release governance Windows installer governance, signing workflows, integrity reporting, release checklist.

Security references:

πŸ”’ Report Security Issues Privately:

Do not report vulnerabilities in public issues. Follow the private reporting process in SECURITY.md.

Observability

Signal Location
Liveness GET /live
Readiness GET /ready
Health summary GET /health
Runtime metrics GET /metrics
API docs GET /api/docs
Gateway provider usage LLM gateway usage models and admin routes
Crash reporting SENTRY_DSN, SENTRY_TRACES_SAMPLE_RATE, SENTRY_PROFILES_SAMPLE_RATE
Run tracing /api/v1/trace/* and run-oriented UI routes

Recommended production integrations:

  • Prometheus-compatible scraping for /metrics.
  • Sentry or an equivalent error and performance backend.
  • Centralized JSON logs via python-json-logger and platform log shipping.
  • Alerting on readiness failures, provider error spikes, token cost anomalies, and authentication failures.

Testing

# Backend
python -m pytest tests/
python -m pytest tests/ --cov=backend --cov=models --cov-report=html --cov-report=term-missing --cov-report=json --cov-fail-under=70
python -m ruff check .
python -m pip_audit -r requirements.txt --desc

# Frontend
npm --prefix frontend ci
npm --prefix frontend run lint
npm --prefix frontend run typecheck
npm --prefix frontend run test
npm --prefix frontend audit --audit-level=high

Current CI Runs

  • βœ… Backend tests and dependency audit
  • βœ… Frontend lint, typecheck, tests, and build
  • βœ… Security scan workflow
  • βœ… Deploy build and test workflow
  • βœ… Windows packaging smoke test
  • βœ… Governance and release checklist workflows

Roadmap

Horizon Focus
Near term Complete app-readiness evidence: authenticated accessibility coverage, keyboard/NVDA checks, failure-mode tests, and export/delete end-to-end validation.
Near term Tighten public API contracts, reduce legacy route aliases, and improve generated OpenAPI coverage.
Near term Add public architecture assets under docs/assets/readme/.
Mid term Expand deployment reference material for Kubernetes, managed Postgres, managed Redis, and managed Neo4j.
Mid term Publish signed release artifacts with checksums and provenance metadata.
Long term Harden multi-tenant operations, cost controls, recursive persona evaluation, human feedback loops, and policy-as-code governance for larger deployments.

Getting Help

Documentation

Community & Support

  • Questions: Open a GitHub Discussion
  • Bug Reports: Create an issue with steps to reproduce
  • Security Issues: See SECURITY.md for responsible disclosure
  • API Documentation: Swagger UI at http://localhost:5000/api/docs (when running locally)

Contributing

Contributions are welcome when they align with the project license and governance model.

  1. Read CONTRIBUTING.md.
  2. Read CODE_OF_CONDUCT.md.
  3. Create an issue for non-trivial changes before implementation.
  4. Run backend and frontend checks locally.
  5. Submit a pull request using the repository template.

Development references:

License

DataLogicEngine is licensed under the PolyForm Noncommercial License 1.0.0.

Personal, research, and educational use are permitted under the license terms. Commercial use, production deployment in a business environment, or integration into a paid product requires a separate commercial license. See COMMERCIAL_LICENSE.md for details.

Repository Metadata

Existing Supporting Files

File Status
LICENSE Present
COMMERCIAL_LICENSE.md Present
SECURITY.md Present
CONTRIBUTING.md Present
CODE_OF_CONDUCT.md Present
SUPPORT.md Present
CHANGELOG.md Present
.github/CODEOWNERS Present
.github/pull_request_template.md Present
.github/ISSUE_TEMPLATE/* Present
.env.template Present
Dockerfile.cloud and docker-compose.yml Present

Recommended Additions

Recommendation Purpose
docs/assets/readme/architecture-overview.png Public README architecture image for GitHub social previews and non-Mermaid consumers.
.github/FUNDING.yml Optional sponsorship metadata if the project accepts funding.
CITATION.cff Citation metadata for research and academic users.
GitHub repository topics Suggested: ai, llm, knowledge-graph, flask, nextjs, governance, compliance, enterprise-ai.

About

Enterprise AI platform built on the Universal Knowledge Graph (UKG), featuring a 17-Axis Knowledge Framework, 10-Layer Truth Engine, multi-agent orchestration, GraphRAG, MCP integration, explainable reasoning, governance, compliance, and end-to-end audit traceability.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors