New command: caf.tech-ai-security — Agentic AI threat modelling and security

## Context

This issue follows from #1 (How to adapt architecture practice & security with AI and Agentic AI).

The current `ai-commands` toolkit covers AI/ML adoption through `caf.tech-principles` (Symbiotic coupling principle), `caf.ops-digital` (AI/ML investment plan), and `caf.ops-readiness` (security checklist). However, none of these commands specifically address the **security challenges introduced by agentic AI systems** — systems where an AI model takes autonomous actions, calls tools, and operates across trust boundaries.

## Problem

Agentic AI introduces a new class of architectural security concerns that existing CAF practices do not yet cover:

- **Prompt injection** — malicious inputs that hijack agent behaviour
- **Tool misuse** — agents calling APIs or executing code in unintended ways
- **Data exfiltration** — agents leaking sensitive data through tool calls or outputs
- **Loss of human oversight** — autonomous action chains that bypass review gates
- **Supply chain risks** — MCP servers, plugins, and third-party tools introduced into the agent loop
- **Non-determinism** — probabilistic outputs that break traditional security assumptions

These risks need to be assessed and documented as part of architecture practice, not treated as an afterthought.

## Proposed command: `caf.tech-ai-security`

A new command that produces an **Agentic AI Security Assessment** — a structured threat model and architectural controls checklist for systems that include AI agents, LLMs in production, or AI-assisted automation.

**Expected output:** `technology/ai-security-assessment.md`

**Scope:**
- Threat model for the agentic system in scope (inputs, tools, outputs, trust boundaries)
- Assessment against OWASP LLM Top 10
- Alignment with NIST AI Risk Management Framework (AI RMF)
- Human oversight mechanisms — where are the review gates, and are they sufficient?
- Fitness function candidates for AI-specific NFRs (response consistency, hallucination rate, tool call audit trail)
- Architectural controls per threat (input validation, output filtering, rate limiting, audit logging, kill switch)
- RAG status for Architecture Review Board

**References to incorporate:**
- [OWASP Top 10 for LLM Applications](https://owasp.org/www-project-top-10-for-large-language-model-applications/)
- [NIST AI RMF](https://www.nist.gov/system/files/documents/2023/01/26/AI%20RMF%201.0.pdf)
- [OWASP Agentic AI Threats](https://genai.owasp.org/llmrisk/)
- CAF Technology principle 4: Symbiotic coupling of machines and humans

## How to contribute

See [CONTRIBUTING.md](../blob/main/CONTRIBUTING.md) for the command template and file naming convention.

The command file should be placed at:
```
.claude/commands/caf.tech-ai-security.md
```

Please open a Pull Request referencing this issue.

## Acceptance criteria

- [ ] Threat model covers at least the 6 agentic AI risk categories listed above
- [ ] OWASP LLM Top 10 is addressed (assess / mitigated / not applicable per item)
- [ ] At least 3 fitness function candidates for AI-specific NFRs
- [ ] Human oversight mechanisms are explicitly assessed
- [ ] Output includes a RAG status suitable for an Architecture Review Board
- [ ] Command follows the standard CAF command template (Context, Objective, Output, Quality gates)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New command: caf.tech-ai-security — Agentic AI threat modelling and security #2

Context

Problem

Proposed command: `caf.tech-ai-security`

How to contribute

Acceptance criteria

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

New command: caf.tech-ai-security — Agentic AI threat modelling and security #2

Description

Context

Problem

Proposed command: caf.tech-ai-security

How to contribute

Acceptance criteria

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Proposed command: `caf.tech-ai-security`