Agentic AI security curriculum · Security overview
Threat Modelling for Agentic AI: STRIDE, Attack Trees, and Living Threat Models
STRIDE, Attack Trees, and Living Threat Models
Hello and welcome to Module 22!
Modules 1–21 have given us a complete technical security stack: trusted images, admission control, zero-trust networking, runtime enforcement, sandboxing, immutable memory, GPU isolation, automated response, and secure development pipelines. Now we step back and ask the most important question of all: “Have we actually defended against the right threats?”
Standard application threat models assume simple request-response architectures with a human in the loop. Agentic AI systems are fundamentally different — they act autonomously across many steps, maintain persistent memory, delegate to sub-agents, and execute real-world actions. In this module we build a living threat model tailored to agentic platforms using an extended STRIDE framework, attack trees, and a defined update cadence so the model stays accurate as the platform evolves rather than becoming obsolete the week after it is written.
Why Agentic AI Requires a Distinct Threat Model
Traditional threat models assume:
-
A human initiates every request.
-
The system is stateless between requests.
-
The main risk is in the request itself.
Agentic platforms break all three assumptions:
-
Agents act autonomously across many steps without human intervention.
-
They maintain long-term memory and state.
-
They form dynamic pipelines with delegation and inter-agent communication.
This creates new threat categories that do not appear in standard STRIDE templates:
-
Goal hijacking
-
Memory poisoning
-
Inter-agent lateral movement
-
Model inversion
-
Indirect prompt injection
We must extend STRIDE and make the threat model a living document that evolves with the platform.
STRIDE Applied to Each Component
We systematically apply STRIDE (Spoofing, Tampering, Repudiation, Information Disclosure, Denial of Service, Elevation of Privilege) to every major component, highlighting how the categories change in an agentic context.
Model / Inference Layer
-
Spoofing: Model inversion via repeated queries.
-
Tampering: Weight tampering or adversarial examples.
-
Information Disclosure: Model inversion attacks that extract training data.
Memory Store
-
Tampering: Memory poisoning via direct write, RAG retrieval, or post-write modification.
-
Information Disclosure: Unauthorized recall of confidential entries.
-
Repudiation: Merkle chain manipulation to hide changes.
Tool Call Gateway
-
Elevation of Privilege: ATR bypass (forged JWT, delegation expansion, scope drift).
-
Tampering: Injection via tool results.
-
Spoofing / Elevation: Replay attacks.
ClawHub Skills
- Spoofing / Tampering / Elevation: Malicious skill in trusted position, dependency confusion.
Inter-Agent Communication
-
Spoofing: Forged orchestrator instruction.
-
Tampering / Repudiation: Fabricated subagent result.
Operator Plane
-
Spoofing / Elevation: Admin credential theft or insider threat.
-
Tampering / Repudiation: Break-glass abuse.
Each threat is mapped to current controls and residual risk.
Attack Trees
The attack tree is the most practical artifact we maintain.
Root node: “Agent executes unauthorized action”
Major branches:
-
Branch 1: Prompt injection (direct, indirect, multi-step, split-payload)
-
Branch 2: ATR bypass (forged JWT, delegation expansion, scope drift)
-
Branch 3: Memory poisoning (direct write, retrieval poisoning, post-write tamper)
-
Branch 4: Supply-chain compromise (base image, skill, model weight, dependency)
-
Branch 5: Operator compromise (credential theft, insider threat, break-glass abuse)
Each leaf node maps to one or more controls. Any leaf without a corresponding control is a documented gap with a remediation timeline.
The Living Threat Model
A threat model written once at launch is a historical document within weeks. Ours is living.
Update triggers (any of these automatically starts a review):
-
New agent pipeline deployed
-
New ClawHub skill category approved
-
New external API integration
-
Post-incident review finding
-
New OWASP Agentic Top 10 guidance published
Update process:
-
Review the attack tree against the change.
-
Identify new branches or modified leaves.
-
Assign or update controls.
-
Document residual risk.
-
Store the updated model in git with a signed commit.
Threat model review is a standing agenda item in every quarterly security review (Module 25). Red-team exercises (Module 24) must incorporate findings into the model within 14 days.
Threat Model Documentation Format
The model lives in git alongside infrastructure code as version-controlled YAML.
Each threat entry contains:
-
id: Unique identifier (e.g., TM-AGENT-004)
-
stride_category: One or more of S/T/R/I/D/E
-
affected_component: e.g., “memory_store”, “pipeline_orchestrator”
-
attack_path_description: Clear narrative
-
current_controls: List of modules/controls that mitigate it
-
residual_risk: Assessment and acceptance (must be reviewed quarterly)
-
last_reviewed: Date + reviewer
Control gaps are explicitly documented with remediation plans and owners.
Key Takeaways (Memorize These!)
-
Agentic AI introduces threat categories (goal hijacking, memory poisoning, inter-agent lateral movement) that don’t exist in standard STRIDE templates — the template must be extended.
-
The attack tree is the most useful artifact: it makes the mapping from threat to control explicit and makes gaps visible.
-
A living threat model with defined update triggers is a fundamentally different artifact from a point-in-time assessment.
-
Residual risk must be explicitly accepted and documented — undocumented residual risk is the same as unmitigated risk from a governance perspective.
You now have a living, agentic-specific threat model that stays current as the platform evolves. It is no longer a dusty document on a shelf — it is an active security control that drives architecture decisions, red-team exercises, and quarterly reviews. This is the strategic layer that ensures all the tactical controls in Modules 1–21 are aimed at the right threats.
