AnalysisSecurity & robustness

The agentic threat surface, translated for AI Officers

By GovCompass.ai· Last verified June 2026· Agentic security frameworks (OWASP ASI, MAESTRO, NIST/CAISI) are evolving rapidly.

The OWASP Agentic Security Initiative Top 10 catalogs the security risks that autonomous AI introduces. It is written for security engineers, but the risks are governance problems, because they describe what an agent can be made to do rather than what it can be made to say. This article translates the agentic threat surface into the language of controls an AI Officer owns, and maps each risk to the GovCompass pillar it stresses.

This is part of the Agentic AI element of the GovCompass-7.

Why the threat surface is different

Classic LLM security is about the text a model produces. Agentic security is about the actions an agent takes. The OWASP Agentic Security Initiative makes the point directly: securing agentic AI is a move from securing outputs to governing autonomous actions. An agentic risk often combines several classic LLM vulnerabilities and amplifies them, because autonomy means a vulnerability can be exploited at scale without a human in the path. Goal hijacking, for example, is prompt injection combined with excessive autonomy: the injection no longer just changes what the model says, it changes what the agent does.

The ten risks, as governance problems

The OWASP Top 10 for Agentic Applications identifies ten risk categories. Read as governance problems rather than exploits, they translate as follows.

Agent goal hijacking. An attacker redirects the agent's objective so it pursues a goal you did not set. Governance response: bounded objectives, input provenance controls, and detective monitoring that flags when an agent's behavior diverges from its mandate. Stresses security and human oversight.

Tool misuse and unintended execution. The agent calls a tool in a way you did not intend, executing an action with real-world effect. Governance response: scoped tool access, least-privilege tool permissions, and approval gates on high-consequence tool calls. Stresses security and accountability.

Identity and privilege abuse. The agent operates with more access than its task requires, or its identity is impersonated. Governance response: per-agent least-privilege identities, no shared credentials across agents, and scoped, time-bound access. Stresses security and privacy.

Agentic supply chain compromise. A component, a tool, a model, a sub-agent, is compromised upstream. Governance response: supply chain assurance for every tool and model an agent can reach, and an inventory of the agent's full dependency surface. Stresses security and accountability.

Unexpected code execution. The agent executes code, directly or through a tool, with effects you did not anticipate. Governance response: sandboxing, execution boundaries, and a deny-by-default posture on code execution. Stresses security and safety.

Memory and context poisoning. The agent's persistent memory is corrupted so that future behavior is shaped by planted content. Governance response: memory integrity controls, provenance on stored context, and detective monitoring for memory drift. Stresses security, safety, and fairness.

Resource exhaustion. The agent consumes resources, compute, API calls, budget, in a runaway loop. Governance response: rate limits, budget caps, and circuit breakers that halt a runaway chain. Stresses reliability and accountability.

Advanced prompt injection. Injection techniques tailored to agents, including injection through tool outputs and retrieved content. Governance response: input sanitisation across every channel the agent reads from, not just the user prompt. Stresses security and transparency.

Sensitive data disclosure. The agent leaks data it had legitimate access to, through an action or output. Governance response: output filtering, data-handling policy enforcement at the action level, and least-privilege data access. Stresses privacy and security.

Over-reliance on autonomous decision making. The organization grants the agent more autonomy than its reliability justifies. Governance response: progressive autonomy, escalation triggers, and a documented autonomy level matched to demonstrated reliability. Stresses human oversight and accountability.

How an AI Officer uses this

This list is not a security checklist to delegate. It is a control inventory for the security and oversight dimensions of agentic AI. The practical move is to take each agent in your inventory and run it against these ten risks, asking for each: which preventive control reduces it, which detective control surfaces it, which corrective control contains it. The gaps in that grid are the agentic security backlog, and they belong in the same risk register as the rest of your GovCompass-7 program, not in a separate security silo that the governance function never sees.

The framework landscape

OWASP is not alone. The MAESTRO threat-modeling framework from the Cloud Security Alliance provides a structured way to enumerate the agentic attack surface, and NIST and CAISI opened a formal process on AI agent security in early 2026. These converge on the same insight: agentic security needs its own threat model because the single-inference model of classic LLM security does not capture probabilistic behavior, runtime tool composition, persistent memory, and multi-agent delegation. For an AI Officer, the value is not in adopting one framework over another but in ensuring the controls they all point to are present, owned, and evidenced.

Legal referencesArt. 15

Share Share on LinkedIn

The agentic threat surface, translated for AI Officers

Why the threat surface is different

The ten risks, as governance problems

How an AI Officer uses this

The framework landscape

More on Security & robustness

Art. 51 EU AI Act: classifying a GPAI model as systemic risk

Art. 55 EU AI Act: obligations for systemic-risk GPAI providers

Agentic AI: governing actions, not just decisions