Does agentic AI need different governance?

No. It needs the existing governance model applied with more rigor. Autonomy does not create a new category; it widens the gap between what the system does and who is accountable, which makes the standard chain, control tracing to risk and forward to evidence, more important, not less. The actions are real and sometimes irreversible, so the stakes on each control rise.

Is "human in the loop" enough for EU AI Act compliance?

Not on its own. Article 14 requires that high-risk systems be designed so a human can effectively oversee them. A person positioned in the loop but unable to meaningfully intervene, through lack of authority, expertise, information, or time, does not meet that bar. The Act also ties the required oversight to the system's level of autonomy, so more autonomous agents demand more.

Who is accountable when an autonomous agent causes harm?

The deployer, in EU AI Act terms. Article 26 makes the deployer of a high-risk AI system accountable regardless of who built it, including assigning human oversight and retaining logs. Accountability does not transfer to the software because it acts on its own. The harder the agent is to trace, the more the burden falls on the deployer to show the controls, oversight, and evidence were in place.

AnalysisAccountability Human oversight

Agentic AI and governance: why autonomy sharpens the control question

By Michel Venniker· Last verified June 2026· References the EU AI Act and the GovCompass governance model.

Agentic AI does not need a new kind of governance. Autonomy widens the gap between what a system does and who is accountable for it, which makes the existing governance chain, control tracing to risk and forward to evidence, more important, not less. The actions are real and sometimes irreversible, so the stakes on each control rise.

The vocabulary of agentic AI is everywhere right now: harness, loop, context, memory, tool use, orchestration, guardrails, human in the loop. These terms describe how an agent works, the engineering that lets a model do tasks on its own rather than just respond to questions. They are worth knowing. But knowing what an agent is made of does not tell you how to keep it under control, and that is the question an organization has to resolve before it deploys one.

This article takes the common agentic vocabulary as a starting point and asks the governance question the vocabulary skips: when an AI system stops merely producing output and starts taking actions on its own, what happens to control, oversight, evidence, and accountability? The short version: autonomy does not create a new kind of governance. It widens the gap between what the system does and who is accountable for it, and that makes the existing governance chain more important, not less.

What changes when a system becomes agentic

A traditional AI system produces an output: a score, a classification, a piece of text. A human reads it and decides what to do. The decision, and the accountability for it, sits clearly with the person.

An agentic system narrows that gap or removes it. It does not just suggest; it acts. In the common vocabulary: tool use means the agent queries the database, sends the email, or updates the record itself; a loop means it repeats the act-check-decide cycle without a human approving each step; orchestration means several of these actions chain together across tools. The point of all this engineering is to let the system work without a person checking every step. That is exactly the capability that makes it useful, and exactly the capability that creates the governance problem.

Because here is what moves: in a traditional system, the human is the control point by default, standing between the output and the action. In an agentic system, that human is no longer automatically in the path. The control point does not disappear; it has to be designed in. If you do not design it, you have a system taking real actions with no one positioned to catch the one you cannot undo.

The control question: guardrails are controls, not features

In agentic vocabulary, guardrails are "the limits on what an agent is allowed to do." That is accurate as far as it goes, but it frames a guardrail as a product feature, a setting. In governance terms, a guardrail is a control: a deliberate constraint that traces back to a specific risk and produces evidence that it held.

The distinction matters because it changes how you reason about it. A feature is something you switch on. A control is something you can point to and say: this addresses that risk, it is preventive (it stops the action before it happens) rather than detective (it notices afterward), and here is the log that proves it was in force when the agent ran. The "something you can't undo" that everyone worries about with agents is, in governance language, an irreversible-harm risk, and an irreversible-harm risk is precisely the kind that demands a preventive control with evidence behind it, not a setting someone hopes is enabled.

This is the GovCompass model applied to agents: every guardrail is a control, every control traces to a risk and forward to evidence. Agents do not change that logic. They raise the stakes on it, because the actions are real and sometimes irreversible. (For the full chain, see what AI governance is.)

The oversight question: "human in the loop" is not automatically oversight

The vocabulary offers human in the loop: "a person approves before the agent does something that matters." This is the most over-trusted term in the agentic vocabulary, because the phrase describes a position and quietly implies it provides control, when those are not the same thing.

The EU AI Act, in Article 14, requires that high-risk AI systems be designed so they can be effectively overseen by people. The operative word is effectively. Putting a human in the loop satisfies the org chart; it does not satisfy Article 14 unless that human can meaningfully intervene. A person who has to approve forty agent actions an hour, with no real ability to evaluate each one, is not oversight. A person who approves because the interface nudges them toward "yes" and gives them no basis to say "no" is not oversight. That is automation bias wearing the costume of control, and the Act names it directly: Article 14 requires that overseers be able to stay aware of the tendency to over-rely on a system's output. It is one of the most common ways agentic deployments fail the effectiveness test.

Meaningful oversight needs four things the phrase "human in the loop" does not guarantee: the authority to override without reprisal, the expertise to judge the agent's action, the information to understand what the agent is about to do, and the time to evaluate it. Strip any one of those and the human is a rubber stamp, and the system is uncontrolled regardless of who is nominally in the loop.

Notably, the Act ties the weight of oversight directly to autonomy. Article 14 requires oversight measures to be commensurate with the risks, the level of autonomy, and the context of use. The law itself says, in other words, that the more autonomous the system, the more oversight it demands, which is exactly the point an agentic deployment cannot afford to miss.

The evidence question: evals are evidence, not just tests

In the vocabulary, evals are "a scored set of tasks you test an agent against, like unit tests for AI." The analogy is good for engineers. The governance reframing is that evals are evidence: not only a development-time check that the agent works, but part of the record that demonstrates, to a regulator or an auditor, that the system did what you claimed and was tested against the risks that matter.

The difference between a test and evidence is durability and purpose. A test tells you, today, whether the thing works. Evidence is what you can produce months later when someone asks you to prove the system was fit for use, behaved within its limits, and was monitored after launch. Agents make this harder, because their behavior depends on context and can drift, so the evidence cannot be a one-time snapshot. It has to be the kind of ongoing record that the governance chain is built to produce.

The accountability question: someone is accountable, regardless of autonomy

This is where agentic AI most tempts people into a category error: the more autonomous the system, the easier it is to talk as if the agent is responsible. It is not. Accountability does not transfer to software.

The EU AI Act is explicit on this for deployers. Article 26 makes the deployer accountable for the high-risk AI system it puts into use, regardless of who built it, including the duty to assign human oversight and to retain the system's logs. Autonomy does not dilute that. An agent that acts on its own is still an agent someone chose to deploy, configured, and is accountable for. The harder the system is to trace, the more the burden falls on the deployer to have built the controls, the oversight, and the evidence that show it was under control.

The practical consequence: as a system becomes more agentic, the question "who is accountable when this causes harm" does not get fuzzier; it gets more demanding. You cannot point at the autonomy and shrug. You have to point at the control that should have caught it, the oversight that should have intervened, and the evidence that shows whether either was in place.

Why this makes governance more important, not less

It is tempting to think that a system clever enough to act on its own needs less governing. The opposite is true. Every property that makes an agent useful, acting without a human in the path, chaining tools together, improving with memory, operating in a loop, also widens the distance between the action and the accountable person. Governance is what closes that distance: the control that constrains the action, the oversight that can stop it, the evidence that proves what happened, and the named person who is accountable for it.

Agentic AI does not need a new governance model. It needs the existing one applied with more rigor, because the actions are real, sometimes irreversible, and harder to trace. The vocabulary tells you how agents work. Governance tells you how to stay in command of them, and that is the part that decides whether deploying one is responsible.

Continue reading

Frequently asked questions

Does agentic AI need different governance?: No. It needs the existing governance model applied with more rigor. Autonomy does not create a new category; it widens the gap between what the system does and who is accountable, which makes the standard chain, control tracing to risk and forward to evidence, more important, not less. The actions are real and sometimes irreversible, so the stakes on each control rise.
Are guardrails the same as controls?: A guardrail is a control, but calling it a "feature" understates it. A feature is something you switch on; a control is a deliberate constraint that traces back to a specific risk and produces evidence that it held. Framing a guardrail as a control is what lets you reason about whether it is preventive, what risk it addresses, and whether you can prove it was in force.
Is "human in the loop" enough for EU AI Act compliance?: Not on its own. Article 14 requires that high-risk systems be designed so a human can effectively oversee them. A person positioned in the loop but unable to meaningfully intervene, through lack of authority, expertise, information, or time, does not meet that bar. The Act also ties the required oversight to the system's level of autonomy, so more autonomous agents demand more.
Who is accountable when an autonomous agent causes harm?: The deployer, in EU AI Act terms. Article 26 makes the deployer of a high-risk AI system accountable regardless of who built it, including assigning human oversight and retaining logs. Accountability does not transfer to the software because it acts on its own. The harder the agent is to trace, the more the burden falls on the deployer to show the controls, oversight, and evidence were in place.

Legal referencesArt. 14 Art. 26

Share Share on LinkedIn

Agentic AI and governance: why autonomy sharpens the control question

What changes when a system becomes agentic

The control question: guardrails are controls, not features

The oversight question: "human in the loop" is not automatically oversight

The evidence question: evals are evidence, not just tests

The accountability question: someone is accountable, regardless of autonomy

Why this makes governance more important, not less

Continue reading

Frequently asked questions

More on Accountability

Art. 10 EU AI Act: data and data governance for high-risk AI

Art. 12 EU AI Act: record-keeping and logging for high-risk AI

Art. 19 EU AI Act: keeping the automatically generated logs

Art. 26.1 EU AI Act: following provider instructions as a deployer

More on Human oversight

Art. 14 EU AI Act: designing high-risk AI for human oversight

Art. 26.2 EU AI Act: human oversight of high-risk AI

Art. 27 EU AI Act: Fundamental Rights Impact Assessment (FRIA)

Art. 4 EU AI Act: AI literacy obligations for organizations