Levels of AI Autonomy

Every vendor now ships an “autonomous agent.” Almost none of them mean the same thing by it. One product’s autonomy is a chatbot that calls two tools under a human’s nose; another’s is a system that opens pull requests and merges them unattended. When the word covers that much ground, it stops carrying information, and procurement, risk, and engineering end up talking past each other. The fix is the same one the automotive industry reached for when self-driving cars created exactly this confusion: a shared, level-based taxonomy of autonomy.

The Society of Automotive Engineers standardized the conversation about cars with SAE J3016, the Levels 0 through 5 you have seen on every autonomous-vehicle slide. The point of J3016 was never the technology; it was a common vocabulary that let engineers, regulators, and buyers agree on who is responsible when. Agentic AI needs the same thing, and the research community has converged on something close. This is our working version, with an opinionated view of where enterprises should stop.

Why borrow from self-driving#

The analogy is not decorative. Self-driving levels solved a specific problem: the dangerous middle. A car that drives itself ninety-five percent of the time but hands control back without warning is arguably more hazardous than one that never pretends to, because the human stops paying attention exactly when their attention is most needed. That failure mode, automation complacency at the handoff boundary, is precisely what kills agentic deployments too. An agent that is right most of the time trains its operators to rubber-stamp, so the one time it is catastrophically wrong, nobody is actually watching.

So the value of a levels framework is not bragging rights for reaching a higher number. It is forcing an explicit decision about the human’s role and the system’s authority at every step. As the Knight First Amendment Institute’s analysis of agent autonomy and the accompanying arXiv paper argue, autonomy is a design decision, not a capability score. A highly capable model can be deliberately run at low autonomy, and a weak one can be handed full autonomy over a trivially-scoped task. Conflating “smart” with “should run unsupervised” is the original sin here.

The five levels#

We frame the levels by the human’s role, which is the variable that actually governs risk. The framing of the human as operator, collaborator, consultant, approver, and observer comes directly from the user-centered taxonomy above; we have sharpened it for enterprise deployment.

Level 1 — Assisted (human as operator)#

The human does the work; the AI suggests. Autocomplete, a drafting assistant, a model that proposes a SQL query you run yourself. The agent has no authority to act. Every consequential action passes through human hands. This is where most “AI features” honestly sit, and there is no shame in it.

Level 2 — Augmented (human as collaborator)#

The agent executes scoped sub-tasks but the human stays in the loop turn by turn, steering and correcting. Think of an agent in a School ERP that drafts the term’s report-card comments while a teacher reviews and edits each one before anything is saved. The agent does real work; the human owns every output. The collaboration is continuous, not a single approval gate.

Level 3 — Conditional (human as approver)#

The agent plans and executes a multi-step task end to end, then pauses at defined checkpoints for human approval before anything irreversible happens. It books the travel but waits for sign-off before paying; it prepares the database migration but does not run it until you click. This is the most important level, because it is where most real production value lives today, and where most of the danger hides.

Industrial control panel with switches and an illuminated stop button

Level 3 is the dangerous middle. The approval gate only works if the approver genuinely evaluates, and the entire psychology of the level pushes toward not evaluating. Get this wrong and you have built a liability-laundering machine: a human nominally signs off, the system does the deciding, and when it fails everyone points at the rubber stamp. The cybersecurity field has independently landed on the same conclusion, that production-grade autonomous tooling tops out around conditional autonomy under human supervision, precisely because the validation and governance problem at this level is unsolved enough that going further is reckless.

Level 4 — High (human as observer)#

The agent operates unattended within a bounded domain, no per-task approval, and the human monitors aggregate behavior rather than individual actions. The human can intervene but is not in the per-decision loop. This is viable only when the domain is narrow, the actions are reversible or cheaply bounded, and the monitoring is real, with alerting, kill switches, and audit trails that someone watches. A Level 4 agent reconciling low-value invoices under a hard spending cap is reasonable. A Level 4 agent making clinical decisions in a Hospital Management System is not.

Level 5 — Full (no human)#

Open-ended autonomy across arbitrary domains with no human oversight. This does not responsibly exist for any consequential enterprise task in 2026, and treating it as a near-term target is a category error. Anyone selling Level 5 is selling Level 4 with the monitoring removed.

Where enterprises should draw the line#

Here is the opinionated part, because the framework is useless without a position.

Default to Level 3, and engineer the approval gate as if it were the product. The gate is not a checkbox; it is the most important component you will build. That means surfacing why the agent wants to act, not just what it proposes. It means showing the diff, the cost, the affected records, the reversibility, the confidence, in a form a busy human can actually evaluate in seconds. A good Level 3 gate makes the right decision easy and the wrong one effortful. A bad one trains complacency.

Earn Level 4 per-action-class, never per-system. Autonomy is not a property you grant a whole agent; it is a property you grant a specific action under specific conditions. The same agent might run unattended on “categorize this ticket” and require approval on “issue this refund.” Decompose the action space, set the autonomy level per class against blast radius and reversibility, and revisit it with real production data. This is the single most useful discipline we install on agentic projects.

Mechanical relay and terminal block on a DIN rail

Make reversibility the gating variable, not capability. The right question at every boundary is not “is the agent good enough” but “what happens when it is wrong, and how fast can we undo it.” Cheap-to-reverse actions can run hot; irreversible ones, money out the door, data deleted, a patient record altered, stay gated regardless of how impressive the model’s benchmarks look. Capability tells you how often it will be wrong. Reversibility tells you what it costs when it is. Only the second one bounds your liability.

Instrument every level for audit, including Level 1. The autonomy level tells you who decides; the audit trail tells you what was decided and why, after the fact, when something has gone wrong and someone is asking. Log the agent’s proposed action, the human’s response, the reasoning surfaced at the gate, and the eventual outcome, for every action class at every level. This is not bureaucracy. It is the only way to discover, with data, that a Level 3 gate has quietly degraded into a rubber stamp, or that a Level 4 action class is failing more often than its blast radius allows. The taxonomy sets the policy; the telemetry tells you whether the policy is actually holding in production. Without it you are flying on the assumption that your operators are vigilant, which is the assumption the dangerous middle is specifically designed to violate.

There is also an organizational point here that engineers tend to skip. The autonomy level of an action class is a decision with an owner, and that owner should be a named person, not a committee and not “the platform.” When the refund-issuing action class graduates from Level 3 to Level 4, someone signed off on that risk, and the audit trail should record who. This sounds like governance theater until the first incident, at which point the difference between “a human accepted this risk on this date” and “the system did it” is the difference between a postmortem and a lawsuit.

The taxonomy is a forcing function#

The levels are not a maturity ladder to climb for its own sake. Higher is not better; appropriate is better, and for most enterprise Operational Automation the appropriate ceiling is Level 3 with a few carefully-earned Level 4 action classes. The value of writing the level down, in the design doc, next to each action class, is that it converts a vague aspiration (“let’s make it autonomous”) into an accountable engineering decision with a named human, a defined authority, and an explicit blast radius.

That is the whole game. Not how autonomous you can make the system, but how precisely you can specify where the human stands and what the machine is allowed to do without them. Get the taxonomy right and the rest is engineering. Get it wrong and you have built something that fails exactly when the stakes are highest.

Drawing the autonomy line for an agentic system that touches money, records, or safety? We map it per action class, not per slogan. Let’s scope it.