Agent Operations Design Notes (7/9) — How to Design an Approval Loop

6월 02, 2026

The more work you delegate to agents, the less sufficient raw permissions become. It is not enough to say that some actions are allowed and others are blocked. There is a third layer in between: when should a human step in, what exactly should be reviewed, and with what information should that decision be made? That middle layer is the approval loop.

핵심 요약

An approval loop is not anti-automation. It is a structure that moves hard-to-reverse actions in front of human judgment earlier.
Approval targets are better grouped by blast radius, reversal cost, external trace, and privilege change than by technology category.
Putting a human into every step is not good design. If ask is overused, important approval signals disappear.
Even if Managed Agents absorb more of the execution layer, approval criteria still remain a local operating policy.

1. Why approval loops matter again

Older assistant systems mostly suggested answers or drafts. Humans read the result and then acted themselves.

Agents are different. They can edit files, run commands, operate browsers, and send things outward. From that point on, we can review it later stops being enough.

Some actions should be inspected before execution. Some actions are fine with logs after the fact. Some actions should never proceed at all.

That is what approval loop design is really about.

If permission design cuts the system into allow / ask / deny, the approval loop is what turns ask into a real operating procedure.

2. Approval targets should be grouped by reversal cost, not by tool type

Teams often start with tool categories:

shell is risky
browser is risky
file editing is risky

That looks tidy, but it breaks quickly.

The same surface category can carry very different consequences.

Action	Surface appearance	Reversal cost
editing a draft file	write	low
editing a publish helper	write	high
updating internal notes	write	low to medium
publishing externally	transmission / publication	very high
touching credentials	read	very high

So approval loops are usually better designed around:

is it easy to undo
does it create an external trace
does it touch protected assets
does human judgment actually add value
can logs alone provide enough safety

3. Read, edit, execute, and externally send are not the same permission class

A good approval loop starts by splitting actions more carefully.

At minimum, it is usually worth separating:

read
narrow local edit
code or command execution
external network transmission
publication or deployment
privilege escalation or protected asset access

This matters because approval frequency and approval quality are shaped here.

If reading internal documents and publishing externally sit inside the same approval logic, the system usually ends up either too noisy or too weak.

Permissions answer what is possible. Approval loops answer when human review is required even if an action is technically possible.

4. Good approval loops do not put humans everywhere

Many teams respond to risk by inserting human approval into every meaningful step. That usually creates two failures:

humans become the throughput bottleneck
approval quality falls because the review becomes routine

If approvals arrive too often, people stop reading them carefully. At that point, the system still has a formal approval step, but the real value is gone.

A strong approval loop reduces:

meaningless asks
repeated asks of the same low-value kind
approval on actions that are easy to recover after the fact

And it strengthens:

approval on irreversible actions
approval on external sends
approval on wide-scope modifications
approval on privilege escalation

The goal is not use more humans. It is use human judgment only where it matters most.

5. What information should an approval request include

Approval quality is determined less by whether a popup appears and more by what the reviewer can actually see.

A useful approval request should usually include:

Field	Why it matters
what is being attempted	confirms purpose
why it is needed now	checks whether the step is really necessary
what will change	shows impact scope
how it can be reversed	shows recovery path
what the alternative is if rejected	prevents approval from becoming the lazy default

If the human sees only an approve button, the loop is close to meaningless.

6. The bigger problem is not approval delay, but missing approval criteria

Approval delay is real, but the larger problem is often the absence of clear criteria.

Without criteria:

different reviewers make different decisions
similar actions are approved one day and rejected the next
the agent cannot learn a stable operating boundary
post-incident review cannot explain why approval happened

That is why an approval loop usually needs policy before it needs UI.

For example:

some actions are default allow
some are always approval-gated
some are default deny
some are conditionally allowed under narrow rules

7. Why Managed Agents cannot solve approval design for you

Managed Agents can help with long-running execution, common tool access, and some observability. They do not decide your approval philosophy.

The reason is simple: approval rules are organizational judgment.

one team may require approval for any external send
another may care far more about modifying operational scripts
another may treat publication as the highest-risk event

So approval loops may be offered as platform features, but approval policy itself remains local.

8. Five questions to answer before rolling an approval loop out

Which actions are the most irreversible?
Which actions are safe enough for post-hoc logging only?
Which actions are not improved by human review anyway?
Which asks become dangerous if they appear too often?
If approval is rejected, what fallback should the agent take?

Without answers to those questions, approval loops usually degenerate into a pile of popups.

9. A quick health check for approval-loop fatigue

approval prompts appear so often that humans nearly auto-approve them
approval requests do not explain purpose or scope
there is no fallback path when approval is denied
similar actions are handled inconsistently
post-review cannot explain why something was approved

If several of these are true, the system is probably over-asking and under-designing.

10. Conclusion: approval loops are not about late human intervention, but about timely human intervention

Approval loops can look like anti-automation if they are designed badly. In strong systems, the opposite is true.

A good approval loop:

removes low-value asks
sharpens the truly important asks
catches irreversible actions earlier

So the deeper question is not where do we put a human. It is which part of the operating boundary the organization is willing to let automation own by itself.

References

Anthropic, Permissions management
Anthropic, Hooks reference
OpenAI Agents SDK, Guardrails
OpenAI, The next evolution of the Agents SDK, 2026-04-15

Series overview: Series index

이 블로그 검색

MaJu Tech Notes