The Latticework · A Mental-Models Reading · May 2026
Field Note · Systems Design & AI · Anthropic Engineering

Brain from Hands.

A latticework reading of Anthropic Engineering on Managed Agents — how the old computing problem of "programs as yet unthought of" gets solved by decoupling the agent's reasoning from its execution environment, and what that reveals about abstraction, security, and anti-fragility.

[Figure: Managed Agents architecture diagram · Illustration: Anthropic Engineering]

60% · p50 TTFT drop
>90% · p95 TTFT drop
3 · Decoupled layers
0 · Credentials in sandbox
I · The Frame

What this post is really about.

The question that opens the Anthropic Engineering post is one of the oldest in computing: how do you design a system for "programs as yet unthought of"? It is the question that gave birth to operating systems — the insight that you cannot build infrastructure for every application that will ever run on it, so you build an abstraction layer that is stable enough for applications you cannot yet imagine. Managed Agents is Anthropic's answer to the same question, applied to AI agents: how do you build a hosting layer for harnesses, sandboxes, and models that don't exist yet?

The answer they arrived at is conceptually clean: decouple the three components of an agent — the brain (Claude + harness), the hands (sandboxes and tools), and the session (the append-only log) — so that each can fail, evolve, or be replaced without disturbing the others. This is not a new idea in systems design. It is the idea behind the Unix file abstraction, the virtual machine, and the REST API. What's new is applying it to agents, where the "application" is a large language model and the "hardware" is a combination of inference, execution environments, and external service integrations.

The mental-models pass through this post is rich. Every classic systems-design principle gets a fresh illustration. Some assumptions baked into tightly coupled agent architectures turn out to be wrong. And the post contributes several models that belong in the latticework.

II · The Reinforced

Models the architecture amplifies.

Reinforced · 01
Systems · Separation of Concerns

Each layer should own exactly one thing.

Separation of concerns is the architectural principle that each component of a system should be responsible for exactly one concern, and that concerns should not leak across boundaries. Managed Agents implements this three ways: the session owns durability; the harness owns the agent loop and tool routing; the sandbox owns execution and file state. When these were co-located in a single container, every concern leaked into every other. Decoupling made each layer debuggable, replaceable, and independently scalable.

"Decoupling the brain from the hands meant the harness no longer lived inside the container. It called the container the way it called any other tool: execute(name, input) → string."
Reinforced · 02
Systems · Redundancy & Resilience

Cattle survive; pets require nurses.

The pets-vs-cattle analogy is a classic in distributed systems: pets are named, hand-tended, irreplaceable; cattle are interchangeable, disposable, and easily replaced. Anthropic's coupled architecture adopted a pet: one container held the session, the harness, and the sandbox together. When it failed, everything failed. When the container was unresponsive, engineers had to open a shell inside a box that also held user data. Decoupling turned both containers and harnesses into cattle — replaceable by design, debuggable without touching user data.

"If the container died, the harness caught the failure as a tool-call error and passed it back to Claude. If Claude decided to retry, a new container could be reinitialized with a standard recipe: provision({resources}). We no longer had to nurse failed containers back to health."
Reinforced · 03
Security · Principle of Least Privilege

Credentials should never touch the sandbox.

The principle of least privilege says: give each component exactly the access it needs, no more. In the coupled design, Claude's generated code ran in the same container as service credentials — a prompt injection only had to convince Claude to read its own environment. The structural fix was architectural: auth tokens are either bundled into resources during initialization (Git clones with access tokens already wired in) or held in a vault that the sandbox cannot access. The proxy that calls external services holds the credential; Claude never sees it.

"The structural fix was to make sure the tokens are never reachable from the sandbox where Claude's generated code runs."
Reinforced · 04
Systems · Abstraction Layers

The read() call doesn't know what a disk pack is.

Operating systems lasted decades by virtualizing hardware into abstractions — process, file, socket — general enough for programs that didn't exist yet. The read() system call is agnostic to whether it's accessing a 1970s disk pack or a modern NVMe drive. Managed Agents follows the same pattern: execute(name, input) → string is agnostic to whether the "hand" is a Docker container, a phone, or (in the authors' words) a Pokémon emulator. The abstraction is designed to outlast the implementation.

"The harness doesn't know whether the sandbox is a container, a phone, or a Pokémon emulator."
Reinforced · 05
Systems · Latency & Throughput

Don't pay upfront for what you might not need.

In the coupled design, every session paid the full container setup cost before inference could begin — even sessions that would never touch the sandbox. Time-to-first-token (TTFT) included container provisioning, repo cloning, and process startup. Decoupling meant containers are only provisioned when a tool call requires them. Sessions that only need inference start immediately. The p50 TTFT dropped ~60%; p95 dropped over 90%. The architectural insight generalizes: don't initialize resources that might not be needed.

"So a session that didn't need a container right away didn't wait for one. Inference could start as soon as the orchestration layer pulled pending events from the session log. Using this architecture, our p50 TTFT dropped roughly 60% and p95 dropped over 90%."
III · The Contradicted

Models that tight coupling bends.

Bent · 01
Engineering · Co-location as Efficiency

Close proximity is not always the faster path.

The engineering intuition is that co-located components communicate faster — file edits are direct syscalls, no service boundaries to cross. Anthropic's initial architecture followed this logic. The problem was that co-location created a pet: a named, irreplaceable server that became a debugging nightmare. The alternative — a thin interface execute(name, input) → string — added a network hop but gained elasticity, debuggability, and the ability to swap implementations. The total latency was lower, not higher, because startup costs were eliminated for sessions that didn't need the sandbox.

Bent · 02
AI · Context Window as Agent Memory

The session is not the context window.

The standard model of LLM agents treats the context window as the agent's working memory — everything the agent knows is in the prompt. For long-horizon tasks, this breaks: the task exceeds the window, context must be discarded, and every discard is an irreversible decision. Managed Agents separates these concerns: the session log is a durable, append-only record that lives outside the context window. The agent can interrogate it via getEvents() — fetching positional slices, rewinding, rereading. The context window becomes a view into a larger store, not the store itself.

"In Managed Agents, the session provides this same benefit, serving as a context object that lives outside Claude's context window … durably stored in the session log. The interface, getEvents(), allows the brain to interrogate context by selecting positional slices of the event stream."
Bent · 03
AI · One Agent, One Environment

Many brains can share many hands.

The natural mental model for an AI agent is one-to-one: one model, one execution environment, one task. Early Managed Agents followed this: one container per brain. As tasks grew more complex, the constraint became apparent — the brain needed to reach into multiple execution environments simultaneously. Decoupling made each hand a named tool. Brains can now hold multiple hands (execute("container-A", ...), execute("phone-B", ...)), and brains can pass hands to one another. The one-to-one assumption was an artifact of implementation, not a requirement of the task.
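
A sketch of the many-to-many shape, with hand names following the post's examples; the Brain type and the handoff mechanism are illustrative:

```typescript
// Illustrative many-to-many composition over the hypothetical Hand interface.
type Brain = { name: string; hands: Map<string, Hand> };

async function crossDeviceTask(worker: Brain, reviewer: Brain): Promise<void> {
  // One reasoning loop reaches into two execution environments.
  await worker.hands.get("container-A")!.execute("bash", "make build");
  await worker.hands.get("phone-B")!.execute("ui_tap", "Open the test app");

  // Because hands are named tools rather than co-located processes, one
  // brain can pass a hand to another: a reference moves, not an environment.
  reviewer.hands.set("container-A", worker.hands.get("container-A")!);
  await reviewer.hands.get("container-A")!.execute("bash", "npm test");
}
```
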

IV · The New

Models worth adding to the latticework.

New · 01
Coined · Architecture & AI

Brain-Hand Decoupling.

The architectural principle of separating the reasoning component (brain: the model + its control loop) from the execution component (hands: sandboxes, tools, external services) through a thin, stable interface (execute(name, input) → string). The brain doesn't know what hands exist; the hands don't know what brain called them. This enables: independent failure recovery, independent scaling, composition (many brains, many hands, brains passing hands to each other), and security isolation (credentials stay with the resource, not the brain). Generalizes beyond AI agents to any system where reasoning and execution have different scaling profiles or security requirements.

"The harness leaves the container. Decoupling the brain from the hands meant the harness no longer lived inside the container."
New · 02
Coined · AI Systems

Harness Obsolescence Rate.

The observation that harnesses — the control loops wrapping LLMs — encode assumptions about model capability that go stale as models improve. A harness built for a model with "context anxiety" becomes dead weight on a model without it. The Harness Obsolescence Rate is the pace at which current harness assumptions need to be revisited as model capabilities change. The insight drives the Managed Agents design: build interfaces that are stable (session, sandbox, harness as separable concerns) and treat any specific harness implementation as ephemeral. A meta-harness is one that accommodates harnesses as yet unthought of.

"We expect harnesses to continue evolving. So we built Managed Agents: a hosted service … through a small set of interfaces meant to outlast any particular implementation — including the ones we run today."
New · 03
Coined · Security & Systems

Structural Credential Isolation.

The pattern of ensuring that credentials are architecturally unreachable from the untrusted execution environment — not by narrowing credential scope (which encodes assumptions about what the model can do with limited tokens), but by making the credential invisible to the execution environment entirely. Auth is bundled into resources during initialization (Git remote with token embedded), or held in a vault that the sandbox cannot query. The proxy pattern: Claude calls MCP tools via a proxy; the proxy fetches credentials from the vault; the sandbox never sees them. Generalizes to any system where generated or untrusted code runs alongside sensitive credentials.

V · The Field Card

When to reach for which.

Standing in front of an agent architecture decision, a distributed systems design, or a security review — which of these models tells you what to do?

VI · Coda

The meta-harness idea.

The most conceptually ambitious claim in the post is that Managed Agents is a "meta-harness" — not a harness for any particular task, but a system that accommodates harnesses for tasks not yet conceived. This is the operating-system ambition, applied to AI. It is also, at bottom, a claim about what should be stable in a rapidly changing system.

"We are opinionated about the shape of these interfaces, not about what runs behind them." — Anthropic Engineering, Managed Agents

The latticework lesson is older than AI: the most durable designs are those that identify the right level of abstraction — the level at which the interface can stay stable while implementations churn underneath. read() survived fifty years of storage technology changes. execute(name, input) → string is a bet that it can do the same for agent execution environments. Whether that bet pays off depends on whether the abstraction is at the right level — whether the true invariants of agent work are "a brain, some hands, and a session log" or something else entirely. That is the open question the post leaves on the table, honestly.

★   END   ★