← Wiki

Anthropic Designs Three-Agent Harness for Long-Running Full-Stack AI Development

Link: https://www.infoq.com/news/2026/04/anthropic-three-agent-harness-ai/

Source: InfoQ, April 2026

This piece covers Anthropic's published design for a three-agent harness architecture intended for long-running, full-stack development tasks. The structure is an orchestrator agent that manages the overall plan and delegates work, a subagent that executes individual tasks (writing code, running commands, searching the web), and a verifier agent that independently checks the outputs before they are accepted. The orchestrator does not verify its own work — verification is a separate role with a separate context, reducing the blind spots that come from self-checking. This is Anthropic's own answer to the question of how to make agents reliable enough to operate over hours rather than minutes.

The three-role separation is the architecturally interesting part. Most agent frameworks still treat the model as a single entity that both acts and checks its own actions. Anthropic's design treats verification as a first-class citizen that must be structurally independent — not just a final prompt asking "did you do this right?" The harness manages the handoffs, the shared state, and the conditions under which human approval is required. The result is a system that can run autonomously on complex tasks while maintaining a meaningful audit trail.

This is worth reading because it reveals how the company behind the most widely-used frontier model is designing its own production agent workflows internally. The three-agent pattern is likely to become a reference architecture that others adopt. For anyone building systems that use Claude (or any frontier model) for multi-step tasks, understanding this structure gives you a concrete template to evaluate your own orchestration design against.