Natural-Language Agent Harnesses

Link: https://arxiv.org/abs/2603.25723

Source: arXiv, March 2026

This paper explores an alternative direction in harness design: rather than specifying agent harnesses in code, it investigates what happens when you write them in natural language. The motivation is that code harnesses are powerful but brittle — small specification errors have outsized consequences, and iterating on them requires programming skill. Natural-language harnesses are more accessible, easier to audit by non-engineers, and potentially easier to auto-improve using LLMs that already reason well over language. The paper formalizes what it means for a natural-language specification to constitute a harness, and evaluates the tradeoffs against code-based alternatives.

The key tension the paper addresses is expressiveness versus reliability. Code is unambiguous but rigid; natural language is flexible but introduces interpretation variance. The authors find that for certain classes of tasks — those with well-defined evaluation criteria and moderate complexity — natural-language harnesses can match or approach code-based harness performance, while being substantially easier to iterate. The more interesting finding is that LLMs can propose improvements to their own natural-language harnesses more effectively than they can modify code harnesses, because they are operating in their native medium.

The practical relevance here is in the intersection with auto-improvement. If your harness can be expressed in natural language and an LLM can both evaluate it and propose edits, you have closed the loop without needing an agent that writes and executes code. For teams building AI systems without strong engineering resources, or for pipelines where the harness needs to be inspectable by domain experts rather than engineers, this paper opens a path worth understanding.