When One Agent Isn’t Enough: Claude Now Builds Its Own Team of Agents on the Fly

TL;DR

Anthropic’s Claude Code team has described dynamic workflows, a feature that lets Claude write a task-specific JavaScript harness to coordinate temporary subagents. The company frames it as useful for complex, high-value work, while warning that it can consume far more tokens than a single-agent task.

Anthropic’s Claude Code team says Claude can now create dynamic workflows: task-specific orchestration code that coordinates temporary subagents for complex work, a change aimed at reducing failures that can appear when one agent handles a large assignment alone.

The feature, described by Anthropic as “a harness for every task”, allows Claude to write a small JavaScript program around the model. That harness can spawn agents, assign focused briefs, wait for results, compare outputs and synthesize a final answer.

According to the source material, Anthropic presents several reusable patterns: classify-and-act, fan-out-and-synthesize, adversarial verification, generate-and-filter, tournament-style judging and loop-until-done workflows. These are intended for tasks that are large, parallel, judgment-heavy or require independent checking.

Anthropic also cautions that the approach uses meaningfully more tokens and is meant for complex, high-value tasks. The company’s framing makes clear that dynamic workflows are not intended for simple edits or routine requests where a single agent is enough.

At a glance
announcementWhen: Anthropic published the source blog pos…
The developmentAnthropic has published details on Claude Code dynamic workflows, which let Claude assemble and coordinate subagents during a single task.
AI Dispatch · Insights · 1 July 2026

When one agent isn’t enough: Claude now builds its own team on the fly

Skills package what you know; loops decide how far you delegate over time. Dynamic workflows are the third axis — within a single task, Claude writes its own harness and assembles a temporary team of subagents. Think of it as Claude drawing an org chart for one job.

Why one agent grinding alone underdelivers
Agentic laziness
Declares done on partial work — 35 of 50 review items.
Self-preferential bias
Grades its own homework — likes what it already produced.
Goal drift
Loses the original objective across turns, especially after context is summarized.
These are the failure modes of one person doing a huge job alone. The cure is the manager’s: divide the work, give isolated briefs, and have someone independent check it.
The harness — an org chart Claude writes for one task
Orchestrator
Claude writes a JS harness on the fly
▼   fan out   ▼
Subagent
own context · model
Subagent
own worktree
Subagent
focused goal
Subagent
isolated
✕ adversarial verify
✕ adversarial verify
✕ adversarial verify
✕ adversarial verify
▼   barrier: wait for all   ▼
Synthesize
merge structured outputs
→ Result
one verified answer
Each subagent gets a clean context window and can run on a cheaper or smarter model — so no single overloaded context gets lazy, biased, or lost. Resumable if interrupted.
The six moves it composes
Classify-and-actroute by task type (switchboard)
Fan-out-and-synthesizeparallel agents → a barrier merges (map/reduce)
Adversarial verificationa separate agent attacks each result
Generate-and-filterbrainstorm wide, keep only survivors
Tournamentagents compete; pairwise judging > scoring
Loop-until-donespawn until a stop condition, not a fixed count
Where it earns its keep — often away from code
Big migrations & refactors Deep research → cited report Fact-check every claim Rank 1,000 tickets by severity Root-cause post-mortems (“why did sales drop?”) Triage a backlog at scale Design/naming by rubric Model routing
One security pattern to memorize — quarantine: agents that read untrusted public content are barred from high-privilege actions; a separate agent does the acting. Separation of duties for autonomous agents.
The take

The shift is from prompting a worker to commissioning a team — more output, more cost, and a manager’s judgment required. Reach for a workflow when a task is big, parallel, adversarial, or judgment-heavy — and when you can feel a single agent getting lazy, grading its own homework, or losing the plot. Bound it (token budgets, pilot first) — workflows can spawn hundreds of agents and burn far more tokens. For everything else, don’t hire five people to change a lightbulb.

Source: “A harness for every task: dynamic workflows in Claude Code,” Thariq Shihipar & Sid Bidasaria (Anthropic), Claude blog, 2 June 2026. Mechanics, patterns & use cases are Anthropic’s; the “org chart” framing is the author’s. A recent, still-evolving feature. Docs: code.claude.com/docs.
thorstenmeyerai.com

A Shift Toward Agent Teams

The development matters because it moves Claude Code from a model acting as a single worker toward a model that can act more like a temporary project manager. Instead of keeping one long-running context responsible for planning, execution and review, the system can split work across agents with separate context windows.

That design targets known weak points in agentic work. The source material identifies agentic laziness, self-preferential bias and goal drift as risks when one agent handles a long assignment. Anthropic’s proposed answer is separation of duties: give agents narrower jobs and have independent reviewers challenge the output.

For developers and teams using Claude Code, the impact could be practical rather than cosmetic. Large refactors, security reviews, research reports, backlog triage and root-cause analysis can involve many separable judgments. Dynamic workflows may help organize that work, though the gains depend on task design, token budgets and review quality.

Amazon

AI workflow automation tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Claude Code’s Workflow Arc

The Thorsten Meyer AI source frames dynamic workflows as the third part of a loose sequence from Anthropic’s Claude Code team. In that framing, skills package organizational knowledge, loops decide how far to delegate over time, and dynamic workflows coordinate multiple agents inside one task.

The underlying Anthropic blog was published on June 2, 2026 by Thariq Shihipar and Sid Bidasaria. The follow-up coverage on July 1, 2026 emphasized the “org chart” analogy: Claude can create a dispatcher, specialists, reviewers or judges for a job, then discard that structure when the task is finished.

The security example in the source material centers on quarantine. Agents that read untrusted public content can be barred from high-privilege actions, while a separate agent performs the action. That pattern reflects a broader software principle: systems handling untrusted input should be isolated from systems allowed to make changes.

“A harness for every task: dynamic workflows in Claude Code”

— Anthropic blog title

Amazon

AI subagent orchestration software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Costs And Limits Remain Open

It is not yet clear from the source material how often dynamic workflows outperform a carefully managed single-agent process in everyday use. The reported benefits are strongest for large or adversarial tasks, but the tradeoff is higher token consumption and more orchestration complexity.

Several operational questions remain open for teams evaluating the feature. Users will need to decide how to set token budgets, when to stop spawning agents, how to audit generated harnesses and how much trust to place in agent-generated reviews. Anthropic’s caveat suggests cost control will be a central part of adoption.

Amazon

JavaScript AI task manager

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Teams Test High-Value Use Cases

The next phase is likely practical testing inside Claude Code workflows. Developers may begin with pilot tasks such as codebase migrations, claim-checking, large ticket triage or security review, where parallel work and independent verification have clearer value.

For now, the safest reading is limited: dynamic workflows give Claude Code a way to coordinate temporary agent teams, but users still need to define the job, bound the cost and review the result. The feature’s wider value will depend on how reliably those workflows perform under real project pressure.

Amazon

AI model coordination platform

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

What did Anthropic announce?

Anthropic described dynamic workflows in Claude Code, allowing Claude to write a task-specific JavaScript harness that coordinates subagents during complex tasks.

Is this meant for everyday coding tasks?

No. The source material says the feature is meant for complex, high-value work and can use many more tokens than a single-agent request.

What kinds of tasks could benefit?

Examples include large refactors, deep research, security reviews, backlog ranking, post-mortems and other jobs that are parallel or judgment-heavy.

What is still uncertain?

The main open questions involve cost control, reliability, review quality and when dynamic workflows beat a simpler single-agent process.

Source: Thorsten Meyer AI

You May Also Like

No Way to Make a Deal

President Trump expected to announce a deal with Iran over nuclear talks but has now conceded no agreement is imminent amid stalled negotiations.

Republicans to hold their first-ever midterm convention in Dallas, Trump announces

President Trump announced Republicans will hold their first-ever midterm convention in Dallas on September 9-10, highlighting a historic GOP event during an election year.

Poll: Democrats maintain an edge in the fight for Congress as Trump gets poor marks

A new NBC poll indicates Democrats lead Republicans 49% to 44% in congressional preference, amid Trump’s declining approval ratings.

Recovery-percentile tracker for orthopedic surgery patients

A new recovery-percentile tracker for post-op orthopedic patients is being tested in pilot clinics to reduce patient calls and improve recovery monitoring.