What are LangChain dynamic subagents?

Dynamic subagents are a Deep Agents feature (LangChain, announced June 29, 2026) where the agent dispatches subagents from generated code instead of from one-at-a-time tool calls. LangChain gives the agent a code interpreter and, when subagents are configured, exposes a task() global inside it. The agent writes a short JavaScript program — for example, a loop that calls task() once per page of a 300-page document — and the interpreter fans out those subagents. Coverage and concurrency become properties of the code rather than of the model's turn-by-turn decisions. It runs inside a QuickJS interpreter (deepagents[quickjs]) and in the dcode coding agent.

Why is code-driven fan-out better than sequential tool calls?

With sequential tool calls the model must choose to call the tool once per item and remember to cover them all, in order — slow, and easy to skip an item. A written loop iterates the whole list, so every item is dispatched in a single pass and the tasks launch together instead of one after another. In LangChain's phrasing, 'coverage becomes a structural guarantee, not a prompt engineering problem.' The trade-off is that the agent is now running code, which is why it runs inside a QuickJS interpreter rather than the host process.

How does it relate to parallel subagents and orchestrator-workers?

It is the orchestrator-workers pattern with the orchestration written as code. Orchestrator-workers means one coordinator hands slices to worker subagents; dynamic subagents let the coordinator express that hand-off as a loop plus a task() call rather than a sequence of tool invocations. It pairs naturally with parallel-subagent execution (all dispatched tasks run together) and with subagent context isolation (each worker gets its own context window and returns a compact result).

LangChain adds dynamic subagents for code-driven orchestration — Programmatic subagent fan-out

Jargon

Deep Agents: LangChain's framework for building agents that tackle long, multi-step tasks. Dynamic subagents ship here, and in its terminal coding agent, dcode.
Dynamic subagents: Subagents whose number and inputs are decided at runtime by generated code, not fixed ahead of time. A loop over 300 pages spawns 300; a loop over 4 spawns 4.
Subagent: A fresh agent instance with its own context window, handed one slice of the work. It does the slice and returns a short result, so the parent's context stays small — see subagents as context isolation.
task(): The global the interpreter exposes for dispatch. Calling task(...) spins up a subagent; calling it inside a loop is what turns coverage into code.
QuickJS: A tiny, embeddable JavaScript engine. LangChain runs the agent's generated code inside it, not in your own process, so that code is contained rather than loose on the host. Installed via deepagents[quickjs].
Code interpreter (eval tool): A tool that lets the agent run code it just wrote rather than only calling named tools. Here the interpreter is the middleware that exposes task() to that code.
Fan-out: Launching many parallel workers from one place and gathering their results — the opposite of doing items one after another. In JavaScript, running the dispatched tasks together at once is the fan-out.
Sequential tool calls: The classic agent loop: one tool call per model turn. To cover 300 items the model has to choose to call the tool 300 times, in order — the baseline this feature replaces for fan-out work.

The news. On June 29, 2026, LangChain introduced dynamic subagents in Deep Agents. The idea: give the agent a code interpreter, and when subagents are configured, expose a task() global inside it. The agent then writes a short JavaScript program — loops, branches, a single fan-out — that dispatches subagents itself, instead of the model emitting one tool call per turn. The headline pattern runs one summarizer subagent per page of a 300-page document, all dispatched together. It runs inside a QuickJS interpreter (deepagents[quickjs]) and in the dcode coding agent. LangChain's framing: "Coverage becomes a structural guarantee, not a prompt engineering problem." Read the announcement →

Picture a dispatcher with 300 delivery stops. The slow way is to radio one courier, wait for them to come back, then radio the next — and to keep it all straight in your head, so if you lose count, stop 214 quietly never gets a courier. That is exactly how an agent covers 300 pages with sequential tool calls: one tool call per turn, the model itself deciding each time who to send next, and remembering to send all 300. The coverage of the whole job rides on the model not forgetting — which is why LangChain calls it a "prompt engineering problem."

Dynamic subagents hand the dispatcher a different tool: a route sheet they write once. LangChain gives the agent a code interpreter, and — when subagents are turned on — drops a single command into it, task(), that means "send a courier." Now the agent doesn't radio anyone; it writes a short program — for (const page of pages) task(summarize, page) — and runs it. Because the loop physically walks the whole list, every stop gets assigned in one pass, and because the calls are launched together rather than one-after-another, the couriers all leave at once. The interesting move is where the work lives: dispatching stops being a sequence of model decisions and becomes a property of a few lines of code.

That code doesn't run just anywhere. It runs inside QuickJS, a tiny, embeddable JavaScript engine. Running an agent's freshly written code inside a small, separate interpreter — instead of your own process — is how you scope what that code can reach, the same instinct as a route sheet that only says send a courier, not open the safe. Letting an agent write and run code is powerful, and it opens a real attack surface, so where the code executes is part of the design, not an afterthought.

Why the loop beats the list

Hold the job fixed at 300 pages and watch the two shapes diverge. With sequential tool calls, the model takes one turn per page: it reads the state, decides "summarize page 1," waits for the result, then decides page 2, and so on — up to 300 separate model turns, each one a full round-trip, and each one a chance to lose the thread. To put rough numbers on it (illustrative): if a single turn is ~2 seconds of the model just choosing what to do next, that is ~10 minutes of pure dispatch overhead before you count the summarizing itself. With dynamic subagents, the model takes one turn to write the script; the loop then issues 300 task() calls in a single pass, launched together. 300 model decisions collapse to 1, and as long as the loop runs over the full page list, coverage is 300 / 300 by construction — not by the model remembering. The parallel dispatch is the speed win; the loop is the coverage win.

Dimension	Sequential tool calls	Code-driven fan-out (dynamic subagents)
Who decides the dispatch	The model, one tool call per turn	A short program the model writes once
Coverage of N items	Rides on the model remembering all N	Structural — a loop written over the full list of N
Concurrency	One at a time, each waits on the last	Fanned out — all dispatched together
Model turns for N dispatches	~N turns	~1 turn (write the script)
Where the risk moves to	Forgotten items, slow serial chains	Running generated code — hence the QuickJS interpreter

The honest caveat is that this shifts the problem rather than deleting it. Fan-out is only free when the slices are independent — 300 page summaries don't need each other, but a task where step 2 depends on step 1 can't be sprayed out at once. And the loop is only a coverage guarantee when it actually iterates the right list; generated code can still be wrong. Letting an agent write and run code is also exactly why where it runs matters — the payoff of code-driven orchestration comes bundled with the duty to contain that code. What LangChain has really done is move coverage and concurrency out of the prompt and into a place a program can express them — a small, sharp idea, with the containment attached.

Goes deeper in: AI Agents → Workflow Patterns → Orchestrator-Workers + Subagents

Related explainers

Claude Opus 4.8 — Parallel-subagent dynamic workflows — the model-native cousin: parallel subagents built into the model, where wall-clock is set by the slowest subtask
SpatialClaw — code-as-action vs structured tool-calls — the same "write code instead of emitting tool calls" idea, applied to a single agent's own actions
Microsoft FastContext — explorer-subagent context offloading — why you reach for subagents at all: to keep the parent's context small

Frequently Asked Questions

Check what you knowMap your AI & GPU knowledge across every track — free, role-based