AI Systems Architecture — Mastery3 / 9

Orchestration Patterns — Pipelines, Routers, Swarms

Once you have multiple steps or agents, how they're wired together decides cost, latency and reliability. Four patterns cover almost everything.

Published May 9, 20261 min readHaythem Rehouma · Claude Mastery

When work spans multiple steps or agents, the wiring — not the model — drives cost, latency, and reliability. Four patterns cover almost everything you'll build.

The four patterns

Pipeline — fixed sequence: step A's output feeds B feeds C. Predictable, easy to debug. Use when the path is known (extract → transform → summarize).
Router — a classifier picks the path: a cheap model triages the request to the right specialist or tool. Use when inputs vary widely (support intents, query types).
Parallel fan-out / fan-in — split independent work across workers, then merge. Use for N-files, N-sources, multi-perspective review. Wall-clock = slowest worker, not the sum.
Evaluator-optimizer loop — a generator produces, a critic scores, repeat until good enough. Use for quality-critical output where one shot isn't reliable.

Choosing

Default to the simplest pattern that fits: pipeline if the path is fixed, router if it branches, parallel only for genuinely independent work, loops only when one pass isn't enough. Composing them (a router into pipelines, a fan-out with per-item loops) handles the rest.

Patterns move data between steps. Next: what the system remembers between them — context and memory architecture.

The four patterns

Choosing

Related Claude skills you can install

Share this article

Series — AI Systems Architecture — Mastery

Keep learning

The Claude Mastery course