By Sagar Shankaran, Founder of CallSphere
Five proven multi-agent architecture patterns built on A2A — orchestrator, peer mesh, hub-and-spoke, marketplace, and tiered specialist.
Key takeaways
A2A — the agent-to-agent open standard Google donated to the Linux Foundation in 2026 — gives you a transport and a discovery model. It does not give you an architecture. The architecture is a design decision: how the agents are organized, who calls whom, how state is owned, how failures cascade.
This post catalogs the five A2A multi-agent patterns that show up most often in production agent systems in 2026.
flowchart TB
User[User / Front-End Agent] --> O[Orchestrator Agent]
O --> S1[Specialist Agent: CRM]
O --> S2[Specialist Agent: Scheduling]
O --> S3[Specialist Agent: Knowledge]
O --> S4[Specialist Agent: Voice]
A single orchestrator decides which specialist agent handles each step of a task. Specialists are domain-specific (CRM, scheduling, voice, knowledge). The orchestrator owns the plan and the global state.
Where it shines. Easy to reason about; one canonical decision-maker; easy to swap specialists.
Where it struggles. The orchestrator is a single point of failure and a latency bottleneck.
Use it when. You have clear specialist boundaries and predictable workflows.
flowchart LR
A[Agent A] <--> B[Agent B]
A <--> C[Agent C]
A <--> D[Agent D]
B <--> C
B <--> D
C <--> D
Agents call each other directly with no central coordinator. Each agent owns its slice of state; coordination is by mutual A2A handshake.
Where it shines. No single point of failure; resilient to one-agent outages.
Hear it before you finish reading
Talk to a live CallSphere AI voice agent in your browser — 60 seconds, no signup.
Where it struggles. Hard to reason about; emergent behavior; debugging is brutal.
Use it when. Agents are roughly symmetric in role and you want loose, resilient coordination.
flowchart TB
H[Hub Agent: Routing + Auth + Audit] --> S1[Spoke: Vendor A Agent]
H --> S2[Spoke: Vendor B Agent]
H --> S3[Spoke: Vendor C Agent]
U[User-Facing Agent] --> H
A hub agent owns the cross-vendor routing, auth, and audit layer. Spokes are vendor-provided agents (Workday, Salesforce, CallSphere). The user-facing agent only talks to the hub; the hub talks to spokes.
Where it shines. Clean separation between governance and business logic; centralized auth and audit; vendor independence.
Where it struggles. The hub itself has to be operationally excellent; latency overhead.
Use it when. You are an enterprise integrating many vendor agents and need a single governance plane.
flowchart TB
R[Registry] --> A1[Agent 1]
R --> A2[Agent 2]
R --> A3[Agent 3]
R --> A4[Agent N]
Buyer[Consumer Agent] --> R
Buyer -.discover.-> A1
Buyer -.discover.-> A2
Buyer --> A1
Agents publish Agent Cards to a registry. Consumer agents discover and call them via the registry. Think of this as the "open web" pattern of A2A — analogous to how mobile apps discover web services.
Where it shines. Maximum reach; vendor-agnostic on both sides; works with public registries.
Where it struggles. Trust, quality, and SLA enforcement need extra layers (signing, reputation, governance).
Use it when. You want a consumer agent to discover services it has no pre-existing relationship with — for example, a Hatch-style agent finding a salon's voice agent for a booking.
flowchart TB
F[Frontier / Reasoning Agent] --> T1[Tier 1: Cheap Worker Agent]
F --> T2[Tier 1: Cheap Worker Agent]
T1 --> Tool1[(Tool / MCP)]
T2 --> Tool2[(Tool / MCP)]
A high-cost reasoning agent (frontier model) orchestrates a fleet of cheap worker agents (smaller model, narrow task). The frontier model does the planning and judgment; workers execute deterministic subtasks.
Still reading? Stop comparing — try CallSphere live.
CallSphere ships complete AI voice agents per industry — 14 tools for healthcare, 10 agents for real estate, 4 specialists for salons. See how it actually handles a call before you book a demo.
Where it shines. Excellent cost-to-capability ratio at scale; explicit separation of "thinking" from "doing."
Where it struggles. Coordination overhead; worker failures can confuse the frontier planner.
Use it when. You have volume and a clear plan/execute split. Most production voice and chat agent platforms use this internally.
Heuristic decision rubric:
Most real systems use two or three of these in combination. A voice agent platform like CallSphere uses Tiered Specialist internally (frontier planner + cheaper worker models for intent classification, transcription, summarization) and exposes a Hub-friendly Agent Card so enterprise hub patterns can integrate it cleanly.
Three patterns to avoid:
CallSphere is an AI voice and chat agent platform — voice, chat, SMS, and WhatsApp, 57+ languages, 6 verticals, ~14 function tools, 20+ DB tables, $149/$499/$1,499 monthly tiers, HIPAA-friendly, 3–5 day launch.
Architecturally:
See how a CallSphere voice agent slots into a multi-agent stack — book a demo.
Q: Can I mix orchestrator and peer-mesh patterns? Yes, and most real systems do. A common shape: orchestrator at the top, peer mesh among specialists for resilience, with the orchestrator stepping in only on conflicts.
Q: How do I version Agent Cards in a marketplace pattern? Use semantic versioning on the card itself and publish both old and new versions at well-known paths. Consumers pin to a version range.
Q: Which pattern do consumer agents like Hatch use? Hatch and similar consumer agents are essentially marketplace consumers — they discover and call third-party services. As more services publish Agent Cards, the marketplace pattern is what they use to find them.
Written by
Sagar Shankaran· Founder, CallSphere
Sagar Shankaran is the founder of CallSphere, where he builds production AI voice and chat agents deployed across healthcare, hospitality, real estate, and home services. He writes about agentic AI, LLM engineering, and shipping voice agents that handle real calls in production.
See how AI voice agents work for your industry. Live demo available -- no signup required.
How we built a fault-tolerant HVAC emergency triage and tech-dispatch platform on Kubernetes — three-tier CQRS, 11 micro-agents on the OpenAI Agents SDK + LangGraph, NATS JetStream, DTMF/SMS/WebSocket acceptance, circuit breakers, and an evaluation pipeline that catches regressions before they wake a tech at 3 AM.
How to design a multi-agent system using MCP for tools and A2A for cross-vendor coordination, with a CallSphere voice agent as a participating node.
A2A is the open standard for agent-to-agent coordination. Here is how the Agent Card JSON works, how discovery happens, and what to publish.
MCP is agent-to-tool. A2A is agent-to-agent. Here is a clear 2026 decision guide for builders choosing between (and combining) the two protocols.
A no-fluff recap of the 7 biggest enterprise AI moves from Google Cloud Next 2026 — Gemini Enterprise, Agentspace, A2A, Gemini 3.1 Ultra, and more.
Google's May 2026 MCP 1.0 + A2A developers guide is the cleanest protocol picker we have seen. The takeaways, in plain English, with a CallSphere lens.
© 2026 CallSphere LLC. All rights reserved.