By Sagar Shankaran, Founder of CallSphere
deepagents v0.5 ships harness profiles, async subagents, and Anthropic prompt caching baked in. We unpack when this opinionated harness beats raw LangGraph for production agents.
Key takeaways
TL;DR — LangGraph is the runtime; deepagents is the opinionated harness on top. Pick deepagents when you want Anthropic-style planning, virtual filesystem, subagents, and prompt-caching defaults shipped on day one. Pick raw LangGraph when your topology is custom enough that an opinion is in the way.
flowchart LR
User --> Triage["Triage / Supervisor"]
Triage -->|tool A| A["Specialist A"]
Triage -->|tool B| B["Specialist B"]
Triage -->|tool C| C["Specialist C"]
A --> Mem[(Shared memory · mem0/Letta)]
B --> Mem
C --> Mem
Mem --> Final["Final response"]deepagents (langchain-ai/deepagents) is a Python and TypeScript agent harness built on LangChain primitives and powered by the LangGraph runtime. It ships a planning tool, a virtual filesystem, the ability to spawn subagents, and — as of v0.5.0 alpha (March 2026) — async subagents, multi-modal support, and tuned Anthropic prompt-caching. The repo hit 9.9k stars within five hours of the v0.5 announcement, which tells you the demand for "LangGraph with batteries" was real.
LangGraph is the lower-level graph runtime. You build your own topology, manage your own checkpointer, and write your own planning logic. deepagents is what LangChain wishes everyone wrote on top of LangGraph: a default loop, default tools, default prompts, harness profiles that swap behavior per model vendor, and middleware hooks for human-in-the-loop, summarization, and storage.
Pick deepagents when:
Pick raw LangGraph when:
Hear it before you finish reading
Talk to a live CallSphere AI voice agent in your browser — 60 seconds, no signup.
The v0.5 alpha changes that matter for production:
CallSphere runs 37 specialist agents across 6 verticals, with 90+ tools and 115+ DB tables. Our Real Estate OneRoof deployment uses an OpenAI Agents SDK orchestrator + 10 specialist agents pattern; our IT Services UrackIT wraps deepagents-style planning with a ChromaDB RAG layer for past-ticket lookup. After-hours runs 7 agents with explicit escalation. We don't pick frameworks dogmatically — we pick whichever harness lets us hit p95 latency targets and keep the prompts auditable.
For new internal agents in 2026 we've started defaulting to deepagents when the workflow is long-running and offline (research, summarization, batch outreach), and to raw OpenAI Agents SDK or LangGraph when the workflow is conversational and latency-bound.
pip install deepagents or npm i deepagents.from deepagents import create_deep_agent, AnthropicProfile.deepagents accepts plain LangChain @tool decorators.StateGraph with create_deep_agent(tools=..., instructions=..., subagents=[...]).write_todos tool for built-in planning if you want the harness to expose its plan to the user.from deepagents import create_deep_agent
from langchain_core.tools import tool
@tool
def lookup_account(phone: str) -> dict:
"""Look up a CallSphere customer account by E.164 phone."""
# ...real DB call
return {"plan": "Growth", "mrr": 499}
researcher = {
"name": "researcher",
"description": "Pulls public web context on the caller's company.",
"prompt": "You are a research subagent. Return 3 facts and 2 risks.",
"tools": ["web_search"],
}
agent = create_deep_agent(
tools=[lookup_account],
instructions="You are CallSphere's inbound triage agent.",
subagents=[researcher],
)
result = agent.invoke({"messages": [{"role": "user", "content": "+18453884261 just called"}]})
When we ported one of our internal LangGraph workflows (an outbound research agent) to deepagents in March 2026, three things changed:
@tool decorators are LangChain-native; the harness re-uses them directly.create_deep_agent call.What we still wrote ourselves: the subagent prompts. The harness ships boilerplate that works, but production-quality subagent prompts are vertical-specific and worth investing in — a 200-token "you are a research subagent that returns 3 facts and 2 risks in JSON" pays for itself ten times over.
Still reading? Stop comparing — try CallSphere live.
CallSphere ships complete AI voice agents per industry — 14 tools for healthcare, 10 agents for real estate, 4 specialists for salons. See how it actually handles a call before you book a demo.
The big v0.5 selling point for Anthropic shops is the prompt-cache defaults. The harness places cache breakpoints around the system prompt and stable tool definitions automatically. In our outbound research agent, this dropped per-call cost on Claude Sonnet 4 from $0.012 to $0.0072 — a 40% drop driven entirely by cache hits on the system prompt. For high-volume agents (we run thousands of these per day on the GTM side) that compounds fast.
You don't need to do anything to enable it; just pick the Anthropic harness profile. The library ensures cache breakpoints land on stable boundaries, not in the middle of a tool call where they'd invalidate constantly.
Does deepagents replace LangGraph? No — it sits on top of LangGraph. Every deepagents agent is still a LangGraph graph under the hood, so checkpointers, streaming, and human-in-the-loop all still work.
Can I run deepagents with a model other than Claude? Yes. The OpenAI harness profile ships out of the box; community profiles exist for Gemini and DeepSeek.
Is the v0.5 alpha production-safe? Pin to a specific commit and run regression evals before you ship. The async subagents path is the youngest surface and should be feature-flagged.
How do we demo a deepagents-style workflow on CallSphere? Start a 14-day trial on the Sales product, mount our research subagent skill, and let it triage your inbound calls.
Written by
Sagar Shankaran· Founder, CallSphere
Sagar Shankaran is the founder of CallSphere, where he builds production AI voice and chat agents deployed across healthcare, hospitality, real estate, and home services. He writes about agentic AI, LLM engineering, and shipping voice agents that handle real calls in production.
See how AI voice agents work for your industry. Live demo available -- no signup required.
How we built a fault-tolerant HVAC emergency triage and tech-dispatch platform on Kubernetes — three-tier CQRS, 11 micro-agents on the OpenAI Agents SDK + LangGraph, NATS JetStream, DTMF/SMS/WebSocket acceptance, circuit breakers, and an evaluation pipeline that catches regressions before they wake a tech at 3 AM.
A three-way comparison of Gemini Enterprise, Anthropic managed agents and OpenAI Frontier Platform after Cloud Next 2026 — strengths, gaps, buyer fit.
Anthropic's May 2026 push positions Claude as a vertical platform for financial services. The strategic positioning versus OpenAI and Google.
ServiceNow Project Arc vs Anthropic Managed Agents — runtime, governance, integration, and use cases. The 2026 enterprise autonomous agent comparison.
May 2026's biggest agent-architecture shift: planning, tool selection, and self-correction move inside the model. Framework code shrinks. Here is what changes.
Anthropic and Moody's announced a data partnership in May 2026 that grounds Claude in audited financial reference data. Why grounding reduces hallucination and what it unlocks.
© 2026 CallSphere LLC. All rights reserved.
Watch how CallSphere handles real customer calls, schedules appointments, and processes payments — live.