By Sagar Shankaran, Founder of CallSphere
A close look at the pitchbook builder template Anthropic shipped on May 5, 2026: model, tool stack, document flow, and where the human-in-the-loop sits.
Key takeaways
Of the ten finance agent templates Anthropic shipped on May 5, 2026, the pitchbook builder is the one investment bankers will notice first. Pitchbooks are the most repetitive, time-intensive, and standardized analyst work in IB. A reliable agent that produces a defensible first draft is the difference between a 30-hour task and a 3-hour review.
This piece breaks down the anatomy of the template: the model, the tool stack, the document flow, and where the human sits in the loop.
The template is anchored on Claude Opus 4.7, which leads the Vals AI Finance Agent benchmark at 64.37 percent. Opus 4.7 is the model for tasks where the depth of reasoning and the length of context matter more than per-call cost.
The model is responsible for:
The model does not directly render slides. That job belongs to the tooling layer.
A pitchbook builder is a tool-use agent. The tools are the verbs the agent can call. A representative stack:
Hear it before you finish reading
Talk to a live CallSphere AI voice agent in your browser — 60 seconds, no signup.
The agent chooses which tool to call at each step. The tools are the same regardless of which deal the agent is working on; the inputs and outputs change.
A typical pitchbook follows a standard outline. The template handles each section:
Each section is a sub-agent (or a step within the main agent) with its own tools and validation.
The template is not unsupervised. Three meaningful approval points:
The associate is not editing every page. The associate is approving the structural choices and the qualitative narrative. The numerical content is footnoted to source.
The Vals AI benchmark gives a directional sense of how often the end-to-end output is good without rework. A 64.37 percent score on Vals does not mean the pitchbook builder is right 64.37 percent of the time on every section.
Section-level reliability is higher than the end-to-end number, because each section has its own check. The overall workflow is robust when:
With those three reviews, the practical reliability is much higher than 64.37 percent. The benchmark measures pure agent autonomy; production use cases use targeted human review to compound model quality.
Still reading? Stop comparing — try CallSphere live.
CallSphere ships complete AI voice agents per industry — 14 tools for healthcare, 10 agents for real estate, 4 specialists for salons. See how it actually handles a call before you book a demo.
A typical 30 to 40-page pitchbook takes an associate one to two full days from a blank deck. With the template:
End-to-end, a one-day task becomes a half-day task, and the associate spends that half-day on judgment work rather than formatting.
CallSphere is an AI voice and chat agent platform for customer-facing communication. Pitchbook building is not in our scope; we operate at the customer-facing layer for healthcare, real estate, sales, salon and beauty, IT helpdesk, and after-hours escalation.
The reason this matters for CallSphere readers: the same agent architecture pattern is what makes a reliable voice agent work. Model selection, tool stack, structured document flow, and human-in-the-loop at meaningful boundaries.
Our voice agents use real-time speech models for low-latency conversation, plus around 14 function tools and 20 plus database tables behind the scenes. HIPAA-friendly architecture. 57 plus languages. Pricing: Starter $149 per month for 2,000 interactions, Growth $499 for 10,000, Scale $1,499 for 50,000. 3 to 5 business day launch with a free trial.
Book a demo to see the customer-facing analog of the same agent architecture pattern.
Q: Is the pitchbook builder unsupervised? No. The template assumes associate review at the comp set, narrative, and final-assembly stages.
Q: Can a boutique IB use this template? Yes. The template is available to banks and asset managers regardless of size.
Q: Does CallSphere generate documents? CallSphere generates call summaries, transcripts, and structured data per interaction, and pushes them into the customer's CRM or ticketing system. Pitchbook-shaped documents are out of scope.
Written by
Sagar Shankaran· Founder, CallSphere
Sagar Shankaran is the founder of CallSphere, where he builds production AI voice and chat agents deployed across healthcare, hospitality, real estate, and home services. He writes about agentic AI, LLM engineering, and shipping voice agents that handle real calls in production.
See how AI voice agents work for your industry. Live demo available -- no signup required.
Graphiti is the open-source temporal knowledge graph for AI agents in 2026. Learn how bi-temporal memory beats vector RAG for voice agents and long-running LLMs.
Reasoning models (Claude Mythos, o3, Opus 4.7, DeepSeek V4-Pro) for browser-side llms (webgpu) — a May 2026 comparison grounded in current model prices, benchmark...
Self-hosted on-prem stack for browser-side llms (webgpu) — a May 2026 comparison grounded in current model prices, benchmarks, and production patterns.
Reasoning models (Claude Mythos, o3, Opus 4.7, DeepSeek V4-Pro) for edge / on-device llm inference — a May 2026 comparison grounded in current model prices, bench...
Self-hosted on-prem stack for edge / on-device llm inference — a May 2026 comparison grounded in current model prices, benchmarks, and production patterns.
DeepSeek V4 vs Llama 4 vs Qwen 3.5 vs Mistral Large 3 for edge / on-device llm inference — a May 2026 comparison grounded in current model prices, benchmarks, and...
© 2026 CallSphere LLC. All rights reserved.