By Sagar Shankaran, Founder of CallSphere
Bandwidth runs its own US carrier network and added native OpenAI Realtime support in September 2025. Their BXML StartStream verb plus 2-way Media Streaming gives you carrier-grade pipes for AI voice without leaving the network you already paid for.
Key takeaways
Bandwidth has been the carrier-on-a-CPaaS in the US since 1999. They own the network, the LERG entries, the 911, and now the AI integration: September 2025 brought native OpenAI Realtime support under their "Bring Your Own AI" approach. For US-only enterprises that already buy 911 and DIDs from Bandwidth, plugging into OpenAI without a third-party hop saves real latency.
Bandwidth's voice product is BXML (Bandwidth eXtensible Markup Language), a Twilio-TwiML-style XML response language. The StartStream verb attaches a media stream to the call; the streamEventUrl receives Media Stream Started, Media Stream Rejected, and Media Stream Stopped callbacks. StartTranscription is a separate verb for live transcription with up to 4 concurrent track transcriptions per call.
Bandwidth's "Bring Your Own AI" expansion in September 2025 added direct support for OpenAI's Realtime API: a turn-key configuration that routes call audio through Bandwidth's network into OpenAI Realtime and back, without an intermediate WebSocket bridge. The latency claim is sub-200 ms because Bandwidth controls the IP path from carrier ingress to AI egress.
Hear it before you finish reading
Talk to a live CallSphere AI voice agent in your browser — 60 seconds, no signup.
graph LR
A[PSTN Caller] --> B[Bandwidth Carrier Network]
B -->|BXML response| C[Your App Server]
C -->|StartStream verb| B
B -->|wss bidirectional| D[Your WebSocket Server]
D -->|or direct hop| E[OpenAI Realtime]
E -->|audio back| D
D --> B
B --> A
F[StartTranscription verb] -.->|live tracks| G[Your Webhook]
<Response>
<StartStream
destination="wss://bridge.callsphere.ai/bandwidth-realtime"
name="ai-stream"
tracks="both"
streamEventUrl="https://callsphere.ai/api/bandwidth/stream-events"
streamEventMethod="POST">
<StreamParam name="tenant" value="abc123"/>
<StreamParam name="agent" value="intake"/>
</StartStream>
<Pause duration="3600"/>
</Response>
CallSphere terminates on Twilio across every product (Healthcare AI on FastAPI :8084 to OpenAI Realtime, Real Estate AI, Sales Calling AI with 5 concurrent outbound, Salon AI, IT Helpdesk AI, After-Hours AI with Twilio simul call+SMS 120-second timeout). 37 agents, 90+ tools, 115+ DB tables, HIPAA + SOC 2, $149/$499/$1499 plans, 14-day trial, 22% affiliate. Bandwidth is in our evaluation list because their carrier-owned path can shave 50-100 ms off the round-trip for US-domestic calls. For prospects in the IT Helpdesk vertical with strict 911-on-prem requirements, Bandwidth is the natural carrier; we maintain a reference BXML configuration that routes their calls through our standard agent stack with a thin StartStream-to-WebSocket adapter.
Bandwidth vs Telnyx for US AI voice? Both own carrier networks. Bandwidth is older with deeper E911 and number-porting muscle; Telnyx has shipped LiveKit-on-Telnyx and is faster on AI features. Pick on existing relationship.
Native OpenAI Realtime support means what exactly? A configuration option that routes the call audio through Bandwidth's IP network directly into OpenAI Realtime, without a customer-managed bridge.
Can I still use my own bridge? Yes. The native option is opt-in; default is BYOWebSocket.
Still reading? Stop comparing — try CallSphere live.
CallSphere ships complete AI voice agents per industry — 14 tools for healthcare, 10 agents for real estate, 4 specialists for salons. See how it actually handles a call before you book a demo.
HIPAA? Yes, Bandwidth signs BAAs on enterprise plans.
Pricing? Voice minutes plus a per-minute streaming charge; quote-based above the standard published rates.
Start a 14-day trial of our Twilio-based stack, see pricing, or contact us about Bandwidth integration for US-domestic high-volume tenants.
Written by
Sagar Shankaran· Founder, CallSphere
Sagar Shankaran is the founder of CallSphere, where he builds production AI voice and chat agents deployed across healthcare, hospitality, real estate, and home services. He writes about agentic AI, LLM engineering, and shipping voice agents that handle real calls in production.
See how AI voice agents work for your industry. Live demo available -- no signup required.
A founder's guide to the female voice generator landscape: AI female voices, Japanese voices, robot voices, and how CallSphere ships 57+ voices live.
A VoIP telephone number is a phone number that routes calls over the internet instead of copper lines. Learn what a VoIP number is, how to get one, what it costs, and how to pair it with an AI voice agent in 2026.
MOS 4.3+ is the band where AI voice feels human. Drop below 3.6 and conversations break. Here is how to measure, improve, and alert on MOS in production AI voice using G.711, Opus, and the underlying packet loss / jitter / latency math.
On May 4 2026 OpenAI published its Realtime stack rebuild — split-relay plus transceiver edge. Here is what changed and what it means for production voice agents.
OpenAI Realtime dominates production voice AI in 2026. Claude wins on analytics. Here's a task-by-task decision framework from a real voice agent stack.
Direct OpenAI Realtime + Agents SDK = thinner stack vs Vapi's vendor middleware layer. CallSphere ships voice agents in days, not sprints.
© 2026 CallSphere LLC. All rights reserved.