By Sagar Shankaran, Founder of CallSphere
LiveKit Agents went 1.0 in April 2025, hit 1.5 by April 2026, and shipped native SIP plus phone numbers so you no longer need a Twilio bridge. Add MCP tool support and adaptive interruption handling and it is the default open-source telephony framework for AI in 2026.
Key takeaways
LiveKit Agents shipped 1.0 in April 2025, then iterated quarterly through 1.5 by April 2026. Native SIP support means LiveKit can be your phone-system end-to-end without a Twilio bridge in the middle. Add MCP-compatible tool calls and adaptive interruption handling and it is the default open-source telephony framework for new AI voice projects.
LiveKit started as a WebRTC SFU and pivoted hard into AI voice in 2024. The Agents framework lets any Python or Node.js program join a LiveKit room as a real-time participant. Agents 1.0 (April 2025) standardized the API; 1.x added SIP integration so the room can dial out to a phone number or accept inbound SIP calls; the MCP work in late 2025 made tool calls portable across providers; adaptive interruption (1.4-1.5) tunes barge-in sensitivity per voice and per noise level.
The SIP service is open-source (livekit/sip on GitHub). It accepts inbound trunks, places outbound calls via SIPParticipant, supports DTMF and SIP REFER. Combined with Agents, you write one Python file and have a phone number plus an AI agent.
Hear it before you finish reading
Talk to a live CallSphere AI voice agent in your browser — 60 seconds, no signup.
graph LR
A[PSTN Trunk] --> B[livekit/sip Service]
B -->|inbound dispatch rule| C[LiveKit Room]
D[Agent Worker] -->|join room| C
D --> E[STT plugin]
E --> F[LLM with MCP tools]
F --> G[TTS plugin]
G --> D
H[Outbound CreateSIPParticipant] --> B
B --> A
from livekit import agents, rtc
from livekit.plugins import openai, deepgram, cartesia, silero
class IntakeAgent(agents.VoiceAgent):
def __init__(self):
super().__init__(
instructions="You are a healthcare intake assistant.",
stt=deepgram.STT(),
llm=openai.LLM(model="gpt-4o-realtime-preview"),
tts=cartesia.TTS(),
vad=silero.VAD.load(),
turn_detection="multilingual",
)
@agents.function_tool
async def book_appointment(self, ctx, date: str, time: str):
# MCP-compatible tool call
return await self.tools.call("calendar.book", {"date": date, "time": time})
CallSphere terminates every product on Twilio (Healthcare AI on FastAPI :8084 to OpenAI Realtime, Real Estate AI, Sales Calling AI with 5 concurrent outbound, Salon AI, IT Helpdesk AI, After-Hours AI Twilio simul call+SMS 120-second timeout). 37 agents, 90+ tools, 115+ DB tables, HIPAA + SOC 2, $149/$499/$1499 plans, 14-day trial, 22% affiliate. We selected Twilio over LiveKit SIP for production because Twilio's compliance attestations, number porting, and global reach are still ahead. We track LiveKit because the Telnyx + LiveKit launch in April 2026 changes the math for high-volume tenants. Our reference architecture has a LiveKit-on-Telnyx fallback path that uses our same agent prompts via the Agents 1.5 API.
pip install livekit-agents livekit-plugins-openai livekit-plugins-deepgram livekit-plugins-cartesia.python agent.py dev for local, production for managed.LiveKit native SIP or stick with Twilio bridge? For greenfield, native SIP is cleaner. For existing Twilio investments, the bridge keeps your numbers and compliance.
MCP support means what? Tool calls are portable: define once, run on OpenAI, Anthropic, Gemini, or any MCP-compatible host.
Adaptive interruption tuning? The vad parameters expose threshold and prefix-padding. For noisy environments raise threshold; for fast talkers shorten prefix.
Still reading? Stop comparing — try CallSphere live.
CallSphere ships complete AI voice agents per industry — 14 tools for healthcare, 10 agents for real estate, 4 specialists for salons. See how it actually handles a call before you book a demo.
Self-host or LiveKit Cloud? Cloud for fast deployment under 1k concurrent. Self-host on Kubernetes when you need custom GPU pools or strict region-locking.
Latency? Sub-200 ms voice-to-voice with the LiveKit on Telnyx beta; 600-900 ms typical with external trunks.
Start a 14-day trial of our Twilio-native AI voice, see pricing for $149/$499/$1499, or book a demo to compare LiveKit Agents to our managed stack.
Written by
Sagar Shankaran· Founder, CallSphere
Sagar Shankaran is the founder of CallSphere, where he builds production AI voice and chat agents deployed across healthcare, hospitality, real estate, and home services. He writes about agentic AI, LLM engineering, and shipping voice agents that handle real calls in production.
See how AI voice agents work for your industry. Live demo available -- no signup required.
GPT-Realtime-2 brings GPT-5-class reasoning into voice. What that means for tool-call reliability, structured output, and production agent design.
How to design a multi-agent system using MCP for tools and A2A for cross-vendor coordination, with a CallSphere voice agent as a participating node.
MCP is agent-to-tool. A2A is agent-to-agent. Here is a clear 2026 decision guide for builders choosing between (and combining) the two protocols.
Google's May 2026 MCP 1.0 + A2A developers guide is the cleanest protocol picker we have seen. The takeaways, in plain English, with a CallSphere lens.
A2A unlocks cross-vendor agent coordination, but most enterprise voice/chat workloads still ship faster on a single-vendor stack. Here is how to choose.
The Official MCP Registry hit API freeze v0.1. Smithery has 7,000+ servers, mcp.so has 19,700+, PulseMCP is hand-curated. We compare discovery, install, and security across the major catalogs.
© 2026 CallSphere LLC. All rights reserved.