By Sagar Shankaran, Founder of CallSphere
The CallSphere /demo page hands a user a working voice AI in one click. Here is the WebRTC plumbing that gets a Realtime session live before the page settles.
Key takeaways
A voice AI demo that needs a phone number is dead on arrival. In 2026 the bar is one click, in the browser, with sub-500 ms first-audio. WebRTC + OpenAI Realtime makes it boring.
flowchart LR
Browser["Browser · WebRTC"] --> ICE["ICE / STUN / TURN"]
ICE --> SFU["SFU · Pion Go gateway 1.23"]
SFU --> NATS["NATS bus"]
NATS --> AI["AI Worker · OpenAI Realtime"]
AI --> NATS
NATS --> SFU
SFU --> Browser
Embedding a working voice agent on a marketing page used to be impossible because every transport route went through telephony. With WebRTC + ephemeral Realtime tokens, the browser takes the place of the SIP client. Your landing page hands the user a microphone, a "Talk" button, and a 380 ms first-audio experience.
In 2026 this is now table stakes for any AI voice company. CallSphere's /demo page does it. So do most of our competitors. The differentiator is no longer "does it work?" — it is "how fast is the first turn?"
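If the first turn is the differentiator, it is worth measuring. Here is a minimal sketch, assuming first-audio latency is defined as the time from the click on "Talk" to the moment agent audio starts playing. The name `createFirstAudioTimer` and its methods are hypothetical, and the clock is injectable so the logic can be checked outside a browser.

```ts
// Hypothetical helper: measures click-to-first-audio latency in milliseconds.
type Clock = () => number;

function createFirstAudioTimer(now: Clock = () => performance.now()) {
  let clickedAt: number | null = null;
  let firstAudioAt: number | null = null;
  return {
    // Call at the top of the "Talk" button's click handler.
    markClick(): void {
      clickedAt = now();
      firstAudioAt = null;
    },
    // Call when agent audio starts, e.g. from the audio element's "playing" event.
    // Only the first call after a click is recorded.
    markFirstAudio(): void {
      if (clickedAt !== null && firstAudioAt === null) firstAudioAt = now();
    },
    // Returns null until both marks have been recorded.
    latencyMs(): number | null {
      return clickedAt !== null && firstAudioAt !== null ? firstAudioAt - clickedAt : null;
    },
  };
}
```

Marking first audio from `pc.ontrack` alone would likely under-report, because that event fires when the remote description is applied rather than when sound is heard. The audio element's `playing` event is a stricter mark.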
A 1-click demo has three moving parts:
That is it. No SFU, no signaling server, no SIP gateway. Mic-on to first-audio under half a second.
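The one server-side piece is the token endpoint, because the long-lived API key must never reach the page. Here is a minimal sketch of what a `/api/realtime/token` handler might do. It assumes OpenAI's Realtime session endpoint at `POST /v1/realtime/sessions` and a response carrying `client_secret.value`. Both the upstream URL and the response shape are assumptions to check against the current API reference, and the helper names are hypothetical.

```ts
// Shape the browser expects back from /api/realtime/token (assumed).
interface TokenPayload {
  client_secret: string;
  model: string;
}

// Builds the upstream request. The URL is an assumption; verify it against the API reference.
function buildSessionRequest(apiKey: string, model: string) {
  const headers: Record<string, string> = {
    Authorization: `Bearer ${apiKey}`,
    "Content-Type": "application/json",
  };
  return {
    url: "https://api.openai.com/v1/realtime/sessions",
    init: { method: "POST", headers, body: JSON.stringify({ model }) },
  };
}

// Extracts the short-lived secret. Assumed response shape: { client_secret: { value, ... } }.
function toTokenPayload(session: unknown, model: string): TokenPayload {
  const secret = (session as { client_secret?: { value?: unknown } } | null)?.client_secret?.value;
  if (typeof secret !== "string") throw new Error("unexpected token response shape");
  return { client_secret: secret, model };
}

// Hypothetical helper a route handler could call; apiKey comes from server-side config.
async function mintEphemeralToken(apiKey: string, model: string): Promise<TokenPayload> {
  const { url, init } = buildSessionRequest(apiKey, model);
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`token request failed: ${res.status}`);
  return toTokenPayload(await res.json(), model);
}
```

Splitting the request builder and the response parser out of the network call keeps the parts that are most likely to drift with API changes easy to check in isolation.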
The /demo page on callsphere.ai uses exactly the snippet below, with two differences:
The demo intentionally skips telephony, the 6-container pod, and the Go gateway — those exist on the production path for paying customers. The /demo path proves the user-facing latency story before they sign up. That is the marketing job: a 30-second click-to-talk experience converts at 8–9% on our page.
```ts
async function oneClickVoice() {
  // Fetch a short-lived credential from our own backend; the long-lived API key stays server-side.
  // Assumes /api/realtime/token returns client_secret as a plain string.
  const { client_secret, model } = await fetch("/api/realtime/token").then(r => r.json());

  const pc = new RTCPeerConnection();

  // Play the agent's audio when the remote track arrives.
  const audio = document.getElementById("agent-audio") as HTMLAudioElement;
  pc.ontrack = (e) => { audio.srcObject = e.streams[0]; };

  // Send the user's microphone to the model.
  const stream = await navigator.mediaDevices.getUserMedia({ audio: true });
  pc.addTrack(stream.getAudioTracks()[0], stream);

  // The data channel carries Realtime events; configure the session once it opens.
  const dc = pc.createDataChannel("oai");
  dc.onopen = () => dc.send(JSON.stringify({
    type: "session.update",
    session: { instructions: "You are CallSphere's demo agent." },
  }));

  // SDP offer/answer exchange over HTTPS.
  await pc.setLocalDescription(await pc.createOffer());
  const ans = await fetch(`https://api.openai.com/v1/realtime?model=${model}`, {
    method: "POST",
    headers: { Authorization: `Bearer ${client_secret}`, "Content-Type": "application/sdp" },
    body: pc.localDescription!.sdp,
  });
  if (!ans.ok) throw new Error(`SDP exchange failed: ${ans.status}`);
  await pc.setRemoteDescription({ type: "answer", sdp: await ans.text() });
}
```
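The snippet sends only `session.update` when the data channel opens, so the agent waits for the user to speak. One way to have the agent talk first is to follow it with a `response.create` event. The sketch below assumes `response.create` accepts per-response `instructions`, which should be confirmed against the Realtime API reference. `openingEvents` and `sendOpeningEvents` are hypothetical names.

```ts
// A client event sent over the data channel as JSON.
interface RealtimeClientEvent {
  type: string;
  [key: string]: unknown;
}

// Builds the events to send on open, in order: configure the session, then request a greeting.
function openingEvents(instructions: string, greeting: string): RealtimeClientEvent[] {
  return [
    { type: "session.update", session: { instructions } },
    // Assumption: per-response instructions steer only this first turn.
    { type: "response.create", response: { instructions: `Greet the user by saying: ${greeting}` } },
  ];
}

// Minimal structural type so this works with an RTCDataChannel or a test double.
interface Sendable {
  send(data: string): void;
}

// Serializes and sends each event in order.
function sendOpeningEvents(dc: Sendable, events: RealtimeClientEvent[]): void {
  for (const event of events) dc.send(JSON.stringify(event));
}
```

In the snippet above, this would replace the body of `dc.onopen`, for example `dc.onopen = () => sendOpeningEvents(dc, openingEvents("You are CallSphere's demo agent.", "Hi, thanks for trying the demo."));`.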
Do I need TURN for this?
For a marketing demo, it is optional. For a real product, yes.
Will Safari work?
Yes. Safari 11+ on macOS and iOS supports WebRTC. Autoplay has some quirks, so call `audio.play()` from a user gesture.
How long does an ephemeral key last?
A few minutes; refresh on the server side every minute if needed.
Can I prerecord/cache the welcome line?
Yes. Emit a `response.create` immediately on `dc.onopen`.
How do I prevent abuse?
Rate-limit by IP and fingerprint, and require a Turnstile or hCaptcha pass first.
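The rate-limiting advice can be sketched as a small fixed-window limiter in front of the token endpoint. This is an illustration under stated assumptions, not how the /demo page does it: the state lives in process memory (a multi-instance deployment would need a shared store), the key is whatever identifier you choose (IP, fingerprint, or both), and `createRateLimiter` is a hypothetical name.

```ts
// Hypothetical in-memory fixed-window rate limiter.
// Allows at most `limit` calls per key within each window of `windowMs` milliseconds.
function createRateLimiter(limit: number, windowMs: number, now: () => number = Date.now) {
  if (limit < 1 || windowMs <= 0) throw new Error("limit must be >= 1 and windowMs > 0");
  const windows = new Map<string, { start: number; count: number }>();

  return function allow(key: string): boolean {
    const t = now();
    const current = windows.get(key);
    // Start a fresh window on first sight of the key or once the previous window has elapsed.
    if (!current || t - current.start >= windowMs) {
      windows.set(key, { start: t, count: 1 });
      return true;
    }
    if (current.count >= limit) return false;
    current.count += 1;
    return true;
  };
}
```

A token handler could call `allow(clientIp)` before minting a key and return HTTP 429 when it yields `false`. The captcha check would sit in front of this as a separate step.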
Try it now on /demo. When you are ready to ship, /trial is 14 days free.
Written by
Sagar Shankaran · Founder, CallSphere
Sagar Shankaran is the founder of CallSphere, where he builds production AI voice and chat agents deployed across healthcare, hospitality, real estate, and home services. He writes about agentic AI, LLM engineering, and shipping voice agents that handle real calls in production.
© 2026 CallSphere LLC. All rights reserved.