Mozilla shipped AV1 by default, H.264 simulcast with dependency descriptors, and OS-integrated screen capture in Firefox during 2025. Here is what is locked in for 2026 and how it affects voice AI agents.

The change

Mozilla published the "Firefox WebRTC 2025" wrap-up in January 2026, and four shipped items now form the Firefox 2026 baseline. (1) AV1 is on by default in every Firefox channel — no flag, no fallback path needed. (2) H.264 gained simulcast plus the dependency descriptor RTP header extension, which means Firefox can finally play in SFU-based selective forwarding workflows on the codec the iOS WebKit world still requires. (3) Camera resolution and frame-rate adaptation got rebuilt across all platforms, so the same getUserMedia constraints produce smoother streams with consistent aspect ratio. (4) macOS screen capture moved to the OS-integrated picker. Mozilla's stated 2026 priorities are continued web-compat work, broader codec interop, and chasing down the last simulcast/SVC parity gaps with Chromium.

What it unlocks

For voice AI specifically, AV1-by-default in Firefox means agent-side video clips (e.g. screen-share-while-on-call) can target one codec across Chrome and Firefox without negotiation thrash. H.264 simulcast finally makes Firefox a viable second-screen client for any SFU-based supervisor or whisper-coach experience that previously required Chrome. The camera adaptation rebuild kills a recurring support ticket pattern — Firefox users on 4K webcams no longer report distorted aspect ratios when joining a call sized for 720p tiles. Combined, these reduce the "Chrome-only" caveats your sales team has to explain.

flowchart TD
  A[Firefox 2025 ship list] --> B[AV1 default ON]
  A --> C[H.264 simulcast + dep descriptor]
  A --> D[Camera adaptation rebuild]
  A --> E[macOS OS screen capture]
  B --> F[Cross-browser AV1 negotiation]
  C --> G[SFU compatibility for iOS-bound flows]
  D --> H[Aspect-ratio consistency]
  E --> I[Native picker UX]

CallSphere context

CallSphere runs 37 agents · 90+ tools · 115+ tables · 6 verticals · HIPAA + SOC 2 aligned. Our supervisor whisper feature uses an SFU; we lit up Firefox H.264 simulcast for the Behavioral Health vertical the day Firefox 138 shipped, and Firefox sessions stopped falling back to single-layer H.264. The Real Estate OneRoof Pion Go gateway 1.23 negotiates AV1 first now that both major desktop browsers support it natively. Plans $149 / $499 / $1,499, 14-day trial, 22% affiliate Year 1.

Migration steps

Drop the "Chrome only" footer note on any feature gated by simulcast — Firefox is in
Reorder your codec preferences: ['AV1', 'VP9', 'H264', 'VP8'] cross-browser
Test getUserMedia against a 4K webcam in Firefox to confirm aspect-ratio adaptation
Add Firefox to your supervisor-whisper QA matrix
Subscribe to blog.mozilla.org/webrtc for monthly Firefox WebRTC updates

FAQ

Does Firefox iOS get these features? No — Firefox iOS uses WebKit per Apple App Store rules. Use Safari capability matrix on iOS.

Hear it before you finish reading

Talk to a live CallSphere AI voice agent in your browser — 60 seconds, no signup.

Try Live →

Try Live Demo →

AV1 hardware decode required? No, but software decode burns CPU. Prefer hardware on Apple Silicon and recent Intel/AMD chips.

Does dependency descriptor break older SFUs? The RTP header extension is opt-in — older SFUs ignore it without errors.

Should I default to AV1 in 2026? For one-way streams (agent-to-listener), yes. For two-way calls, prefer H.264 still on iOS users.

Sources

Mozilla - Firefox WebRTC 2025 wrap-up - https://blog.mozilla.org/webrtc/firefox-webrtc-2025/
Mozilla - Advancing WebRTC blog - https://blog.mozilla.org/webrtc/
WebRTC.org - Firefox Web APIs - https://webrtc.github.io/webrtc-org/web-apis/firefox/
AntMedia - WebRTC Browser Support 2026 - https://antmedia.io/webrtc-browser-support/
BlogGeek.me - WebRTC predictions for 2026 - https://bloggeek.me/webrtc-predictions-2026/

Firefox WebRTC Roadmap 2026: AV1 by Default, H.264 Simulcast, Camera Adaptation: production view

Firefox WebRTC Roadmap 2026: AV1 by Default, H.264 Simulcast, Camera Adaptation sounds like a single decision, but in production it splits into eval design, prompt cost, and observability. The deeper you push toward live traffic, the more those three pull against each other — better evals catch silent failures, prompt cost limits how often you can re-run them, and weak observability hides which retries are actually saving conversations versus burning latency budget.

Shipping the agent to production

Production AI agents live or die on three loops: evals, retries, and handoff state. CallSphere runs 37 agents across 6 verticals, each with its own eval suite — synthetic call transcripts replayed nightly with assertion checks on extracted entities (date, time, party size, insurance, address). Without that loop, prompt regressions ship silently and you only find out when bookings drop.

Still reading? Stop comparing — try CallSphere live.

CallSphere ships complete AI voice agents per industry — 14 tools for healthcare, 10 agents for real estate, 4 specialists for salons. See how it actually handles a call before you book a demo.

Try Live Demo → Book 30-min Walkthrough See Pricing

Structured tools beat free-form text every time. Our 90+ function tools all enforce JSON schemas validated server-side; if the model hallucinates an integer where a string is required, we retry with a corrective system message before falling back to a deterministic path. For long-running flows, we treat agent handoffs as a state machine — booking → confirmation → SMS — so context survives turn boundaries.

The Realtime API vs. async decision usually comes down to "is the user holding the phone right now?" If yes, Realtime; if no (callback queue, after-hours voicemail), async wins on cost-per-conversation, which we track per agent in 115+ database tables spanning all 6 verticals.

FAQ

What's the right way to scope the proof-of-concept? CallSphere runs 37 production agents and 90+ function tools across 115+ database tables in 6 verticals, so most workflows you'd want already have a template. For a topic like "Firefox WebRTC Roadmap 2026: AV1 by Default, H.264 Simulcast, Camera Adaptation", that means you're not starting from scratch — you're configuring an agent template that's already been hardened across thousands of conversations.

How do you handle compliance and data isolation? Day one is integration mapping (scheduler, CRM, messaging) and prompt tuning against your top 20 real call transcripts. Day two through five is shadow-mode running, where the agent transcribes and recommends but a human still answers, so you can compare side-by-side. Go-live is the moment your eval pass-rate clears your internal bar.

When does it make sense to switch from a managed model to a self-hosted one? The honest answer: it scales until your tool catalog gets stale. The agent is only as good as the integrations it can actually call, so the operational discipline is keeping schemas, webhooks, and fallback paths green. The platform handles the rest — observability, retries, multi-region routing — without your team owning the GPU layer.

Talk to us

Want to see how this maps to your stack? Book a live walkthrough at calendly.com/sagar-callsphere/new-meeting, or try the vertical-specific demo at healthcare.callsphere.tech. 14-day trial, no credit card, pilot live in 3–5 business days.

Firefox WebRTC Roadmap 2026: AV1 by Default, H.264 Simulcast, Camera Adaptation

The change

What it unlocks

CallSphere context

Migration steps

FAQ

Sources

Firefox WebRTC Roadmap 2026: AV1 by Default, H.264 Simulcast, Camera Adaptation: production view

Shipping the agent to production

FAQ

Talk to us

Try CallSphere AI Voice Agents

Related Articles You May Like

WebRTC Mobile Testing with BrowserStack + Sauce Labs (2026)

WebRTC Over QUIC and the Future of Realtime: Where Voice AI Goes After 2026

Anthropic Mythos: The Cybersecurity Model Behind Firefox's Vulnerability Sweep

OpenAI's May 2026 WebRTC Rearchitecture: How Voice Latency Got Real

Building a Custom Calling Platform: Enterprise Guide

WebRTC + AI Fact-Checker for Live News Studio Broadcasts in 2026

Product

Resources

Company

Legal

Industries

Integrations

Solutions

Compare

Pillar Guides