---
title: "Call Transfer Patterns for AI Agents: Warm Transfer, Cold Transfer, and Conferencing"
description: "Master the three call transfer patterns for AI voice agents: cold transfer, warm transfer, and conferencing. Covers context passing, hold music, agent whisper, and seamless handoff implementation."
canonical: https://callsphere.ai/blog/call-transfer-patterns-ai-agents-warm-cold-conferencing
category: "Learn Agentic AI"
tags: ["Call Transfer", "Warm Transfer", "Voice AI", "Telephony", "Agent Handoff", "Contact Center"]
author: "CallSphere Team"
published: 2026-03-17T00:00:00.000Z
updated: 2026-05-06T01:02:45.030Z
---

# Call Transfer Patterns for AI Agents: Warm Transfer, Cold Transfer, and Conferencing

> Master the three call transfer patterns for AI voice agents: cold transfer, warm transfer, and conferencing. Covers context passing, hold music, agent whisper, and seamless handoff implementation.

## The Three Transfer Patterns

When an AI agent cannot fully resolve a caller's issue, it must transfer the call to a human. How that transfer happens dramatically affects customer experience. There are three patterns, each with distinct tradeoffs:

**Cold Transfer** — The AI connects the caller directly to the destination. The caller may hear ringing and must re-explain their issue. Fast but frustrating.

**Warm Transfer** — The AI first speaks to the human agent, passes context, then bridges the caller in. The caller does not repeat themselves. Slower but much better experience.

**Conference Transfer** — The AI, caller, and human agent are briefly all on the same call. The AI introduces the situation, then drops off. Best for complex handoffs.

## Cold Transfer Implementation

Cold transfer is the simplest pattern. The AI terminates its leg of the call and connects the caller directly to the destination:

```mermaid
flowchart LR
    CALLER(["Caller"])
    subgraph TEL["Telephony"]
        SIP["Twilio SIP and PSTN"]
    end
    subgraph BRAIN["Business AI Agent"]
        STT["Streaming STT
Deepgram or Whisper"]
        NLU{"Intent and
Entity Extraction"}
        TOOLS["Tool Calls"]
        TTS["Streaming TTS
ElevenLabs or Rime"]
    end
    subgraph DATA["Live Data Plane"]
        CRM[("CRM and Notes")]
        CAL[("Calendar and
Schedule")]
        KB[("Knowledge Base
and Policies")]
    end
    subgraph OUT["Outcomes"]
        O1(["Booking captured"])
        O2(["CRM record created"])
        O3(["Human handoff"])
    end
    CALLER --> SIP --> STT --> NLU
    NLU -->|Lookup| TOOLS
    TOOLS  CRM
    TOOLS  CAL
    TOOLS  KB
    NLU --> TTS --> SIP --> CALLER
    NLU -->|Resolved| O1
    NLU -->|Schedule| O2
    NLU -->|Escalate| O3
    style CALLER fill:#f1f5f9,stroke:#64748b,color:#0f172a
    style NLU fill:#4f46e5,stroke:#4338ca,color:#fff
    style O1 fill:#059669,stroke:#047857,color:#fff
    style O2 fill:#0ea5e9,stroke:#0369a1,color:#fff
    style O3 fill:#f59e0b,stroke:#d97706,color:#1f2937
```

```python
from twilio.twiml.voice_response import VoiceResponse, Dial
from fastapi import FastAPI, Request
from fastapi.responses import Response

app = FastAPI()

@app.post("/cold-transfer")
async def cold_transfer(request: Request):
    """Transfer the caller directly to a human agent."""
    form = await request.form()
    call_sid = form.get("CallSid")

    # Log the transfer context before disconnecting
    await save_transfer_context(call_sid, {
        "reason": "billing_dispute",
        "caller_sentiment": "frustrated",
        "summary": "Caller disputing charge of $49.99 from March 3",
    })

    response = VoiceResponse()
    response.say(
        "I am connecting you to a billing specialist now. "
        "Please hold."
    )

    dial = Dial(
        caller_id=form.get("From"),  # Preserve caller ID
        timeout=30,
        action="/transfer-complete",  # Called when dial ends
    )
    dial.number(
        "+15559876543",
        status_callback="/agent-answered",
        status_callback_event="initiated ringing answered completed",
    )
    response.append(dial)

    # Fallback if agent does not answer
    response.say(
        "I am sorry, no agent is available right now. "
        "Let me take a message."
    )
    response.redirect("/take-message")

    return Response(content=str(response), media_type="application/xml")
```

## Warm Transfer Implementation

Warm transfer requires managing three call legs: the original call (on hold), a whisper call to the agent, and the final bridged call:

```python
from twilio.rest import Client
import os

twilio_client = Client()

class WarmTransferManager:
    """Manages warm transfers with context passing."""

    def __init__(self, twilio_client, webhook_base):
        self.client = twilio_client
        self.webhook_base = webhook_base

    async def initiate_warm_transfer(
        self, call_sid: str, agent_number: str, context: dict
    ):
        """Start the warm transfer process."""
        # Step 1: Put the caller on hold with music
        self.client.calls(call_sid).update(
            twiml='

'
                  'https://api.twilio.com/cowbell.mp3'
                  '',
        )

        # Step 2: Store context for the whisper
        await self.store_context(call_sid, context)

        # Step 3: Call the human agent with a whisper
        whisper_call = self.client.calls.create(
            to=agent_number,
            from_=os.environ["TWILIO_NUMBER"],
            url=(
                f"{self.webhook_base}/agent-whisper"
                f"?original_call={call_sid}"
            ),
            status_callback=f"{self.webhook_base}/whisper-status",
        )

        return whisper_call.sid

    async def store_context(self, call_sid: str, context: dict):
        """Store transfer context for the receiving agent."""
        import json
        # Use Redis for fast retrieval during the whisper
        await self.redis.set(
            f"transfer:{call_sid}",
            json.dumps(context),
            ex=300,  # 5 minute TTL
        )

@app.post("/agent-whisper")
async def agent_whisper(request: Request):
    """Play context to the human agent before bridging."""
    form = await request.form()
    original_call = form.get("original_call")

    # Retrieve the transfer context
    context = await get_transfer_context(original_call)

    response = VoiceResponse()

    # Whisper: only the agent hears this
    whisper_text = (
        f"Incoming transfer. Caller: {context['caller_name']}. "
        f"Issue: {context['summary']}. "
        f"Sentiment: {context['sentiment']}. "
        f"Press 1 to accept, 2 to decline."
    )
    response.say(whisper_text, voice="Polly.Joanna")

    gather = response.gather(
        num_digits=1,
        action=f"/agent-accept?original_call={original_call}",
        timeout=10,
    )

    # Timeout fallback — decline
    response.say("No response received. Transfer cancelled.")
    response.hangup()

    return Response(content=str(response), media_type="application/xml")

@app.post("/agent-accept")
async def agent_accept_transfer(request: Request):
    """Bridge the caller and agent after acceptance."""
    form = await request.form()
    digit = form.get("Digits")
    original_call = form.get("original_call")

    response = VoiceResponse()

    if digit == "1":
        # Agent accepted — bridge the calls via conference
        conference_name = f"transfer-{original_call}"

        # Connect the agent to the conference
        dial = Dial()
        dial.conference(conference_name, end_conference_on_exit=True)
        response.append(dial)

        # Move the original caller into the same conference
        twilio_client.calls(original_call).update(
            twiml=(
                f''
                f'{conference_name}'
                f''
            ),
        )
    else:
        response.say("Transfer declined.")
        response.hangup()
        # Return caller to AI agent
        twilio_client.calls(original_call).update(
            url=f"{os.environ['WEBHOOK_BASE']}/return-to-ai",
            method="POST",
        )

    return Response(content=str(response), media_type="application/xml")
```

## Conference Transfer (Three-Way Introduction)

The conference pattern keeps all three parties briefly on the same call:

```python
class ConferenceTransferManager:
    """Three-way conference transfer with AI introduction."""

    async def initiate_conference_transfer(
        self, call_sid: str, agent_number: str, context: dict
    ):
        """Set up a three-way call for handoff."""
        conference_name = f"handoff-{call_sid}"

        # Move caller into a conference (from hold)
        twilio_client.calls(call_sid).update(
            twiml=(
                f''
                f'I am bringing in a specialist now.'
                f'{conference_name}'
                f''
            ),
        )

        # Add the AI agent to the conference (for introduction)
        ai_participant = twilio_client.conferences(
            conference_name
        ).participants.create(
            from_=os.environ["TWILIO_NUMBER"],
            to="sip:ai-intro@yourdomain.com",
            early_media=True,
        )

        # Add the human agent
        human_participant = twilio_client.conferences(
            conference_name
        ).participants.create(
            from_=os.environ["TWILIO_NUMBER"],
            to=agent_number,
            early_media=True,
        )

        return conference_name

    async def ai_introduction(self, conference_name, context):
        """AI speaks the introduction then leaves."""
        intro_text = (
            f"Hello everyone. I have {context['caller_name']} on "
            f"the line who needs help with {context['summary']}. "
            f"I will leave you to it."
        )
        # Speak the introduction via TTS
        await self.speak_in_conference(conference_name, intro_text)

        # Remove the AI from the conference
        await asyncio.sleep(2)  # Brief pause after speaking
        await self.remove_ai_from_conference(conference_name)
```

## Context Passing Best Practices

The value of a warm transfer is the context. Structure it well:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class TransferContext:
    """Structured context passed during call transfer."""
    caller_name: str
    caller_number: str
    call_duration_seconds: int
    issue_summary: str
    sentiment: str  # positive, neutral, frustrated, angry
    intent: str
    actions_taken: list[str]
    information_collected: dict
    previous_transfers: int
    preferred_language: str = "en"
    priority: str = "normal"
    notes: Optional[str] = None

    def to_whisper_script(self) -> str:
        """Generate a concise whisper message for the agent."""
        actions = ", ".join(self.actions_taken) if self.actions_taken else "none yet"
        return (
            f"Caller: {self.caller_name}. "
            f"Issue: {self.issue_summary}. "
            f"Mood: {self.sentiment}. "
            f"Already tried: {actions}. "
            f"Priority: {self.priority}."
        )

    def to_screen_pop(self) -> dict:
        """Generate data for the agent's screen pop display."""
        return {
            "caller": self.caller_name,
            "phone": self.caller_number,
            "summary": self.issue_summary,
            "sentiment_emoji": {
                "positive": "green",
                "neutral": "yellow",
                "frustrated": "orange",
                "angry": "red",
            }.get(self.sentiment, "yellow"),
            "history": self.actions_taken,
            "collected_data": self.information_collected,
            "transfer_count": self.previous_transfers,
        }
```

## FAQ

### When should I use warm transfer versus cold transfer?

Use cold transfer for simple routing where context is not critical — e.g., transferring to a general queue. Use warm transfer when the caller has already explained their issue to the AI and repeating it would cause frustration — especially for complaints, complex issues, or VIP callers. The extra 10-15 seconds for a warm transfer pays for itself in customer satisfaction.

### How do I handle the case where the human agent does not answer?

Implement a timeout with fallback logic. After 20-30 seconds of ringing, cancel the transfer and either return the caller to the AI agent, offer to take a message, or try an alternative agent. Always inform the caller what is happening: "Our specialist is not available right now. Would you like me to take a message, or would you prefer to try again later?"

### How do I pass context to the agent's screen in addition to the whisper?

Use a parallel HTTP notification. When you initiate the warm transfer, simultaneously POST the `TransferContext` data to your contact center's API or the agent's desktop application. Most modern contact center platforms (Five9, Genesys, Talkdesk) have APIs for screen pops. The whisper provides audio context, and the screen pop provides visual context — both arrive before the caller is bridged in.

---

#CallTransfer #WarmTransfer #VoiceAI #Telephony #AgentHandoff #ContactCenter #AgenticAI #LearnAI #AIEngineering

---

Source: https://callsphere.ai/blog/call-transfer-patterns-ai-agents-warm-cold-conferencing
