Assistant best practices

Last updated: September 1, 2025

Getting great results often comes down to picking the right engine settings. Use this checklist when configuring an assistant:

1. Pick a Mode

Mode	Why choose it?	Notes
Dualplex (Beta)	Fast turn-taking + premium/cloned voices	Recommended default. Pair with Gemini Flash 2.0/2.5 or GPT‑5 Realtime.
Speech-to-Speech (Multimodal)	Fastest turn-taking & most natural flow	Recommended model: GPT‑5 Realtime.
Pipeline	Maximum control over voice & long-form replies	Recommended model: GPT‑5 Mini. If you select Pipeline, continue to the Transcriber step below.

Want to know more about the differences between the modes? Read the Assistant modes guide.

Experiment with all three modes: record the same scenario in each and compare response time and caller satisfaction.

2. Choose a Transcriber (Pipeline only)

Transcriber	Accuracy	Latency	Best for
Azure	⭐⭐⭐⭐	⏱️⏱️⏱️ (slower)	When you need the highest transcription fidelity.
Gladia	⭐⭐⭐	⏱️ (faster)	Good all-rounder for most languages.
Deepgram	⭐⭐⭐	⏱️ (faster)	Another solid choice—test which performs better for your language & audio setup.

Tip: Different languages, accents, or background noise can impact each engine differently. Run a quick A/B test and keep the best performer.

3. Select an LLM Model

Model	Strengths	Trade-offs
GPT-5 Mini	Balanced reasoning with low latency	May be slower than realtime models for rapid turn-taking.
GPT-5 Realtime	Designed for ultra-low-latency voice turns	Best for Speech-to-Speech and Dualplex.
GPT-4o	Strong reasoning and multimodal understanding	Higher latency.
Gemini Flash 2.0 / 2.5	Ultra-fast for voice turns in Dualplex/Multimodal	Excellent for minimizing perceived latency.

If speed is critical, use GPT‑5 Realtime (great for Speech-to-Speech) or Gemini Flash 2.0/2.5 (great with Dualplex). For richer reasoning, use GPT-4o or GPT-5 Mini and offset latency by using filler audios.

4. Noise Cancellation

If callers are on speaker phone or in a quiet environment, keep noise cancellation ON. If your call volume is low or some words are “clipped,” turn it OFF so the transcriber gets the full waveform.

If your assistant is not hearing you well, you can try to turn off noise cancellation.

5. Conversation Timers

Parameter	Recommended	Why
Re-engagement	`≈ 30 s`	Gives callers enough time to think. Lower values can feel pushy.
Max silence duration	`≈ 60 s`	Prevents premature hang-ups while still ending truly silent calls.

Test different values in real calls—too low can interrupt, too high leaves awkward gaps.

6. Initial Message

Mode	How it’s used	Best practice
Pipeline	Read exactly as written (converted by TTS).	Write the greeting verbatim: “Hello, this is Alex from …”.
Dualplex	Read exactly as written (rendered via ElevenLabs TTS).	Write the greeting verbatim, then select your cloned voice.
Speech-to-Speech	Interpreted as a prompt by the model.	Include instructions like “Greet the customer and say …” or prepend `say exactly:` to ensure literal output.

7. Ambient sound

Enabled by default, ambient sound is a feature that adds background noise to the assistant’s voice.

If the assistant is not hearing you well, you can try to turn off ambient sound or turn the volume lower.

8. Endpointing sliders

Control when your assistant starts talking with the endpointing sensitivity slider at the bottom of assistant settings.

Setting	Effect	Use when
Lower sensitivity	Assistant responds faster after caller stops speaking	You want snappy, quick-turn conversations
Higher sensitivity	Assistant waits longer before responding	Callers give longer, more detailed replies

Pro tip: If your assistant cuts off callers mid-sentence, increase the sensitivity. If responses feel sluggish, decrease it.

9. Debug using call transcript

If you are having issues with your assistant, you can use the call transcript to debug the issue.

Go to the Call history page.
Click on the last call you tested
The call transcript will be shown including function calls and its parameters.

10. Still have questions?

If you have any questions, please contact our support team via the chat widget inside the app.

Test different settings with real calls—the right balance depends on your conversation flow and caller behavior patterns.

Need a complete list of every toggle and slider? See the full Assistant settings reference.

​1. Pick a Mode

​2. Choose a Transcriber (Pipeline only)

​3. Select an LLM Model

​4. Noise Cancellation

​5. Conversation Timers

​6. Initial Message

​7. Ambient sound

​8. Endpointing sliders

​9. Debug using call transcript

​10. Still have questions?

1. Pick a Mode

2. Choose a Transcriber (Pipeline only)

3. Select an LLM Model

4. Noise Cancellation

5. Conversation Timers

6. Initial Message

7. Ambient sound

8. Endpointing sliders

9. Debug using call transcript

10. Still have questions?