audio_sample_rate?|
confirmation_mode?|
Confirmation behavior: strict (explicit yes only), flexible (implicit OK), auto-proceed (timeout-based)
confirmation_timeout_seconds?|
Seconds to wait before re-prompting for confirmation (3-15)
default_language?|
Default language when detection confidence is too low (e.g., 'fr-FR')
detection_confidence_threshold?|
Minimum confidence, as a percentage (50-100), required to accept the detected language
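As an illustration of how default_language and detection_confidence_threshold might interact, here is a minimal sketch. The function name `resolve_language`, the 70 percent default, and the choice of `>=` for "accept" are assumptions made for the example and are not part of the documented schema.

```python
def resolve_language(
    detected: str,
    confidence: float,
    detection_confidence_threshold: float = 70.0,  # hypothetical default
    default_language: str = "fr-FR",
) -> str:
    """Return the detected language, or the default when confidence is too low."""
    # The reference above documents a valid range of 50-100 for the threshold.
    if not 50 <= detection_confidence_threshold <= 100:
        raise ValueError("detection_confidence_threshold must be in 50-100")
    # Assumption: a confidence equal to the threshold counts as accepted.
    if confidence >= detection_confidence_threshold:
        return detected
    return default_language
```

With these assumed defaults, a detection of 'nl-NL' at 82 percent is kept, while the same detection at 55 percent falls back to 'fr-FR'.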
enable_explicit_language_lock?|
Honor user requests to stay in a specific language (e.g., 'rester en français')
language?|
Language/locale code (e.g., 'fr-FR', 'nl-NL', 'de-DE')
language_detection_enabled?|
Enable automatic language detection
language_voice_defaults?|
Mapping of language codes to default voice IDs (e.g., {'fr-FR': 'Lea', 'nl-NL': 'Lotte'})
openai_stt_prefix_padding_ms?|
Audio padding before speech in ms (only used when vad_type=server_vad)
openai_stt_silence_ms?|
Silence duration in ms before end-of-speech (only used when vad_type=server_vad)
openai_stt_vad_eagerness?|
Semantic VAD eagerness level (only used when vad_type=semantic_vad)
openai_stt_vad_threshold?|
VAD threshold (0.0-1.0, only used when vad_type=server_vad)
openai_stt_vad_type?|
OpenAI STT VAD type: semantic_vad (AI-based) or server_vad (silence-based)
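The openai_stt_* VAD parameters only apply under one of the two VAD types. The sketch below shows one way a caller could filter a configuration down to the settings that take effect. The helper name `effective_vad_settings` and the dictionary-based config are hypothetical; only the parameter names and the 0.0-1.0 range come from the reference above.

```python
from typing import Any, Dict

# Which parameters apply under which openai_stt_vad_type, per the descriptions above.
_VAD_KEYS = {
    "server_vad": (
        "openai_stt_vad_threshold",
        "openai_stt_prefix_padding_ms",
        "openai_stt_silence_ms",
    ),
    "semantic_vad": ("openai_stt_vad_eagerness",),
}


def effective_vad_settings(config: Dict[str, Any]) -> Dict[str, Any]:
    """Keep only the VAD settings relevant to the selected VAD type."""
    vad_type = config.get("openai_stt_vad_type")
    if vad_type not in _VAD_KEYS:
        raise ValueError(f"unknown openai_stt_vad_type: {vad_type!r}")
    settings = {"openai_stt_vad_type": vad_type}
    for key in _VAD_KEYS[vad_type]:
        if key in config:
            settings[key] = config[key]
    # The threshold is documented as 0.0-1.0 and is only read for server_vad.
    threshold = settings.get("openai_stt_vad_threshold")
    if threshold is not None and not 0.0 <= threshold <= 1.0:
        raise ValueError("openai_stt_vad_threshold must be in 0.0-1.0")
    return settings
```

Passing a server_vad configuration that also sets openai_stt_vad_eagerness would return the threshold, padding, and silence values and drop the eagerness value, since the latter is only used with semantic_vad.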
redetection_threshold?|
Consecutive language disagreements needed to trigger re-detection (3-10, default 5)
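One plausible reading of redetection_threshold is a streak counter that resets whenever the detected language agrees with the current one. The following sketch assumes that reading; the class `RedetectionTracker` and its `observe` method are hypothetical names, not part of the documented API.

```python
class RedetectionTracker:
    """Count consecutive language disagreements and signal re-detection."""

    def __init__(self, redetection_threshold: int = 5) -> None:
        # Documented range is 3-10, with a default of 5.
        if not 3 <= redetection_threshold <= 10:
            raise ValueError("redetection_threshold must be in 3-10")
        self.redetection_threshold = redetection_threshold
        self._streak = 0

    def observe(self, current_language: str, detected_language: str) -> bool:
        """Record one detection result; return True when re-detection should run."""
        if detected_language == current_language:
            # Assumption: any agreement breaks the streak.
            self._streak = 0
            return False
        self._streak += 1
        if self._streak >= self.redetection_threshold:
            self._streak = 0  # start counting afresh after triggering
            return True
        return False
```

Under this interpretation, isolated misdetections never trigger a switch; only an unbroken run of disagreements of the configured length does.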
stt_detect_language?|
Enable Whisper language detection (only used when stt_language=auto)
stt_language?|
Whisper language: auto|en|fr|nl|de
stt_model?|
Provider-specific STT model (OpenAI: whisper-1; Groq: whisper-large-v3, distil-whisper-large-v3-en)
stt_provider?|
Speech-to-text provider: aws|openai|groq
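The STT parameters depend on one another: stt_model is provider-specific, and stt_detect_language only matters when stt_language is auto. A minimal consistency check might look like the sketch below. The function `check_stt_config`, the warning strings, and the default argument values are illustrative assumptions; the model lists are limited to those named in the reference above, and no models are listed there for aws.

```python
from typing import List, Optional

PROVIDERS = {"aws", "openai", "groq"}

# Only the models named in the stt_model description; others may exist.
KNOWN_MODELS = {
    "openai": {"whisper-1"},
    "groq": {"whisper-large-v3", "distil-whisper-large-v3-en"},
}


def check_stt_config(
    stt_provider: str,
    stt_model: Optional[str] = None,
    stt_language: str = "auto",
    stt_detect_language: bool = False,
) -> List[str]:
    """Return a list of warnings about settings that look inconsistent."""
    if stt_provider not in PROVIDERS:
        raise ValueError(f"unknown stt_provider: {stt_provider!r}")
    warnings: List[str] = []
    known = KNOWN_MODELS.get(stt_provider)
    if stt_model is not None and known is not None and stt_model not in known:
        warnings.append(f"{stt_model} is not a listed model for {stt_provider}")
    if stt_detect_language and stt_language != "auto":
        warnings.append("stt_detect_language is ignored unless stt_language=auto")
    return warnings
```

Returning warnings rather than raising keeps the check advisory, which seems appropriate given that the listed models may not be exhaustive.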
supported_languages?array<>|
List of supported language codes for detection (e.g., ['en-US', 'fr-FR', 'nl-NL'])
vad_sensitivity?|
vad_silence_threshold_ms?|
voice_switch_cooldown_ms?|
Minimum milliseconds between voice switches (5000-60000, default 15000)
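A cooldown such as voice_switch_cooldown_ms is commonly enforced with a timestamp gate. The sketch below assumes that approach; `VoiceSwitchGate` and `try_switch` are hypothetical names, and the caller is assumed to supply a monotonic clock reading in milliseconds.

```python
from typing import Optional


class VoiceSwitchGate:
    """Allow a voice switch only if the cooldown has elapsed since the last one."""

    def __init__(self, voice_switch_cooldown_ms: int = 15000) -> None:
        # Documented range is 5000-60000, with a default of 15000.
        if not 5000 <= voice_switch_cooldown_ms <= 60000:
            raise ValueError("voice_switch_cooldown_ms must be in 5000-60000")
        self.voice_switch_cooldown_ms = voice_switch_cooldown_ms
        self._last_switch_ms: Optional[int] = None

    def try_switch(self, now_ms: int) -> bool:
        """Return True and record the switch if it is allowed at now_ms."""
        if (
            self._last_switch_ms is not None
            and now_ms - self._last_switch_ms < self.voice_switch_cooldown_ms
        ):
            return False
        self._last_switch_ms = now_ms
        return True
```

With the default cooldown, a switch at 0 ms is allowed, a second attempt at 10000 ms is rejected, and an attempt at 15000 ms is allowed again.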