Update Voice Settings

Update voice settings.

Authorization

AuthorizationBearer <token>

In: header

Request Body

application/json

TypeScript Definitions

Use the request body type in TypeScript.

audio_sample_rate?|null

cartesia_language_voices?|

Cartesia localized mapping {iso_language -> localized voice UUID}

cartesia_localization_status?|

Cartesia localization status per language (pending|localizing|ready|failed)

cartesia_source_voice_id?|

Cartesia source voice UUID used for localized fallback

confirmation_mode?|

Confirmation behavior: strict (explicit yes only), flexible (implicit OK), auto-proceed (timeout-based)

confirmation_timeout_seconds?|

Seconds to wait before re-prompting for confirmation (3-15)

default_language?|

Default language when detection confidence is too low (e.g., 'fr-FR')

detection_confidence_threshold?|

Minimum confidence percentage (50-100) to accept detected language

enable_explicit_language_lock?|

Honor user requests to stay in a specific language (e.g., 'rester en français')

language?|

Language/locale code (e.g., 'fr-FR', 'nl-NL', 'de-DE')

language_detection_enabled?|

Enable automatic language detection

language_voice_defaults?|

Mapping of language codes to default voice IDs (e.g., {'fr-FR': 'Lea', 'nl-NL': 'Lotte'})

openai_stt_prefix_padding_ms?|

Audio padding before speech in ms (only used when vad_type=server_vad)

openai_stt_silence_ms?|

Silence duration in ms before end-of-speech (only used when vad_type=server_vad)

openai_stt_vad_eagerness?|

Semantic VAD eagerness level (only used when vad_type=semantic_vad)

openai_stt_vad_threshold?|

VAD threshold (0.0-1.0, only used when vad_type=server_vad)

openai_stt_vad_type?|

OpenAI STT VAD type: semantic_vad (AI-based) or server_vad (silence-based)

pitch?|null

polly_voice?|null

redetection_threshold?|

Consecutive language disagreements needed to trigger re-detection (3-10, default 5)

speech_rate?|null

stt_detect_language?|

Enable Whisper language detection (only used when stt_language=auto)

stt_language?|

Whisper language (auto|en|fr|nl|de|pl|sv|da|fi|pt)

stt_model?|

Provider-specific STT model (OpenAI: whisper-1; Groq: whisper-large-v3, distil-whisper-large-v3-en)

stt_provider?|

Speech-to-text provider: aws|openai|groq

supported_languages?array<string>|

List of supported language codes for detection (e.g., ['en-US', 'fr-FR', 'nl-NL'])

tts_provider?|

Text-to-speech provider: aws (Polly) | cartesia (Sonic 3)

tts_voice?|

Provider-specific voice identifier: Polly name or Cartesia curated key/UUID

vad_sensitivity?|null

vad_silence_threshold_ms?|null

voice_switch_cooldown_ms?|

Minimum milliseconds between voice switches (5000-60000, default 15000)

volume?|null

Response Body

`application/json`

curl -X PUT "https://loading/api/v1/configuration/voice-settings" \  -H "Content-Type: application/json" \  -d '{}'

{}

{
  "detail": [
    {
      "loc": [
        "string"
      ],
      "msg": "string",
      "type": "string"
    }
  ]
}

Authorization

Request Body

Response Body

200application/json

422application/json

`application/json`

`application/json`