StableVoice API

Reserve an audio output slot on StableUpload, then call /api/speech. Poll the returned job ID with SIWX until the audio URL is ready.

Models

Chatterbox Turbo

Fast 350M English model for voice agents, narration, and native paralinguistic tags.

Options: temperature, topP, topK, repetitionPenalty, normalizeReferenceLoudness

Chatterbox

Expressive English model with CFG, exaggeration, min-p, top-p, and temperature controls.

Options: temperature, topP, minP, repetitionPenalty, exaggeration, cfgWeight

Chatterbox Multilingual

Multilingual zero-shot TTS across 23 languages with voice cloning and creative controls.

Options: language, temperature, topP, minP, repetitionPenalty, exaggeration, cfgWeight

Catalog

Voices: Aaron, Abigail, Anaya, Andy, Archer, Brian, Chloe, Dylan, Emmanuel, Ethan, Evelyn, Gavin, Gordon, Ivan, Laura, Lucy, Madison, Marisol, Meera, Walter.

Aaron

Grounded and practical, with a balanced delivery that stays out of the way.

Abigail

Bright and approachable, useful when the product should sound friendly without getting silly.

Anaya

Crisp and energetic, good for short reads that need momentum and clarity.

Andy

Casual and conversational, with a dry edge that works for informal narration.

Archer

Confident and composed, suited to higher-drama launches and cinematic reads.

Brian

Steady and technical, a low-friction choice for operational or engineering content.

Chloe

Light and playful, best when small interface moments should feel more alive.

Dylan

Relaxed and understated, good for natural narration that should not feel overproduced.

Emmanuel

Polished and articulate, a dependable voice for structured explanation.

Ethan

Upbeat and clear, useful for task-oriented reads with forward motion.

Evelyn

Smooth and expressive, good for warmer flows where reassurance matters.

Gavin

Bold and animated, suited to high-energy content that needs presence.

Gordon

Measured and authoritative, good when the read should feel stable and serious.

Ivan

Precise and lightly comic, useful for technical reads with a deadpan tone.

Laura

Clear and friendly, a practical choice for help content and product education.

Lucy

Balanced and lively, the safest default for general assistant or product narration.

Madison

Polished and upbeat, useful for media-ready content and confident product copy.

Marisol

Warm and personable, good for conversational products and hospitality-style flows.

Meera

Thoughtful and calm, suited to longer explanation and reflective narration.

Walter

Classic and characterful, useful for deliberate reads with a little ceremony.

Output formats: wav, mp3.

Endpoints