StableVoice API
Reserve an audio output slot on StableUpload, then call /api/speech. Poll the returned job ID with SIWX until the audio URL is ready.
Models
Chatterbox Turbo
Fast 350M English model for voice agents, narration, and native paralinguistic tags.
Options: temperature, topP, topK, repetitionPenalty, normalizeReferenceLoudness
Chatterbox
Expressive English model with CFG, exaggeration, min-p, top-p, and temperature controls.
Options: temperature, topP, minP, repetitionPenalty, exaggeration, cfgWeight
Chatterbox Multilingual
Multilingual zero-shot TTS across 23 languages with voice cloning and creative controls.
Options: language, temperature, topP, minP, repetitionPenalty, exaggeration, cfgWeight
Catalog
Voices: Aaron, Abigail, Anaya, Andy, Archer, Brian, Chloe, Dylan, Emmanuel, Ethan, Evelyn, Gavin, Gordon, Ivan, Laura, Lucy, Madison, Marisol, Meera, Walter.
Aaron
Grounded and practical, with a balanced delivery that stays out of the way.
Abigail
Bright and approachable, useful when the product should sound friendly without getting silly.
Anaya
Crisp and energetic, good for short reads that need momentum and clarity.
Andy
Casual and conversational, with a dry edge that works for informal narration.
Archer
Confident and composed, suited to higher-drama launches and cinematic reads.
Brian
Steady and technical, a low-friction choice for operational or engineering content.
Chloe
Light and playful, best when small interface moments should feel more alive.
Dylan
Relaxed and understated, good for natural narration that should not feel overproduced.
Emmanuel
Polished and articulate, a dependable voice for structured explanation.
Ethan
Upbeat and clear, useful for task-oriented reads with forward motion.
Evelyn
Smooth and expressive, good for warmer flows where reassurance matters.
Gavin
Bold and animated, suited to high-energy content that needs presence.
Gordon
Measured and authoritative, good when the read should feel stable and serious.
Ivan
Precise and lightly comic, useful for technical reads with a deadpan tone.
Laura
Clear and friendly, a practical choice for help content and product education.
Lucy
Balanced and lively, the safest default for general assistant or product narration.
Madison
Polished and upbeat, useful for media-ready content and confident product copy.
Marisol
Warm and personable, good for conversational products and hospitality-style flows.
Meera
Thoughtful and calm, suited to longer explanation and reflective narration.
Walter
Classic and characterful, useful for deliberate reads with a little ceremony.
Output formats: wav, mp3.
Endpoints
GET /api/voicesSIWX model and voice catalog.GET /api/voice-samplesSIWX voice sample catalog.POST /api/speechpaid TTS job.GET /api/jobs/:jobIdSIWX status.GET /api/jobsSIWX job list.