
Hume

Hume Octave text-to-speech through the Speechbase gateway, with expressive voices, voice cloning, and native timestamps on octave-2.

Prefix: hume
Default model: octave-2
Provider key: Connect under Provider Keys

Route to Hume by prefixing the model with hume/. Octave is built for expressive, emotionally aware delivery.

Models

Model    | Streaming | Voice cloning | Timestamps   | Max input
octave-2 | Yes       | Yes           | Native       | 5000
octave-1 | Yes       | -             | STT fallback | 5000

Usage

curl -X POST https://api.speechbase.ai/v1/audio/speech \
  -H "Authorization: Bearer $SPEECHBASE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "mode": "inline",
    "model": "hume/octave-2",
    "voice": "KORA",
    "text": "Hello from Speechbase!",
    "output": "mp3"
  }' --output hello.mp3

voice is a Hume voice name such as KORA, AURA, STELLA, or DACHER.

Voice cloning

octave-2 supports voice cloning. Register a reference as a saved Voice and address it by voiceId with mode: "voice".
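
A request against a saved Voice might look like the sketch below. It reuses the endpoint and headers from the Usage example and assumes that voiceId sits at the top level of the body next to mode; the source does not spell out the exact body shape, nor whether model is still needed in this mode. The ID voice_123 and the output file name are placeholders. The request is only sent when SPEECHBASE_API_KEY is set.

```shell
# Sketch only: the placement of voiceId and the ID value are assumptions.
VOICE_ID="voice_123"   # hypothetical ID of a saved Voice

# Build the JSON body; the unquoted heredoc lets $VOICE_ID expand.
PAYLOAD=$(cat <<EOF
{
  "mode": "voice",
  "model": "hume/octave-2",
  "voiceId": "$VOICE_ID",
  "text": "Hello from Speechbase!",
  "output": "mp3"
}
EOF
)

# Send the request only when an API key is configured.
if [ -n "${SPEECHBASE_API_KEY:-}" ]; then
  curl -X POST https://api.speechbase.ai/v1/audio/speech \
    -H "Authorization: Bearer $SPEECHBASE_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$PAYLOAD" --output cloned.mp3
fi
```

Keeping the body in a variable makes it easy to print and inspect before sending.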

Provider options

Anything in providerOptions is forwarded to the Hume API unchanged, for example an acting_instructions description to steer the performance.
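
As an illustration, the Usage request could carry an acting instruction as sketched below. This assumes providerOptions is a top-level JSON object in the request body and that acting_instructions takes a free-text string; the instruction text and output file name are invented for the example. The request is only sent when SPEECHBASE_API_KEY is set.

```shell
# Sketch only: the shape of providerOptions is an assumption.
# The quoted heredoc ('EOF') keeps the JSON literal, with no shell expansion.
PAYLOAD=$(cat <<'EOF'
{
  "mode": "inline",
  "model": "hume/octave-2",
  "voice": "KORA",
  "text": "We did it. We actually did it.",
  "output": "mp3",
  "providerOptions": {
    "acting_instructions": "breathless, quietly triumphant"
  }
}
EOF
)

# Send the request only when an API key is configured.
if [ -n "${SPEECHBASE_API_KEY:-}" ]; then
  curl -X POST https://api.speechbase.ai/v1/audio/speech \
    -H "Authorization: Bearer $SPEECHBASE_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$PAYLOAD" --output directed.mp3
fi
```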
