Speechbase: Universal Text-to-Speech Gateway & Voice Management

Streams audio from the provider for low latency (provider pass-through). Pass either `voiceId` (to use a saved Voice) or `model` + `voice` (inline), not both. Whole-clip params (`volumeDbfs`, `output` format conversion) are not accepted here. Use POST /v1/audio/speech for those.
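The either/or rule above can be pre-checked on the client before a request is sent. The following is a minimal sketch in TypeScript, not part of the Speechbase API: the function name `checkStreamBody` and its messages are hypothetical, and the gateway's own validation remains authoritative.

```typescript
// Hypothetical client-side pre-check; the gateway still validates every request.
type StreamBody = Record<string, unknown>;

// Whole-clip params that the text above says are not accepted on the streaming endpoint.
const WHOLE_CLIP_PARAMS = ["volumeDbfs", "output"];

function checkStreamBody(body: StreamBody): string[] {
  const problems: string[] = [];
  const hasVoiceId = body.voiceId !== undefined;
  const hasModel = body.model !== undefined;
  const hasVoice = body.voice !== undefined;

  if (hasVoiceId && (hasModel || hasVoice)) {
    // Saved Voice and inline fields were mixed.
    problems.push("pass voiceId OR model + voice, not both");
  } else if (!hasVoiceId && !(hasModel && hasVoice)) {
    // Neither a saved Voice nor a complete inline pair was given.
    problems.push("pass voiceId, or both model and voice");
  }

  for (const key of WHOLE_CLIP_PARAMS) {
    if (body[key] !== undefined) {
      problems.push(`${key} is a whole-clip param; use POST /v1/audio/speech`);
    }
  }
  return problems;
}
```

An empty array means the body has a plausible shape; each string describes one problem to fix before calling the endpoint.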

Authorization

bearerAuth

Authorization: Bearer <token>

API key

In: header

Request Body

application/json

Variant 1: saved Voice (voiceId)

voiceId* string

Speechbase Voice UUID. The gateway resolves this to the underlying provider/model/voice at request time. Pass this OR (model + voice), never both.

Match: /^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$/i

text* string

Length: 1 <= length

providerOptions?

pronunciations?

moderation_ruleset_id? string

Match: /^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$/i

volumeDbfs? unknown

output? unknown

speed? unknown

enhance? unknown

Variant 2: inline (model + voice)

model* string

Provider and model in "<provider>/<model>" form. Required for inline calls. Pass this with voice, OR pass voiceId alone, never both.

Match: /^[a-z0-9-]+\/[a-zA-Z0-9._-]+$/

voice* string

Provider-native voice identifier (e.g. an ElevenLabs voice ID). Pass with model.

Length: 1 <= length

text* string

Length: 1 <= length

providerOptions?

pronunciations?

moderation_ruleset_id? string

Match: /^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$/i

volumeDbfs? unknown

output? unknown

speed? unknown

enhance? unknown
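The Match and Length constraints listed above can be mirrored in a small client-side validator. This is a sketch only: `validateFields` and the `FieldError` shape are hypothetical names chosen for illustration (the shape loosely mirrors the `validation` entries in the error body), and the patterns are copied from the schema as shown.

```typescript
// Patterns copied from the schema above; the UUID pattern is case-insensitive (/i).
const UUID_RE = /^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$/i;
const MODEL_RE = /^[a-z0-9-]+\/[a-zA-Z0-9._-]+$/;

// Hypothetical error shape for this sketch.
type FieldError = { path: string[]; message: string };

function validateFields(body: Record<string, unknown>): FieldError[] {
  const errors: FieldError[] = [];

  // Checks one field if it is present; absent fields are skipped here.
  const check = (key: string, ok: (v: string) => boolean, message: string): void => {
    const value = body[key];
    if (value === undefined) return;
    if (typeof value !== "string" || !ok(value)) {
      errors.push({ path: [key], message });
    }
  };

  check("voiceId", (v) => UUID_RE.test(v), "must be a UUID");
  check("moderation_ruleset_id", (v) => UUID_RE.test(v), "must be a UUID");
  check("model", (v) => MODEL_RE.test(v), "must be in provider/model form");
  check("voice", (v) => v.length >= 1, "must not be empty");
  check("text", (v) => v.length >= 1, "must not be empty");
  return errors;
}
```

Note that the model pattern only allows lowercase letters, digits, and hyphens before the slash, so a value with an uppercase provider segment is rejected by this check.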

Response Body

`application/problem+json`

`application/json`

curl -X POST "https://example.com/v1/audio/speech/stream" \
  -H "Authorization: Bearer $SPEECHBASE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "voiceId": "550e8400-e29b-41d4-a716-446655440000",
    "text": "Hello from a saved voice."
  }'
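The same request can be sketched in TypeScript, reading the audio chunk by chunk as it arrives. Everything named here is an assumption for illustration: `streamSpeech`, `FetchLike`, and the minimal response types are hypothetical, and the fetch implementation is injected so the sketch does not depend on a particular runtime's typings. The host `https://example.com` is the placeholder from the curl example.

```typescript
// Minimal structural types (hypothetical) covering only what this sketch uses.
interface ChunkReader {
  read(): Promise<{ done: boolean; value?: Uint8Array }>;
}
interface MinimalResponse {
  ok: boolean;
  status: number;
  body: { getReader(): ChunkReader } | null;
}
type FetchLike = (
  url: string,
  init: { method: string; headers: Record<string, string>; body: string },
) => Promise<MinimalResponse>;

async function streamSpeech(
  fetchImpl: FetchLike,
  baseUrl: string, // placeholder host, e.g. "https://example.com"
  apiKey: string,
  payload: Record<string, unknown>,
  onChunk: (chunk: Uint8Array) => void,
): Promise<number> {
  const res = await fetchImpl(`${baseUrl}/v1/audio/speech/stream`, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(payload),
  });
  if (!res.ok || res.body === null) {
    throw new Error(`stream request failed with status ${res.status}`);
  }

  const reader = res.body.getReader();
  let total = 0;
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    if (value) {
      onChunk(value); // hand each chunk to a player or file as it arrives
      total += value.length;
    }
  }
  return total; // total number of audio bytes received
}
```

In runtimes that provide a standard `fetch`, the global function should be structurally compatible with `FetchLike`, though a cast may be needed depending on the installed typings. Injecting the function also makes it easy to substitute a fake in tests.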

"string"

{
  "type": "string",
  "title": "string",
  "status": 0,
  "detail": "string",
  "code": "string",
  "validation": [
    {
      "path": [
        "string"
      ],
      "message": "string"
    }
  ],
  "provider": "string",
  "upstream_code": "string",
  "upstream_status": 0,
  "turn_index": 0
}

{
  "type": "string",
  "title": "string",
  "status": 0,
  "detail": "string",
  "code": "string",
  "validation": [
    {
      "path": [
        "string"
      ],
      "message": "string"
    }
  ],
  "provider": "string",
  "upstream_code": "string",
  "upstream_status": 0,
  "turn_index": 0
}

{
  "error": {
    "code": "content_moderation_blocked",
    "message": "string",
    "reason": {
      "type": "error_fail_closed"
    }
  }
}
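The two error shapes above can be told apart by the response content type. The sketch below assumes that `application/problem+json` bodies follow the first shape and `application/json` error bodies follow the `error` shape; the names `summarizeError`, `ProblemDetails`, and `ModerationError` are hypothetical, and every problem field is treated as optional because the source does not say which are always present.

```typescript
// Field names taken from the example bodies above; optionality is an assumption.
interface ProblemDetails {
  type?: string;
  title?: string;
  status?: number;
  detail?: string;
  code?: string;
  validation?: { path: string[]; message: string }[];
  provider?: string;
  upstream_code?: string;
  upstream_status?: number;
  turn_index?: number;
}
interface ModerationError {
  error: { code: string; message: string; reason?: { type: string } };
}

function summarizeError(contentType: string, bodyText: string): string {
  const parsed: unknown = JSON.parse(bodyText);

  if (contentType.startsWith("application/problem+json")) {
    const problem = parsed as ProblemDetails;
    const head = `${problem.title ?? "error"} (${problem.code ?? "unknown"})`;
    // Flatten per-field validation messages, if any, into "path: message" pairs.
    const fields = (problem.validation ?? []).map(
      (v) => `${v.path.join(".")}: ${v.message}`,
    );
    return fields.length > 0 ? `${head}: ${fields.join("; ")}` : head;
  }

  // Otherwise assume the application/json shape with a top-level `error` object.
  const moderation = parsed as ModerationError;
  return `${moderation.error.code}: ${moderation.error.message}`;
}
```

The function expects the raw response text plus the `Content-Type` header value, so it can be called after reading a non-2xx response from any HTTP client.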

Stream speech

200: audio/mpeg

400: application/problem+json

401: application/problem+json

422: application/json
