Speechbase
← BackProduct Updates

MiniMax Speech 2.8

  • api

You can now route to MiniMax Speech 2.8 through the gateway in two variants: minimax/speech-2.8-hd for the highest audio quality and minimax/speech-2.8-turbo for faster, lower-cost synthesis.

2.8 is built for long-form work. It holds voice timbre and emotional tone steady across long passages, the point where most models start to drift, which makes it a strong fit for audiobooks, narrated articles, and long documentation read-alouds. It speaks 40+ languages, clones a voice from about 10 seconds of reference audio, and renders natural pauses and sound tags like laughs and sighs inline.

Reach for HD when you're producing finished audio and Turbo when latency and cost matter, such as voice agents.