Speechbase

Word-level timestamps

  • api

The /with-timestamps endpoints return word-level timing alongside your audio, so you can build synced captions, karaoke-style highlighting, or anything that needs to know exactly when each word is spoken.
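As a rough illustration of the captions use case, the sketch below groups word timings into WebVTT cues. It does not call the API. The `WordTiming` shape (`word`, `start`, `end`, in seconds) and the helper names are assumptions made for this example, so check the actual response fields before adapting it.

```python
from dataclasses import dataclass


@dataclass
class WordTiming:
    # Hypothetical shape: the real response fields may be named differently.
    word: str
    start: float  # seconds from the start of the audio
    end: float    # seconds from the start of the audio


def format_timestamp(seconds: float) -> str:
    """Render a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def words_to_webvtt(words: list[WordTiming], max_words: int = 7) -> str:
    """Group consecutive words into cues of at most `max_words` words each."""
    lines = ["WEBVTT", ""]
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        # A cue spans from the first word's start to the last word's end.
        start = format_timestamp(chunk[0].start)
        end = format_timestamp(chunk[-1].end)
        lines.append(f"{start} --> {end}")
        lines.append(" ".join(w.word for w in chunk))
        lines.append("")
    return "\n".join(lines)
```

Splitting on a fixed word count keeps the sketch short. A real captioner would more likely break cues on punctuation or on pauses between words. The same per-word `start` and `end` values could also drive karaoke-style highlighting by looking up which word covers the current playback time.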

It works across providers: Speechbase uses a provider's native alignment where one is offered and supplies the word timings itself where it isn't.