Speech to Text

Accurate, scalable transcription for any audio

Enterprise‑grade STT with diarization, timestamps, and privacy controls. Batch today, streaming on the roadmap.

Why SonarText STT

Accurate transcription from state-of-the-art models, with punctuation and casing, even on noisy calls

Optimized for batch processing, so transcription results come back quickly

Automatically identify speakers and split the transcript by speaker, so conversations are easy to follow

Enable timestamps by word or segment for accurate subtitles or for searchable references

Use the input and output formats that work best for your application

Industry-leading pricing to support your applications
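
As an illustration of the subtitle use case above, here is a minimal Python sketch that turns segment-level timestamps into SRT subtitle text. The segment shape used here (a list of dicts with `start` and `end` in seconds and a `text` field) is an assumption for illustration only, not the documented SonarText response format; check the API Reference for the actual field names.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text from segments.

    Assumed (hypothetical) segment shape:
    {"start": float seconds, "end": float seconds, "text": str}
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = srt_timestamp(seg["start"])
        end = srt_timestamp(seg["end"])
        # Each SRT cue: sequence number, time range, then the caption text.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

Calling `segments_to_srt(...)` on the parsed segments returns a string that can be written to a `.srt` file as is.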

Start with curl, Node, or Python.

curl -X POST https://api.sonartext.com/v1/transcribe \
  -H "Authorization: Bearer <API_KEY>" \
  -F file=@audio.mp3
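
For Python, the same request can be sketched with only the standard library. This mirrors the curl call above: the same URL, a Bearer token header, and a multipart form field named `file`. The helper names `build_transcribe_request` and `transcribe_file` are hypothetical, and the response is returned as raw text because its format is not described here.

```python
import mimetypes
import urllib.request
import uuid

API_URL = "https://api.sonartext.com/v1/transcribe"  # taken from the curl example


def build_transcribe_request(audio_bytes: bytes, filename: str, api_key: str) -> urllib.request.Request:
    """Build a multipart/form-data POST equivalent to `curl -F file=@<filename>`."""
    boundary = uuid.uuid4().hex
    # Guess the content type from the extension, falling back to a generic one.
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode(),
        f"Content-Type: {content_type}\r\n\r\n".encode(),
        audio_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(API_URL, data=body, headers=headers, method="POST")


def transcribe_file(path: str, api_key: str) -> str:
    """Send the file and return the raw response body (format not assumed here)."""
    with open(path, "rb") as f:
        request = build_transcribe_request(f.read(), path, api_key)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

A call such as `transcribe_file("audio.mp3", "<API_KEY>")` would then return the response body as a string.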

See the API Reference for full parameters.