Speech to Text
Accurate, scalable transcription for any audio
Enterprise‑grade STT with diarization, timestamps, and privacy controls. Batch today, streaming on the roadmap.
Why SonarText STT
Accuracy
Accurate transcripts from state-of-the-art models, with punctuation and casing, even on noisy calls
Speed
Optimized batch processing that returns transcripts quickly
Diarization
Automatically identify speakers and split the transcript by speaker so conversations are easy to follow
Timestamps
Enable timestamps by word or segment for accurate subtitles or for searchable references
Flexibility
Use the input and output formats that work best for your application
Pricing
Industry-leading pricing to support your applications
Build in Minutes
Start with curl, Node, or Python.
curl -X POST https://api.sonartext.com/v1/transcribe \
  -H "Authorization: Bearer <API_KEY>" \
  -F file=@audio.mp3
See the API Reference for full parameters.
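For Python, the curl request above can be sketched using only the standard library. The endpoint, the Bearer header, and the `file` form field come from the curl example. Everything else is an assumption for illustration: the helper names, the `audio/mpeg` content type, and the `SONARTEXT_API_KEY` environment variable. The response format is not described here, so the sketch returns the raw response body; check the API Reference for the actual schema.

```python
import os
import urllib.request
import uuid

# Endpoint taken from the curl example above.
API_URL = "https://api.sonartext.com/v1/transcribe"


def build_request(api_key, filename, audio_bytes, content_type="audio/mpeg"):
    """Build a multipart/form-data POST equivalent to `curl -F file=@...`.

    The content type default is an assumption; adjust it for other formats.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=head + audio_bytes + tail,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def transcribe(api_key, path):
    """Upload one audio file and return the raw response body as text."""
    with open(path, "rb") as f:
        audio = f.read()
    request = build_request(api_key, os.path.basename(path), audio)
    with urllib.request.urlopen(request) as response:
        # Response schema is not specified on this page, so no parsing here.
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # SONARTEXT_API_KEY is a hypothetical variable name for this sketch.
    print(transcribe(os.environ["SONARTEXT_API_KEY"], "audio.mp3"))
```

Building the request separately from sending it makes it possible to inspect the headers and body without touching the network.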