Amazon Transcribe

Amazon Transcribe is an automatic speech recognition service that adds speech-to-text capabilities to applications.

Amazon Transcribe is a fully managed service powered by a multi-billion-parameter speech foundation model, continuously trained on millions of hours of audio data across a variety of languages. It delivers high-accuracy transcriptions from either recorded speech or real-time audio.

Generate accurate transcriptions in real time

Transcribe produces accurate transcriptions of speech, even in noisy environments. It can be used on pre-recorded audio, or in real time to add subtitles to streamed content. With automatic language detection and automatic punctuation, it can deliver immediate, production-quality transcriptions and summaries of speeches, conversations, and meetings.
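As a rough illustration of how punctuated text can be recovered from a transcription result, the sketch below assumes the JSON layout that Transcribe batch jobs typically write, where each entry in `results.items` is either a `pronunciation` or a `punctuation` item. The function name and the sample data are hypothetical, and real output carries more fields (timestamps, confidence scores) than shown here.

```python
import json


def transcript_from_items(result_json: str) -> str:
    """Rebuild punctuated text from the `items` list of a transcription result."""
    items = json.loads(result_json)["results"]["items"]
    parts = []
    for item in items:
        # Each item lists alternatives; this sketch simply takes the first one.
        content = item["alternatives"][0]["content"]
        if item["type"] == "punctuation" and parts:
            # Attach punctuation to the preceding word instead of spacing it out.
            parts[-1] += content
        else:
            parts.append(content)
    return " ".join(parts)


# Hand-written sample in the assumed layout, not real service output.
SAMPLE = json.dumps({
    "results": {
        "items": [
            {"type": "pronunciation", "alternatives": [{"content": "Hello"}]},
            {"type": "punctuation", "alternatives": [{"content": ","}]},
            {"type": "pronunciation", "alternatives": [{"content": "world"}]},
            {"type": "punctuation", "alternatives": [{"content": "."}]},
        ]
    }
})

print(transcript_from_items(SAMPLE))
```

The same item list could be grouped by timestamp to build subtitle cues, which is left out here to keep the sketch short.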

Unlock insights trapped in your audio and video content

Combining transcribed speech with generative AI makes it possible to identify key features and action points. Use this to summarise and interpret data, provide agent assistance during calls, and make your media content searchable. Transcribe supports custom models that understand domain-specific vocabulary and terms, improving accuracy and relevance.

Incorporate voice commands into any application

Fast, reliable transcription lets applications recognise spoken instructions and use them to trigger automated tasks, whether they arrive as stand-alone commands or are extracted from longer recordings such as calls.
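One simple way to turn transcribed text into actions is a phrase-to-handler lookup, sketched below. This is an illustrative assumption rather than a Transcribe feature: the command phrases, the handlers, and the function names are all hypothetical, and a production system would more likely use an intent model than exact phrase matching.

```python
import re
from typing import Callable, Dict, List


def normalise(text: str) -> str:
    """Lower-case and strip punctuation so matching ignores transcript formatting."""
    return re.sub(r"[^a-z0-9 ]+", "", text.lower()).strip()


def find_commands(transcript: str, commands: Dict[str, Callable[[], str]]) -> List[str]:
    """Run the handler for each known phrase found in the transcript, in spoken order."""
    text = normalise(transcript)
    hits = []
    for phrase, handler in commands.items():
        # Whole-phrase matches only, so "start recording" does not match "restart recordings".
        for match in re.finditer(r"\b" + re.escape(phrase) + r"\b", text):
            hits.append((match.start(), handler))
    return [handler() for _, handler in sorted(hits, key=lambda hit: hit[0])]


# Hypothetical commands; each handler returns a short status string.
COMMANDS = {
    "start recording": lambda: "recording started",
    "open the report": lambda: "report opened",
}

print(find_commands("Okay, open the report and then start recording.", COMMANDS))
```

Because matching runs over the whole transcript, the same function handles a single stand-alone command and commands embedded in a longer call.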

Ensure privacy, safety, and inclusive environments

Transcribe can use domain-specific knowledge to pinpoint sensitive data and redact it. Its built-in Toxicity Detection feature uses both the content of speech and audio cues, such as tone and pitch, to identify potentially hostile or toxic content and help maintain a safe environment for your users.
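For batch jobs, redaction is requested as part of the job settings. The sketch below only builds the request parameters, assuming the `ContentRedaction` setting of the `StartTranscriptionJob` API and US English audio. The helper name, job name, and bucket are hypothetical, and Toxicity Detection is configured separately and is not shown.

```python
from typing import Any, Dict


def build_redaction_request(job_name: str, media_uri: str) -> Dict[str, Any]:
    """Build parameters for a batch transcription job that redacts personal data."""
    return {
        "TranscriptionJobName": job_name,
        "LanguageCode": "en-US",  # assumption: US English audio
        "Media": {"MediaFileUri": media_uri},
        "ContentRedaction": {
            "RedactionType": "PII",
            # Request only the redacted transcript, not the unredacted one.
            "RedactionOutput": "redacted",
        },
    }


# Hypothetical job name and S3 location.
request = build_redaction_request("example-job", "s3://example-bucket/call.wav")

# With boto3 installed and AWS credentials configured, this could be submitted as:
#   boto3.client("transcribe").start_transcription_job(**request)
print(request)
```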

Our work with Transcribe

Softwire partnered with SentiraXR, developers of cutting-edge virtual reality simulations for training in the medical field. We developed a VR environment package that uses Transcribe to interpret the user’s speech in real time, so that speech can drive the simulation.

This application required instant capture and interpretation of user vocalisations, together with additional AI processing to build an utterance-to-intent model from samples of real utterances.
