Skip to content
OnMSFT.com
  • Home
  • About
  • Contact
  • Windows
  • Surface
  • Xbox
  • How-To
  • OnPodcast
  • Edge
  • Teams
  • Gaming
Menu
  • Home
  • About
  • Contact
  • Windows
  • Surface
  • Xbox
  • How-To
  • OnPodcast
  • Edge
  • Teams
  • Gaming
  1. Home
  2. News
  3. Microsoft brings real-time translation to custom apps with Speech Translation API

Microsoft brings real-time translation to custom apps with Speech Translation API

Arif Bacchus Arif Bacchus
March 31, 2016
2 min read

Microsoft’s artificial intelligence technologies have long powered the Microsoft Translator API in both the Skype and the Microsoft Translator app. Now, Microsoft is expanding on this and is releasing a new version of a Microsoft Translator API which will add real-time speech translation capabilities to the existing text translation API.

This means that businesses will be able to add speech translation capabilities to their own services and bring a better experience to their customers. Available in eight languages, and spoken in 18 languages, the new version is also the first end-to-end speech translation solution optimized for real life conversations.

This speech translation service can also be used in a numerous amount of situations, both person to person, or human to machine. Examples include personal translation, subtitling, or remote or in person multilingual communications. It even can be used in groups, such as in online gaming chatrooms or real-time presentations such as keynotes.

Nonetheless, the process of translation is very challenging. Microsoft claims that it uses the latest AI technologies and that there is no other fully integrated speech solution available in the market today that would support real-life speech translation scenarios.

A look at how the Microsoft Translator API works

A look at how the Microsoft Translator API works

The full details on how this technology works can be seen below.

  1. Automatic Speech Recognition (ASR) — A deep neural network trained on thousands of hours of audio analyzes incoming speech. This model is trained on human-to-human interactions rather than human-to-machine commands, producing speech recognition that is optimized for normal conversations.
  2. TrueText — A Microsoft Research innovation, TrueText takes the literal text and transforms it to more closely reflect user intent. It achieves this by removing speech disfluencies, such as “um”s and “ah”s, as well as stutters and repetitions. The text is also made more readable and translatable by adding sentence breaks, proper punctuation and capitalization. 
  3. Translation — The text is translated into any of the 50+ languages supported by Microsoft Translator. The eight speech languages have been further optimized for conversations by training on millions of words of conversational data using deep neural networks powered language models.
  4. Text to Speech — If the target language is one of the eighteen speech languages supported, the text is converted into speech output using speech synthesis. This stage is omitted in speech-to-text translation scenarios such as video subtitling.

If you’re curious about the technology, a free 2-hour trial of the API is available by clicking here. You can also learn more about the API, and also read the documentation by visiting Microsoft’s Swagger page. 

Further reading: Developers, Microsoft, Microsoft Translator, Skype, Speech, Speech Translation

Share this article:
Tags:
Developers Microsoft Microsoft Translator Skype Speech Speech Translation
Previous Article Microsoft makes Azure IoT Suite available to everyone | On MSFT Next Article Gabe Aul gives deeper insight into the Windows Insider Program at BUILD 2016 | On MSFT

Related Articles

New Ryzen 9 9950X3D2 loses performance on air cooling

April 4, 2026

Legion Go 2 now costs $1,999 at Best Buy, pricing no longer makes sense

April 4, 2026

ELSA Launches GigaIO Gryf Portable AI System with Modular Design

April 4, 2026

Leave a Comment Cancel reply

Your email address will not be published. Required fields are marked *

Recent Posts

  • New Ryzen 9 9950X3D2 loses performance on air cooling
  • Legion Go 2 now costs $1,999 at Best Buy, pricing no longer makes sense
  • ELSA Launches GigaIO Gryf Portable AI System with Modular Design
  • NASA Artemis II astronauts face Outlook issues in space as mission hits unexpected software glitch
  • Microsoft Publisher Will Shut Down in October 2026 and Users Are Not Happy

Recent Comments

  1. XxRIVTYxX on Intel Says It Tried to Help Before Crimson Desert Dropped Arc Support
  2. Gaurav Kumar on Chrome Prepares Nudge to ‘Move Tabs to the Side’ as Vertical Tabs Near Release
OnMSFT.com

The Tech News Site

Categories

  • Windows
  • Surface
  • Xbox
  • How-To
  • OnPodcast
  • Gaming
  • Edge
  • Teams

Recent Posts

  • New Ryzen 9 9950X3D2 loses performance on air cooling
  • Legion Go 2 now costs $1,999 at Best Buy, pricing no longer makes sense
  • ELSA Launches GigaIO Gryf Portable AI System with Modular Design
  • NASA Artemis II astronauts face Outlook issues in space as mission hits unexpected software glitch
  • Microsoft Publisher Will Shut Down in October 2026 and Users Are Not Happy

Quick Links

  • About OnMSFT.com
  • Contact OnMSFT
  • Join Our Team
  • Privacy Policy
© 2010–2026 OnMSFT.com LLC. All rights reserved.
About OnMSFT.comContact OnMSFTPrivacy Policy