Automatic Image Captioning system created by Microsoft Research interns

Sean Cameron
November 19, 2014

Microsoft Research Automatic Captioning technique

The field of machine intelligence is notoriously difficult to either research or quantify. What one scientific discipline defines as intelligence can differ significantly from what another does, and so what counts as measurable progress varies enormously. With regard to machine or ‘artificial’ intelligence, there are numerous tests available, each of which purports to be the most accurate measure of ‘intelligence’.

These range from the famous Turing test, which challenges a machine to converse in natural language convincingly enough that a human judge cannot tell it apart from a person, to the less well-known BLEU and METEOR metrics, which score machine-generated text by how closely it matches human-written references. Each has significant issues, however: a machine capable of projecting a convincing enough illusion of intelligence can pass the Turing test, while the BLEU and METEOR metrics are accused of being a little too narrow in their focus.
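
To give a rough sense of why such metrics can be called narrow, here is a minimal Python sketch of clipped n-gram precision, the word-overlap idea at the core of BLEU. It is a simplified illustration, not the full BLEU metric (it omits the brevity penalty and the averaging over several n-gram sizes), and it is not the evaluation code used by the Microsoft Research team. The function names `ngrams` and `clipped_precision` and the sample captions are hypothetical.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams in a list of tokens."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, references, n=1):
    """Fraction of candidate n-grams that also occur in a reference.

    Each n-gram is credited at most as many times as it appears in any
    single reference, so repeating a common word cannot inflate the score.
    Tokenization is a plain lowercase whitespace split (an assumption).
    """
    cand_counts = Counter(ngrams(candidate.lower().split(), n))
    if not cand_counts:
        return 0.0

    # For every n-gram, record its highest count in any one reference.
    max_ref_counts = Counter()
    for ref in references:
        ref_counts = Counter(ngrams(ref.lower().split(), n))
        for gram, count in ref_counts.items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)

    matched = sum(min(count, max_ref_counts[gram])
                  for gram, count in cand_counts.items())
    return matched / sum(cand_counts.values())


if __name__ == "__main__":
    human = ["a dog runs on a beach"]
    print(clipped_precision("a dog on a beach", human, n=1))
    print(clipped_precision("a dog on a beach", human, n=2))
```

A caption can score well here simply by reusing the words of the reference, whether or not it conveys as much detail as a human would, which is the kind of narrowness critics point to.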

Despite these niggles and quibbles, the field of machine intelligence has made great strides in the last decade, and this seems set to continue. Recent work by a group of Microsoft Research interns is of particular interest: working together over the summer, the team of twelve interns and researchers managed to create an Automatic Image Captioning system.

This achievement is made all the more remarkable given the field in which it was made. Teaching a machine to understand an image is one thing; creating a program that can interpret an image in its own binary terms and then ‘translate’ what it finds into something a human can read, let alone make sense of, is something else altogether. That is exactly what the team managed, and it is another important stepping stone along the road to ‘true’ machine intelligence.

That isn’t to say that the program is without its quirks. When pitted against a human to describe an image, the program frequently failed to provide as much detail as the human did, even though it outscored the human on the BLEU metric and achieved a similar score on METEOR.

Deputy Managing Director John Platt notes,

“This type of collective progress is just awesome to see. Image captioning is a fascinating and important problem, and I would like to better understand the strengths and weaknesses of these approaches. (I note that several people used recurrent neural networks and/or LSTM models). As a field, if we can agree on standardized test sets (such as COCO), and standard metrics, we’ll continue to move closer to that goal of creating a system that can automatically generate descriptive captions of an image as well as a human. The results from our work this summer and from others suggests we’re moving in the right direction.”

With Facebook making similar advancements in facial recognition technology, the field of ‘deep’ intelligence looks to be an area of major growth for at least the next decade. What other advancements will be made remains to be seen; what is sure is that, though this step is small, it is significant.

Tags:
Microsoft Research