OnMSFT.com

Researchers use bots and artificial intelligence to automatically tag and title videos

Laurent Giret
October 10, 2016

If you have ever uploaded pictures to OneDrive, you may have noticed that Microsoft’s cloud storage service can automatically tag your photos, categorize them, group them by location, and more. By adding this extra data to user-generated content, Microsoft’s artificial intelligence tools also make it easier for OneDrive users to find relevant pictures through OneDrive’s search feature.

But could artificial intelligence accomplish the same sort of magic with video content? That’s exactly what Chia-Wen Lin and Min Sun, professors in the Electrical Engineering department of National Tsing Hua University in Taiwan, are trying to do. In a new post on the Microsoft Research blog, the company explains that both professors partnered in 2015 with Dr. Tao Mei, lead researcher in multimedia at Microsoft Research Asia, who worked on a new image recognition, segmentation, and captioning dataset called COCO (Common Objects in Context).

Using the dataset, the professors built a system that leverages bots and artificial intelligence to determine the highlights of a video, add a title to it, and suggest people with whom to share it:

Professor Sun created a video title generation method based on deep learning to automatically find the special moments—or highlights—in videos, and generate an accurate and interesting title for the highlights. In parallel, Professor Lin developed a method to detect and cluster the faces in videos to provide richer summaries of the videos and relevant suggestions about whom to share them with. Working together, their algorithms can detect highlights, generate descriptions of highlights and tag potential viewers of user-generated videos.

It’s important to note that the system is ultimately designed to improve the discoverability of video content and help creators reach a bigger audience. Professor Sun and his students recently participated in the VideoToText challenge (sponsored by Microsoft Research) to improve the system, and the results of their work will be unveiled at the European Conference on Computer Vision, which is currently underway in Amsterdam. “Our research has taken us one step closer to the holy-grail of visual intelligence, understanding visual content in user-generated videos,” explained Professor Sun.

While this new research is definitely interesting, Microsoft had already paved the way with Azure Media Services. In April, the company announced new machine learning features for its cloud media streaming offering, including automatic video summarization, a speech-to-text indexer, a face recognition feature, and more.

Further reading: Artificial Intelligence, Data, Machine Learning, Microsoft, video


