Data Intelligence (Jun 2019)

Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding

  • Ji, Lei
  • Wang, Yujing
  • Shi, Botian
  • Zhang, Dawei
  • Wang, Zhongyuan
  • Yan, Jun

DOI
https://doi.org/10.1162/dint_a_00013
Journal volume & issue
Vol. 1, no. 3
pp. 238 – 270

Abstract

Knowledge is important for text-related applications. In this paper, we introduce Microsoft Concept Graph, a knowledge graph engine that provides concept tagging APIs to facilitate the understanding of human languages. Microsoft Concept Graph is built upon Probase, a universal probabilistic taxonomy consisting of instances and concepts mined from the Web. We start by introducing the construction of the knowledge graph through iterative semantic extraction and taxonomy construction procedures, which extract 2.7 million concepts from 1.68 billion Web pages. We then use conceptualization models to represent text in the concept space to empower text-related applications, such as topic search, query recommendation, Web table understanding and Ads relevance. Since its release in 2016, Microsoft Concept Graph has received more than 100,000 pageviews, 2 million API calls and 3,000 registered downloads from 50,000 visitors across 64 countries.
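
As a rough illustration of what representing text in the concept space can mean, the sketch below maps each known instance in a short text to a probability distribution over concepts and averages those distributions. It is a minimal sketch under stated assumptions, not the conceptualization model of the paper and not the Microsoft Concept Graph API: the isA counts in `TOY_ISA_COUNTS` are invented for illustration, and the helper names `concept_distribution` and `conceptualize` are hypothetical.

```python
from collections import defaultdict

# Hypothetical toy isA counts (instance -> concept -> count).
# These numbers are invented for illustration; they are NOT Probase data.
TOY_ISA_COUNTS = {
    "apple": {"fruit": 6, "company": 4},
    "microsoft": {"company": 9, "brand": 1},
    "banana": {"fruit": 10},
}


def concept_distribution(instance, counts=TOY_ISA_COUNTS):
    """Estimate P(concept | instance) by normalizing the isA counts."""
    concept_counts = counts.get(instance.lower(), {})
    total = sum(concept_counts.values())
    if total == 0:
        return {}
    return {concept: n / total for concept, n in concept_counts.items()}


def conceptualize(text, counts=TOY_ISA_COUNTS):
    """Average the per-instance concept distributions of a short text.

    Tokens are split on whitespace; tokens missing from the table are
    ignored. Concepts are returned in descending order of score.
    """
    scores = defaultdict(float)
    matched = 0
    for token in text.lower().split():
        dist = concept_distribution(token, counts)
        if not dist:
            continue
        matched += 1
        for concept, prob in dist.items():
            scores[concept] += prob
    if matched == 0:
        return {}
    ranked = sorted(scores.items(), key=lambda item: -item[1])
    return {concept: score / matched for concept, score in ranked}


if __name__ == "__main__":
    print(conceptualize("apple microsoft"))
```

In this toy setting the ambiguous instance "apple" leans toward the "company" concept once "microsoft" appears in the same text, which is the kind of context-driven disambiguation that a concept-space representation is meant to support.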