EAI Endorsed Transactions on Industrial Networks and Intelligent Systems (Apr 2016)

Knowledge Extraction Framework for Building a Largescale Knowledge Base

  • Haklae Kim,
  • Liang He,
  • Ying Di

DOI
https://doi.org/10.4108/eai.21-4-2016.151157
Journal volume & issue
Vol. 3, no. 7

Abstract

Read online

As the Web has already permeated to life styles of human beings, people tend to consume more data in online spaces, and to exchange their behaviours among others. Simultaneously, various intelligent services are available for us such as virtual assistants, semantic search and intelligent recommendation. Most of these services have their own knowledge bases, however, constructing a knowledge base has a lot of different technical issues. In this paper, we propose a knowledge extraction framework, which comprises of several extraction components for processing various data formats such as metadata and web tables on web documents. Thus, this framework can be used for extracting a set of knowledge entities from large-scale web documents. Most of existing methods and tools tend to concentrate on obtaining knowledge from a specific format. Compared to them, this framework enables to handle various formats, and simultaneously extracted entities are interlinked to a knowledge base by automatic semantic matching. We will describe detailed features of each extractor and will provide some evaluation of them.

Keywords