IEEE Access (Jan 2018)
Automatic Non-Taxonomic Relation Extraction from Big Data in Smart City
Abstract
The explosive data growth in smart city is making domain big data a hot topic for knowledge extraction. Non-taxonomic relations refer to any relations between concept pairs except the is-a relation, which is an important part of Knowledge Graph. In this paper, toward big data in smart city, we present a multi-phase correlation search framework to automatically extract non-taxonomic relations from domain documents. Different kinds of semantic information are used to improve the performance of the system. First, inspired by the works of network representation; we propose a Semantic Graph-Based method to combine structure information of semantic graph and context information of terms together for nontaxonomic relationships identification. Second, different semantic types of verb sets are extracted based on the dependency syntactic information, which are ranked to act as non-taxonomic relationship labels. Extensive experiments demonstrate the efficiency of the proposed framework. The F1 value reaches 81.4% for identification of non-taxonomic relationships. The total precision of the non-taxonomic relationship labels extraction is 73.4%, and 87.8% non-taxonomic relations can be provided with “good”labels. We hope this article can provide a useful way for domain big data knowledge extraction in smart city.
Keywords