Efficient SPARQL Queries Generator for Question Answering Systems

Yi-Hui Chen; Eric Jui-Lin Lu; Ying-Yen Lin

doi:10.1109/access.2022.3206794

IEEE Access (Jan 2022)

Efficient SPARQL Queries Generator for Question Answering Systems

Yi-Hui Chen,
Eric Jui-Lin Lu,
Ying-Yen Lin

Affiliations

Yi-Hui Chen: ORCiD; Department of Information Management, Chang Gung University, Taoyuan, Taiwan
Eric Jui-Lin Lu: ORCiD; Department of Management Information Systems, National Chung Hsing University, Taichung, Taiwan
Ying-Yen Lin: Department of Management Information Systems, National Chung Hsing University, Taichung, Taiwan

DOI: https://doi.org/10.1109/access.2022.3206794
Journal volume & issue: Vol. 10
pp. 99850 – 99860

Abstract

Read online

Much like traditional database querying, the question answering process in a Question Answering (QA) system involves converting a user’s question input into query grammar, querying the knowledge base through the query grammar, and finally returning the query result (i.e., the answer) to the user. The accuracy of query grammar generation is therefore important in determining whether a Question Answering system can produce a correct answer. Generally speaking, incorrect query grammar will never find the right answer. SPARQL is the most frequently used query language in question answering systems. In the past, SPARQL was generated based on graph structures, such as dependency trees, syntax trees and so on. However, the query cost of generating SPARQL is high, which creates long processing times to answer questions. To reduce the query cost, this work proposes a low-cost SPARQL generator named Light-QAWizard, which integrates multi-label classification into a recurrent neural network (RNN), builds a template classifier, and generates corresponding query grammars based on the results of template classifier. Light-QAWizard reduces query frequency to DBpedia by aggregating multiple outputs into a single output using multi-label classification. In the experimental results, Light-QAWizard’s performance on Precision, Recall and F-measure metrics were evaluated on the QALD-7, QALD8 and QALD-9 datasets. Not only did Light-QAWizard outperform all other models, but it also had a lower query cost that was nearly half that of QAWizard.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords