Information (Sep 2021)

Simple but Effective Knowledge-Based Query Reformulations for Precision Medicine Retrieval

  • Stefano Marchesin,
  • Giorgio Maria Di Nunzio,
  • Maristella Agosti

DOI
https://doi.org/10.3390/info12100402
Journal volume & issue
Vol. 12, no. 10
p. 402

Abstract

In Information Retrieval (IR), the semantic gap represents the mismatch between users’ queries and how retrieval models answer these queries. In this paper, we explore how to use external knowledge resources to enhance bag-of-words representations and reduce the effect of the semantic gap between queries and documents. In this regard, we propose several simple but effective knowledge-based query expansion and reduction techniques, and we evaluate them for the medical domain. The proposed query reformulations increase the probability of retrieving relevant documents by adding highly specific terms to, or removing them from, the original query. The experimental analyses on different test collections for Precision Medicine IR show the effectiveness of the developed techniques. In particular, a specific subset of query reformulations allows retrieval models to achieve top-performing results in all the considered test collections.

Keywords