Journal of Medical Internet Research (Feb 2015)

SimQ: Real-Time Retrieval of Similar Consumer Health Questions

  • Luo, Jake,
  • Zhang, Guo-Qiang,
  • Wentz, Susan,
  • Cui, Licong,
  • Xu, Rong

DOI
https://doi.org/10.2196/jmir.3388
Journal volume & issue
Vol. 17, no. 2
p. e43

Abstract

Background: There has been a significant increase in the popularity of Web-based question-and-answer (Q&A) services that provide health care information for consumers. Large numbers of Q&As have been archived in these online communities, forming a valuable knowledge base for consumers who seek answers to their health care concerns. However, because consumers may lack professional knowledge, it is still very challenging for them to find Q&As that are closely relevant to their own health problems. Consumers often ask questions similar to ones that other users have already asked and had answered.

Objective: In this study, we aim to develop efficient informatics methods that can retrieve similar Web-based consumer health questions using syntactic and semantic analysis.

Methods: We propose SimQ to achieve this objective. SimQ is an informatics framework that compares the similarity of archived health questions and retrieves answers to satisfy consumers’ information needs. Statistical syntactic parsing was used to analyze each question’s syntactic structure. The standardized Unified Medical Language System (UMLS) was employed to annotate semantic types and extract medical concepts. Finally, the similarity between sentences was calculated using both semantic and syntactic features.

Results: We used 2000 randomly selected consumer questions to evaluate the system’s performance. The results show that SimQ achieved its highest precision of 72.2%, recall of 78.0%, and F-score of 75.0% when using compositional feature representations.

Conclusions: We demonstrated that SimQ complements the existing Q&A services of Netwellness, a not-for-profit community-based consumer health information service that consists of nearly 70,000 Q&As and serves over 3 million users each year. SimQ not only reduces response delay by instantly providing closely related questions and answers, but also helps consumers improve their understanding of their health concerns.
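
The abstract does not state which similarity measure SimQ uses or how its syntactic and semantic features are weighted. As a minimal sketch only, the following Python assumes each question has already been reduced to two feature sets (syntactic tags from a parser and UMLS semantic types), compares them with Jaccard overlap, and mixes the two scores with an equal weight. The function names, the feature values, the Jaccard measure, and the 0.5 weight are all illustrative assumptions, not the authors' method. The last function checks that the reported F-score is consistent with the reported precision and recall, assuming the F-score is the usual harmonic mean of the two.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard overlap of two feature sets (0.0 when both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def combined_similarity(q1: dict, q2: dict, weight_semantic: float = 0.5) -> float:
    """Weighted mix of semantic and syntactic overlap.

    Each question is a dict with hypothetical keys "syntactic" and
    "semantic", both holding sets of feature labels. The weighting
    scheme is an assumption made for illustration.
    """
    semantic = jaccard(q1["semantic"], q2["semantic"])
    syntactic = jaccard(q1["syntactic"], q2["syntactic"])
    return weight_semantic * semantic + (1.0 - weight_semantic) * syntactic


def f_score(precision: float, recall: float) -> float:
    """Balanced F-score: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative, hand-written features for two questions (not real parser output).
question_a = {
    "syntactic": {"WHNP", "SQ", "VP"},
    "semantic": {"Disease or Syndrome", "Sign or Symptom"},
}
question_b = {
    "syntactic": {"WHNP", "SQ", "NP"},
    "semantic": {"Disease or Syndrome"},
}

similarity = combined_similarity(question_a, question_b)

# Consistency check on the figures reported in the Results.
reported_f = f_score(0.722, 0.780)
```

In a retrieval setting, a score like `similarity` would be computed between a new question and each archived question, and the highest-scoring archived questions returned along with their answers.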