Journal of Medical Internet Research (Dec 2014)

A Search Engine to Access PubMed Monolingual Subsets: Proof of Concept and Evaluation in French

  • Griffon, Nicolas,
  • Schuers, Matthieu,
  • Soualmia, Lina Fatima,
  • Grosjean, Julien,
  • Kerdelhué, Gaétan,
  • Kergourlay, Ivan,
  • Dahamna, Badisse,
  • Darmoni, Stéfan Jacques

DOI
https://doi.org/10.2196/jmir.3836
Journal volume & issue
Vol. 16, no. 12
p. e271

Abstract

Read online

BackgroundPubMed contains numerous articles in languages other than English. However, existing solutions to access these articles in the language in which they were written remain unconvincing. ObjectiveThe aim of this study was to propose a practical search engine, called Multilingual PubMed, which will permit access to a PubMed subset in 1 language and to evaluate the precision and coverage for the French version (Multilingual PubMed-French). MethodsTo create this tool, translations of MeSH were enriched (eg, adding synonyms and translations in French) and integrated into a terminology portal. PubMed subsets in several European languages were also added to our database using a dedicated parser. The response time for the generic semantic search engine was evaluated for simple queries. BabelMeSH, Multilingual PubMed-French, and 3 different PubMed strategies were compared by searching for literature in French. Precision and coverage were measured for 20 randomly selected queries. The results were evaluated as relevant to title and abstract, the evaluator being blind to search strategy. ResultsMore than 650,000 PubMed citations in French were integrated into the Multilingual PubMed-French information system. The response times were all below the threshold defined for usability (2 seconds). Two search strategies (Multilingual PubMed-French and 1 PubMed strategy) showed high precision (0.93 and 0.97, respectively), but coverage was 4 times higher for Multilingual PubMed-French. ConclusionsIt is now possible to freely access biomedical literature using a practical search tool in French. This tool will be of particular interest for health professionals and other end users who do not read or query sufficiently in English. The information system is theoretically well suited to expand the approach to other European languages, such as German, Spanish, Norwegian, and Portuguese.