PLoS Computational Biology (May 2019)

DigChem: Identification of disease-gene-chemical relationships from Medline abstracts.

  • Jeongkyun Kim,
  • Jung-Jae Kim,
  • Hyunju Lee

DOI
https://doi.org/10.1371/journal.pcbi.1007022
Journal volume & issue
Vol. 15, no. 5
p. e1007022

Abstract

Read online

Chemicals interact with genes in the process of disease development and treatment. Although much biomedical research has been performed to understand relationships among genes, chemicals, and diseases, which have been reported in biomedical articles in Medline, there are few studies that extract disease-gene-chemical relationships from biomedical literature at a PubMed scale. In this study, we propose a deep learning model based on bidirectional long short-term memory to identify the evidence sentences of relationships among genes, chemicals, and diseases from Medline abstracts. Then, we develop the search engine DigChem to enable disease-gene-chemical relationship searches for 35,124 genes, 56,382 chemicals, and 5,675 diseases. We show that the identified relationships are reliable by comparing them with manual curation and existing databases. DigChem is available at http://gcancer.org/digchem.