BMC Bioinformatics (Sep 2022)

Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing

  • Yi Nian,
  • Xinyue Hu,
  • Rui Zhang,
  • Jingna Feng,
  • Jingcheng Du,
  • Fang Li,
  • Larry Bu,
  • Yuji Zhang,
  • Yong Chen,
  • Cui Tao

DOI
https://doi.org/10.1186/s12859-022-04934-1
Journal volume & issue
Vol. 23, no. S6
pp. 1 – 15

Abstract

Read online

Abstract Background To date, there are no effective treatments for most neurodegenerative diseases. Knowledge graphs can provide comprehensive and semantic representation for heterogeneous data, and have been successfully leveraged in many biomedical applications including drug repurposing. Our objective is to construct a knowledge graph from literature to study the relations between Alzheimer’s disease (AD) and chemicals, drugs and dietary supplements in order to identify opportunities to prevent or delay neurodegenerative progression. We collected biomedical annotations and extracted their relations using SemRep via SemMedDB. We used both a BERT-based classifier and rule-based methods during data preprocessing to exclude noise while preserving most AD-related semantic triples. The 1,672,110 filtered triples were used to train with knowledge graph completion algorithms (i.e., TransE, DistMult, and ComplEx) to predict candidates that might be helpful for AD treatment or prevention. Results Among three knowledge graph completion models, TransE outperformed the other two (MR = 10.53, Hits@1 = 0.28). We leveraged the time-slicing technique to further evaluate the prediction results. We found supporting evidence for most highly ranked candidates predicted by our model which indicates that our approach can inform reliable new knowledge. Conclusion This paper shows that our graph mining model can predict reliable new relationships between AD and other entities (i.e., dietary supplements, chemicals, and drugs). The knowledge graph constructed can facilitate data-driven knowledge discoveries and the generation of novel hypotheses.

Keywords