Frontiers in Genetics (Dec 2018)

RDAD: A Machine Learning System to Support Phenotype-Based Rare Disease Diagnosis

  • Jinmeng Jia,
  • Ruiyuan Wang,
  • Zhongxin An,
  • Yongli Guo,
  • Xi Ni,
  • Tieliu Shi,
  • Tieliu Shi

DOI
https://doi.org/10.3389/fgene.2018.00587
Journal volume & issue
Vol. 9

Abstract

Read online

DNA sequencing has allowed for the discovery of the genetic cause for a considerable number of diseases, paving the way for new disease diagnostics. However, due to the lack of clinical samples and records, the molecular cause for rare diseases is always hard to identify, significantly limiting the number of rare Mendelian diseases diagnosed through sequencing technologies. Clinical phenotype information therefore becomes a major resource to diagnose rare diseases. In this article, we adopted both a phenotypic similarity method and a machine learning method to build four diagnostic models to support rare disease diagnosis. All the diagnostic models were validated using the real medical records from RAMEDIS. Each model provides a list of the top 10 candidate diseases as the prediction outcome and the results showed that all models had a high diagnostic precision (≥98%) with the highest recall reaching up to 95% while the models with machine learning methods showed the best performance. To promote effective diagnosis for rare disease in clinical application, we developed the phenotype-based Rare Disease Auxiliary Diagnosis system (RDAD) to assist clinicians in diagnosing rare diseases with the above four diagnostic models. The system is freely accessible through http://www.unimd.org/RDAD/.

Keywords