BMC Bioinformatics (Dec 2019)

Benchmarking machine learning models for late-onset alzheimer’s disease prediction from genomic data

  • Javier De Velasco Oriol,
  • Edgar E. Vallejo,
  • Karol Estrada,
  • José Gerardo Taméz Peña,
  • The Alzheimer’s Disease Neuroimaging Initiative

DOI
https://doi.org/10.1186/s12859-019-3158-x
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Background Late-Onset Alzheimer’s Disease (LOAD) is a leading form of dementia. There is no effective cure for LOAD, leaving the treatment efforts to depend on preventive cognitive therapies, which stand to benefit from the timely estimation of the risk of developing the disease. Fortunately, a growing number of Machine Learning methods that are well positioned to address this challenge are becoming available. Results We conducted systematic comparisons of representative Machine Learning models for predicting LOAD from genetic variation data provided by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort. Our experimental results demonstrate that the classification performance of the best models tested yielded ∼72% of area under the ROC curve. Conclusions Machine learning models are promising alternatives for estimating the genetic risk of LOAD. Systematic machine learning model selection also provides the opportunity to identify new genetic markers potentially associated with the disease.

Keywords