NeuroImage: Clinical (Jan 2017)

Five-class differential diagnostics of neurodegenerative diseases using random undersampling boosting

  • Tong Tong,
  • Christian Ledig,
  • Ricardo Guerrero,
  • Andreas Schuh,
  • Juha Koikkalainen,
  • Antti Tolonen,
  • Hanneke Rhodius,
  • Frederik Barkhof,
  • Betty Tijms,
  • Afina W Lemstra,
  • Hilkka Soininen,
  • Anne M Remes,
  • Gunhild Waldemar,
  • Steen Hasselbalch,
  • Patrizia Mecocci,
  • Marta Baroni,
  • Jyrki Lötjönen,
  • Wiesje van der Flier,
  • Daniel Rueckert

Journal volume & issue
Vol. 15
pp. 613 – 624

Abstract

Differentiating between types of neurodegenerative disease is not only crucial in clinical practice when treatment decisions have to be made, but also has significant potential for the enrichment of clinical trials. The purpose of this study is to develop a classification framework for distinguishing the four most common neurodegenerative diseases (Alzheimer's disease, frontotemporal lobe degeneration, dementia with Lewy bodies and vascular dementia) as well as patients with subjective memory complaints. Different biomarkers, including imaging features (volume features, region-wise grading features) and non-imaging features (CSF measures), were extracted for each subject. In clinical practice, the prevalence of different dementia types is imbalanced, which poses challenges for learning an effective classification model. Therefore, we propose the use of the RUSBoost algorithm to train classifiers and to handle the class imbalance in the training data. Furthermore, a multi-class feature selection method based on sparsity is integrated into the proposed framework to improve the classification performance. It also provides a way of investigating the importance of different features and regions. Using a dataset of 500 subjects, the proposed framework achieved an accuracy of 75.2% with a balanced accuracy of 69.3% for the five-class classification using ten-fold cross-validation, which is significantly better than the results obtained using a support vector machine or random forest, demonstrating the feasibility of the proposed framework to support clinical decision making.

Keywords: Neurodegenerative diseases, Differential diagnosis, MRI, Dementia, Imbalance learning, Multi-class feature selection
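
The abstract reports both an accuracy and a lower balanced accuracy, and relies on random undersampling to cope with class imbalance. The following is a minimal sketch of those two ideas, not the authors' implementation. It assumes that balanced accuracy means the mean of per-class recall (the abstract does not define it) and that undersampling reduces every class to the size of the rarest one; RUSBoost repeats a resampling step of this kind inside each boosting round, and the boosting itself is omitted here. The function names and the toy labels are hypothetical.

```python
import random
from collections import defaultdict


def random_undersample(labels, rng):
    """Return sorted indices that keep each class at the size of the rarest class.

    Assumption: classes are fully equalised; other target ratios are possible.
    """
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    n_min = min(len(idxs) for idxs in by_class.values())
    kept = []
    for idxs in by_class.values():
        # Draw n_min indices without replacement from this class.
        kept.extend(rng.sample(idxs, n_min))
    return sorted(kept)


def accuracy(y_true, y_pred):
    """Fraction of all samples that are classified correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so every class counts equally."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += int(t == p)
    return sum(correct[c] / total[c] for c in total) / len(total)


# Toy, made-up labels: a classifier that always predicts the majority class.
y_true = ["AD"] * 8 + ["FTLD"] * 2
y_pred = ["AD"] * 10

plain = accuracy(y_true, y_pred)              # 8 of 10 correct
balanced = balanced_accuracy(y_true, y_pred)  # recall 1.0 for AD, 0.0 for FTLD
subset = random_undersample(y_true, random.Random(0))
```

On this toy input the plain accuracy is 0.8 while the balanced accuracy is 0.5, which illustrates why the two figures can diverge on an imbalanced dataset; the undersampled subset keeps two samples from each class.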