Alzheimer’s Research & Therapy (May 2022)

Comparative analysis of machine learning algorithms for multi-syndrome classification of neurodegenerative syndromes

  • Leonie Lampe,
  • Sebastian Niehaus,
  • Hans-Jürgen Huppertz,
  • Alberto Merola,
  • Janis Reinelt,
  • Karsten Mueller,
  • Sarah Anderl-Straub,
  • Klaus Fassbender,
  • Klaus Fliessbach,
  • Holger Jahn,
  • Johannes Kornhuber,
  • Martin Lauer,
  • Johannes Prudlo,
  • Anja Schneider,
  • Matthis Synofzik,
  • Adrian Danek,
  • Janine Diehl-Schmid,
  • Markus Otto,
  • FTLD-Consortium Germany,
  • Arno Villringer,
  • Karl Egger,
  • Elke Hattingen,
  • Rüdiger Hilker-Roggendorf,
  • Alfons Schnitzler,
  • Martin Südmeyer,
  • Wolfgang Oertel,
  • German Atypical Parkinson Consortium Study Group,
  • Jan Kassubek,
  • Günter Höglinger,
  • Matthias L. Schroeter

DOI
https://doi.org/10.1186/s13195-022-00983-z
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Importance The entry of artificial intelligence into medicine is pending. Several methods have been used for the predictions of structured neuroimaging data, yet nobody compared them in this context. Objective Multi-class prediction is key for building computational aid systems for differential diagnosis. We compared support vector machine, random forest, gradient boosting, and deep feed-forward neural networks for the classification of different neurodegenerative syndromes based on structural magnetic resonance imaging. Design, setting, and participants Atlas-based volumetry was performed on multi-centric T1-weighted MRI data from 940 subjects, i.e., 124 healthy controls and 816 patients with ten different neurodegenerative diseases, leading to a multi-diagnostic multi-class classification task with eleven different classes. Interventions N.A. Main outcomes and measures Cohen’s kappa, accuracy, and F1-score to assess model performance. Results Overall, the neural network produced both the best performance measures and the most robust results. The smaller classes however were better classified by either the ensemble learning methods or the support vector machine, while performance measures for small classes were comparatively low, as expected. Diseases with regionally specific and pronounced atrophy patterns were generally better classified than diseases with widespread and rather weak atrophy. Conclusions and relevance Our study furthermore underlines the necessity of larger data sets but also calls for a careful consideration of different machine learning methods that can handle the type of data and the classification task best.

Keywords