Deep learning-based Alzheimer's disease detection: reproducibility and the effect of modeling choices

Rosanna Turrisi; Rosanna Turrisi; Alessandro Verri; Alessandro Verri; Annalisa Barla; Annalisa Barla

doi:10.3389/fncom.2024.1360095

Frontiers in Computational Neuroscience (Sep 2024)

Deep learning-based Alzheimer's disease detection: reproducibility and the effect of modeling choices

Rosanna Turrisi,
Rosanna Turrisi,
Alessandro Verri,
Alessandro Verri,
Annalisa Barla,
Annalisa Barla

Affiliations

Rosanna Turrisi: Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
Rosanna Turrisi: Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
Alessandro Verri: Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
Alessandro Verri: Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
Annalisa Barla: Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
Annalisa Barla: Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy

DOI: https://doi.org/10.3389/fncom.2024.1360095
Journal volume & issue: Vol. 18

Abstract

Read online

IntroductionMachine Learning (ML) has emerged as a promising approach in healthcare, outperforming traditional statistical techniques. However, to establish ML as a reliable tool in clinical practice, adherence to best practices in data handling, and modeling design and assessment is crucial. In this work, we summarize and strictly adhere to such practices to ensure reproducible and reliable ML. Specifically, we focus on Alzheimer's Disease (AD) detection, a challenging problem in healthcare. Additionally, we investigate the impact of modeling choices, including different data augmentation techniques and model complexity, on overall performance.MethodsWe utilize Magnetic Resonance Imaging (MRI) data from the ADNI corpus to address a binary classification problem using 3D Convolutional Neural Networks (CNNs). Data processing and modeling are specifically tailored to address data scarcity and minimize computational overhead. Within this framework, we train 15 predictive models, considering three different data augmentation strategies and five distinct 3D CNN architectures with varying convolutional layers counts. The augmentation strategies involve affine transformations, such as zoom, shift, and rotation, applied either concurrently or separately.ResultsThe combined effect of data augmentation and model complexity results in up to 10% variation in prediction accuracy. Notably, when affine transformation are applied separately, the model achieves higher accuracy, regardless the chosen architecture. Across all strategies, the model accuracy exhibits a concave behavior as the number of convolutional layers increases, peaking at an intermediate value. The best model reaches excellent performance both on the internal and additional external testing set.DiscussionsOur work underscores the critical importance of adhering to rigorous experimental practices in the field of ML applied to healthcare. The results clearly demonstrate how data augmentation and model depth—often overlooked factors– can dramatically impact final performance if not thoroughly investigated. This highlights both the necessity of exploring neglected modeling aspects and the need to comprehensively report all modeling choices to ensure reproducibility and facilitate meaningful comparisons across studies.

Published in Frontiers in Computational Neuroscience

ISSN: 1662-5188 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/computational_neuroscience

About the journal

Abstract

Keywords