Cell Reports (Jan 2024)

Performance reserves in brain-imaging-based phenotype prediction

  • Marc-Andre Schulz,
  • Danilo Bzdok,
  • Stefan Haufe,
  • John-Dylan Haynes,
  • Kerstin Ritter

Journal volume & issue
Vol. 43, no. 1
p. 113597

Abstract

Read online

Summary: This study examines the impact of sample size on predicting cognitive and mental health phenotypes from brain imaging via machine learning. Our analysis shows a 3- to 9-fold improvement in prediction performance when sample size increases from 1,000 to 1 M participants. However, despite this increase, the data suggest that prediction accuracy remains worryingly low and far from fully exploiting the predictive potential of brain imaging data. Additionally, we find that integrating multiple imaging modalities boosts prediction accuracy, often equivalent to doubling the sample size. Interestingly, the most informative imaging modality often varied with increasing sample size, emphasizing the need to consider multiple modalities. Despite significant performance reserves for phenotype prediction, achieving substantial improvements may necessitate prohibitively large sample sizes, thus casting doubt on the practical or clinical utility of machine learning in some areas of neuroimaging.

Keywords