Brain-phenotype predictions of language and executive function can survive across diverse real-world data: Dataset shifts in developmental populations

Brendan D. Adkinson; Matthew Rosenblatt; Javid Dadashkarimi; Link Tejavibulya; Rongtao Jiang; Stephanie Noble; Dustin Scheinost

Developmental Cognitive Neuroscience (Dec 2024)

Brain-phenotype predictions of language and executive function can survive across diverse real-world data: Dataset shifts in developmental populations

Brendan D. Adkinson,
Matthew Rosenblatt,
Javid Dadashkarimi,
Link Tejavibulya,
Rongtao Jiang,
Stephanie Noble,
Dustin Scheinost

Affiliations

Brendan D. Adkinson: Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Correspondence to: Magnetic Resonance Research Center, 300 Cedar Street, New Haven, CT 06519, USA.
Matthew Rosenblatt: Department of Biomedical Engineering, Yale University, New Haven, CT 06520, USA
Javid Dadashkarimi: Department of Radiology, Athinoula. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, MA 02129, USA; Department of Radiology, Harvard Medical School, Boston, MA 02129, USA
Link Tejavibulya: Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA
Rongtao Jiang: Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT 06510, USA
Stephanie Noble: Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT 06510, USA; Department of Bioengineering, Northeastern University, Boston, MA 02120, USA; Department of Psychology, Northeastern University, Boston, MA 02115, USA
Dustin Scheinost: Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Department of Biomedical Engineering, Yale University, New Haven, CT 06520, USA; Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT 06510, USA; Department of Statistics & Data Science, Yale University, New Haven, CT 06520, USA; Child Study Center, Yale School of Medicine, New Haven, CT 06510, USA; Wu Tsai Institute, Yale University, New Haven, CT 06510, USA

Journal volume & issue: Vol. 70
p. 101464

Abstract

Read online

Predictive modeling potentially increases the reproducibility and generalizability of neuroimaging brain-phenotype associations. Yet, the evaluation of a model in another dataset is underutilized. Among studies that undertake external validation, there is a notable lack of attention to generalization across dataset-specific idiosyncrasies (i.e., dataset shifts). Research settings, by design, remove the between-site variations that real-world and, eventually, clinical applications demand. Here, we rigorously test the ability of a range of predictive models to generalize across three diverse, unharmonized developmental samples: the Philadelphia Neurodevelopmental Cohort (n=1291), the Healthy Brain Network (n=1110), and the Human Connectome Project in Development (n=428). These datasets have high inter-dataset heterogeneity, encompassing substantial variations in age distribution, sex, racial and ethnic minority representation, recruitment geography, clinical symptom burdens, fMRI tasks, sequences, and behavioral measures. Through advanced methodological approaches, we demonstrate that reproducible and generalizable brain-behavior associations can be realized across diverse dataset features. Results indicate the potential of functional connectome-based predictive models to be robust despite substantial inter-dataset variability. Notably, for the HCPD and HBN datasets, the best predictions were not from training and testing in the same dataset (i.e., cross-validation) but across datasets. This result suggests that training on diverse data may improve prediction in specific cases. Overall, this work provides a critical foundation for future work evaluating the generalizability of brain-phenotype associations in real-world scenarios and clinical settings.

Published in Developmental Cognitive Neuroscience

ISSN: 1878-9293 (Print); 1878-9307 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Science: Physiology: Neurophysiology and neuropsychology
Website: http://www.journals.elsevier.com/developmental-cognitive-neuroscience/

About the journal

Abstract

Keywords