Bridging big data in the ENIGMA consortium to combine non-equivalent cognitive measures

Eamonn Kennedy; Shashank Vadlamani; Hannah M. Lindsey; Pui-Wa Lei; Mary Jo-Pugh; Paul M. Thompson; David F. Tate; Frank G. Hillary; Emily L. Dennis; Elisabeth A. Wilde; for the ENIGMA Clinical Endpoints Working Group

doi:10.1038/s41598-024-72968-x

Scientific Reports (Oct 2024)

Affiliations

Eamonn Kennedy: Department of Neurology, University of Utah School of Medicine
Shashank Vadlamani: Department of Neurology, University of Utah School of Medicine
Hannah M. Lindsey: Department of Neurology, University of Utah School of Medicine
Pui-Wa Lei: Department of Educational Psychology, Counseling, and Special Education, Pennsylvania State University
Mary Jo-Pugh: Department of Neurology, University of Utah School of Medicine
Paul M. Thompson: Imaging Genetics Center, Stevens Neuroimaging & Informatics Institute, Keck School of Medicine of USC
David F. Tate: Department of Neurology, University of Utah School of Medicine
Frank G. Hillary: Department of Psychology, Penn State University
Emily L. Dennis: Department of Neurology, University of Utah School of Medicine
Elisabeth A. Wilde: Department of Neurology, University of Utah School of Medicine

DOI: https://doi.org/10.1038/s41598-024-72968-x
Journal volume & issue: Vol. 14, no. 1, pp. 1–15

Abstract

Investigators in neuroscience have turned to Big Data to address replication and reliability issues by increasing sample size. These efforts raise new questions about how to integrate data across distinct sources and instruments. The goal of this study was to link scores across common auditory verbal learning tasks (AVLTs). This international secondary analysis aggregated multisite raw data for AVLTs across 53 studies totaling 10,505 individuals. Using the ComBat-GAM algorithm, we isolated and removed the component of memory scores associated with site effects while preserving instrumental effects. After adjustment, a continuous item response theory model used multiple memory items of varying difficulty to estimate each individual’s latent verbal learning ability on a single scale. Equivalent raw scores across AVLTs were then found by linking individuals through the ability scale. Harmonization reduced total cross-site score variance by 37% while preserving meaningful memory effects. Age had the largest impact on scores overall (−11.4%), while the race/ethnicity variable was not significant (p > 0.05). The resulting tools were validated on dually administered tests. The conversion tool is available online so researchers and clinicians can convert memory scores across instruments. This work demonstrates that global harmonization initiatives can address reproducibility challenges across the behavioral sciences.
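The idea of linking scores "through the ability scale" can be illustrated with a minimal sketch. This is not the authors' continuous item response theory model: it assumes, purely for illustration, that each instrument's expected raw score follows a simple logistic curve in latent ability. The instrument names, maximum scores, and curve parameters below are hypothetical and not taken from the study.

```python
import math


def expected_score(theta, max_score, a, b):
    """Expected raw score at ability theta under an assumed logistic curve.

    a (slope) and b (difficulty) are hypothetical illustration parameters.
    """
    return max_score / (1.0 + math.exp(-a * (theta - b)))


def ability_from_score(score, max_score, a, b, eps=1e-6):
    """Invert the assumed logistic curve to recover ability from a raw score."""
    # Clamp the proportion so floor and ceiling scores stay invertible.
    p = min(max(score / max_score, eps), 1.0 - eps)
    return b + math.log(p / (1.0 - p)) / a


def convert_score(score, src, dst):
    """Map a raw score on the source test to the equivalent on the target test.

    Step 1: source raw score -> latent ability.
    Step 2: latent ability -> expected raw score on the target test.
    """
    theta = ability_from_score(score, **src)
    return expected_score(theta, **dst)


# Hypothetical instruments; these values are NOT estimates from the paper.
TEST_A = {"max_score": 15, "a": 1.2, "b": 0.0}
TEST_B = {"max_score": 16, "a": 1.0, "b": 0.3}

if __name__ == "__main__":
    for raw in (3, 7.5, 12):
        print(raw, "->", round(convert_score(raw, TEST_A, TEST_B), 2))
```

Because both curves are monotone in ability, the conversion preserves rank order, and converting a score from one test to the other and back returns the original value. In practice the curves would be estimated from harmonized data rather than fixed by hand.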

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
