Using imputation to provide harmonized longitudinal measures of cognition across AIBL and ADNI

Rosita Shishegar; Timothy Cox; David Rolls; Pierrick Bourgeat; Vincent Doré; Fiona Lamb; Joanne Robertson; Simon M. Laws; Tenielle Porter; Jurgen Fripp; Duygu Tosun; Paul Maruff; Greg Savage; Christopher C. Rowe; Colin L. Masters; Michael W. Weiner; Victor L. Villemagne; Samantha C. Burnham

doi:10.1038/s41598-021-02827-6

Scientific Reports (Dec 2021)

Using imputation to provide harmonized longitudinal measures of cognition across AIBL and ADNI

Rosita Shishegar,
Timothy Cox,
David Rolls,
Pierrick Bourgeat,
Vincent Doré,
Fiona Lamb,
Joanne Robertson,
Simon M. Laws,
Tenielle Porter,
Jurgen Fripp,
Duygu Tosun,
Paul Maruff,
Greg Savage,
Christopher C. Rowe,
Colin L. Masters,
Michael W. Weiner,
Victor L. Villemagne,
Samantha C. Burnham

Affiliations

Rosita Shishegar: The Australian e-Health Research Centre, CSIRO
Timothy Cox: The Australian e-Health Research Centre, CSIRO
David Rolls: The Australian e-Health Research Centre, CSIRO
Pierrick Bourgeat: The Australian e-Health Research Centre, CSIRO
Vincent Doré: The Australian e-Health Research Centre, CSIRO
Fiona Lamb: Department of Molecular Imaging and Therapy, Austin Health
Joanne Robertson: Florey Institute of Neuroscience and Mental Health, The University of Melbourne
Simon M. Laws: Centre for Precision Health, Edith Cowan University
Tenielle Porter: Centre for Precision Health, Edith Cowan University
Jurgen Fripp: The Australian e-Health Research Centre, CSIRO
Duygu Tosun: Department of Radiology and Biomedical Imaging, University of California-San Francisco
Paul Maruff: Cogstate Ltd.
Greg Savage: Department of Psychology, Macquarie University
Christopher C. Rowe: Department of Molecular Imaging and Therapy, Austin Health
Colin L. Masters: Florey Institute of Neuroscience and Mental Health, The University of Melbourne
Michael W. Weiner: Department of Radiology and Biomedical Imaging, University of California-San Francisco
Victor L. Villemagne: Department of Molecular Imaging and Therapy, Austin Health
Samantha C. Burnham: The Australian e-Health Research Centre, CSIRO

DOI: https://doi.org/10.1038/s41598-021-02827-6
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 11

Abstract

Read online

Abstract To improve understanding of Alzheimer’s disease, large observational studies are needed to increase power for more nuanced analyses. Combining data across existing observational studies represents one solution. However, the disparity of such datasets makes this a non-trivial task. Here, a machine learning approach was applied to impute longitudinal neuropsychological test scores across two observational studies, namely the Australian Imaging, Biomarkers and Lifestyle Study (AIBL) and the Alzheimer's Disease Neuroimaging Initiative (ADNI) providing an overall harmonised dataset. MissForest, a machine learning algorithm, capitalises on the underlying structure and relationships of data to impute test scores not measured in one study aligning it to the other study. Results demonstrated that simulated missing values from one dataset could be accurately imputed, and that imputation of actual missing data in one dataset showed comparable discrimination (p < 0.001) for clinical classification to measured data in the other dataset. Further, the increased power of the overall harmonised dataset was demonstrated by observing a significant association between CVLT-II test scores (imputed for ADNI) with PET Amyloid-β in MCI APOE-ε4 homozygotes in the imputed data (N = 65) but not for the original AIBL dataset (N = 11). These results suggest that MissForest can provide a practical solution for data harmonization using imputation across studies to improve power for more nuanced analyses.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal