Integrating large-scale neuroimaging research datasets: Harmonisation of white matter hyperintensity measurements across Whitehall and UK Biobank datasets
Valentina Bordin,
Ilaria Bertani,
Irene Mattioli,
Vaanathi Sundaresan,
Paul McCarthy,
Sana Suri,
Enikő Zsoldos,
Nicola Filippini,
Abda Mahmood,
Luca Melazzini,
Maria Marcella Laganà,
Giovanna Zamboni,
Archana Singh-Manoux,
Mika Kivimäki,
Klaus P Ebmeier,
Giuseppe Baselli,
Mark Jenkinson,
Clare E Mackay,
Eugene P Duff,
Ludovica Griffanti
Affiliations
Valentina Bordin
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
Ilaria Bertani
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
Irene Mattioli
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Department of Biomedical, Metabolic and Neural Sciences, University of Modena and Reggio Emilia, Italy
Vaanathi Sundaresan
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
Paul McCarthy
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
Sana Suri
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Human Brain Activity, Department of Psychiatry, University of Oxford, Oxford, UK; Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK
Enikő Zsoldos
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Human Brain Activity, Department of Psychiatry, University of Oxford, Oxford, UK; Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK
Nicola Filippini
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Human Brain Activity, Department of Psychiatry, University of Oxford, Oxford, UK
Abda Mahmood
Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK
Luca Melazzini
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Department of Biomedical Sciences for Health, Università degli Studi di Milano, Milan, Italy
Maria Marcella Laganà
IRCCS Fondazione Don Carlo Gnocchi ONLUS, Milan, Italy
Giovanna Zamboni
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Department of Biomedical, Metabolic and Neural Sciences, University of Modena and Reggio Emilia, Italy
Archana Singh-Manoux
INSERM U1153, Epidemiology of Ageing and Neurodegenerative diseases, Université de Paris, Paris, France; Department of Epidemiology and Public Health, University College London, London, UK
Mika Kivimäki
Department of Epidemiology and Public Health, University College London, London, UK
Klaus P Ebmeier
Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK
Giuseppe Baselli
Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
Mark Jenkinson
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
Clare E Mackay
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Human Brain Activity, Department of Psychiatry, University of Oxford, Oxford, UK; Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK; Oxford Health NHS Foundation Trust, Oxford, UK
Eugene P Duff
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Department of Paediatrics, University of Oxford, Oxford, UK
Ludovica Griffanti
Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK; Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Human Brain Activity, Department of Psychiatry, University of Oxford, Oxford, UK; Corresponding author at: Wellcome Centre for Integrative Neuroimaging, Oxford Centre for Human Brain Activity, Department of Psychiatry, University of Oxford, Warneford Ln, Oxford, Headington OX3 7JX, UK.
Large scale neuroimaging datasets present the possibility of providing normative distributions for a wide variety of neuroimaging markers, which would vastly improve the clinical utility of these measures. However, a major challenge is our current poor ability to integrate measures across different large-scale datasets, due to inconsistencies in imaging and non-imaging measures across the different protocols and populations. Here we explore the harmonisation of white matter hyperintensity (WMH) measures across two major studies of healthy elderly populations, the Whitehall II imaging sub-study and the UK Biobank. We identify pre-processing strategies that maximise the consistency across datasets and utilise multivariate regression to characterise study sample differences contributing to differences in WMH variations across studies. We also present a parser to harmonise WMH-relevant non-imaging variables across the two datasets. We show that we can provide highly calibrated WMH measures from these datasets with: (1) the inclusion of a number of specific standardised processing steps; and (2) appropriate modelling of sample differences through the alignment of demographic, cognitive and physiological variables. These results open up a wide range of applications for the study of WMHs and other neuroimaging markers across extensive databases of clinical data.