Finding the needle in a high-dimensional haystack: Canonical correlation analysis for neuroscientists

Hao-Ting Wang; Jonathan Smallwood; Janaina Mourao-Miranda; Cedric Huchuan Xia; Theodore D. Satterthwaite; Danielle S. Bassett; Danilo Bzdok

NeuroImage (Aug 2020)

Affiliations

Hao-Ting Wang: Department of Psychology, University of York, Heslington, York, United Kingdom; Sackler Center for Consciousness Science, University of Sussex, Brighton, United Kingdom; Corresponding author. Sackler Center for Consciousness Science, University of Sussex, Brighton, United Kingdom.
Jonathan Smallwood: Department of Psychology, University of York, Heslington, York, United Kingdom
Janaina Mourao-Miranda: Centre for Medical Image Computing, Department of Computer Science, University College London, London, United Kingdom; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
Cedric Huchuan Xia: Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
Theodore D. Satterthwaite: Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
Danielle S. Bassett: Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Physics & Astronomy, School of Arts & Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
Danilo Bzdok: Department of Psychiatry, Psychotherapy and Psychosomatics, RWTH Aachen University, Germany; JARA-BRAIN, Jülich-Aachen Research Alliance, Germany; Parietal Team, INRIA, Neurospin, Bat 145, CEA Saclay, 91191, Gif-sur-Yvette, France; Department of Biomedical Engineering, Montreal Neurological Institute, Faculty of Medicine, McGill University, Montreal, Canada; Mila - Quebec Artificial Intelligence Institute, Canada; Corresponding author. Department of Biomedical Engineering, Montreal Neurological Institute, McGill University, Mila - Quebec Artificial Intelligence Institute, Montreal, Canada.

Journal volume & issue: Vol. 216, p. 116745

Abstract

The 21st century marks the emergence of “big data” with a rapid increase in the availability of datasets with multiple measurements. In neuroscience, brain-imaging datasets are increasingly accompanied by dozens or hundreds of phenotypic subject descriptors on the behavioral, neural, and genomic levels. The complexity of such “big data” repositories offers new opportunities and poses new challenges for systems neuroscience. Canonical correlation analysis (CCA) is a prototypical family of methods that is useful in identifying the links between variable sets from different modalities. Importantly, CCA is well suited to describing relationships across multiple sets of data, such as in recently available big biomedical datasets. Our primer discusses the rationale, promises, and pitfalls of CCA.

Published in NeuroImage

ISSN: 1053-8119 (Print); 1095-9572 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.journals.elsevier.com/neuroimage
