Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes

Justin M. Luningham; Daniel B. McArtor; Anne M. Hendriks; Anne M. Hendriks; Catharina E. M. van Beijsterveldt; Catharina E. M. van Beijsterveldt; Paul Lichtenstein; Sebastian Lundström; Henrik Larsson; Henrik Larsson; Meike Bartels; Meike Bartels; Meike Bartels; Dorret I. Boomsma; Dorret I. Boomsma; Dorret I. Boomsma; Gitta H. Lubke

doi:10.3389/fgene.2019.01227

Frontiers in Genetics (Dec 2019)

Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes

Justin M. Luningham,
Daniel B. McArtor,
Anne M. Hendriks,
Anne M. Hendriks,
Catharina E. M. van Beijsterveldt,
Catharina E. M. van Beijsterveldt,
Paul Lichtenstein,
Sebastian Lundström,
Henrik Larsson,
Henrik Larsson,
Meike Bartels,
Meike Bartels,
Meike Bartels,
Dorret I. Boomsma,
Dorret I. Boomsma,
Dorret I. Boomsma,
Gitta H. Lubke

Affiliations

Justin M. Luningham: Department of Psychology, University of Notre Dame, Notre Dame, IN, United States
Daniel B. McArtor: Department of Psychology, University of Notre Dame, Notre Dame, IN, United States
Anne M. Hendriks: Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Anne M. Hendriks: Faculty of Behavioural and Movement Sciences, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Catharina E. M. van Beijsterveldt: Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Catharina E. M. van Beijsterveldt: Faculty of Behavioural and Movement Sciences, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Paul Lichtenstein: Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Sebastian Lundström: Gillberg Neuropsychiatry Centre, University of Gothenburg, Gothenburg, Sweden
Henrik Larsson: Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Henrik Larsson: School of Medical Sciences, Örebro University, Örebro, Sweden
Meike Bartels: Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Meike Bartels: Faculty of Behavioural and Movement Sciences, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Meike Bartels: Amsterdam Neuroscience, VU University Amsterdam, Amsterdam, Netherlands
Dorret I. Boomsma: Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Dorret I. Boomsma: Faculty of Behavioural and Movement Sciences, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
Dorret I. Boomsma: Amsterdam Neuroscience, VU University Amsterdam, Amsterdam, Netherlands
Gitta H. Lubke: Department of Psychology, University of Notre Dame, Notre Dame, IN, United States

DOI: https://doi.org/10.3389/fgene.2019.01227
Journal volume & issue: Vol. 10

Abstract

Read online

Parallel meta-analysis is a popular approach for increasing the power to detect genetic effects in genome-wide association studies across multiple cohorts. Consortia studying the genetics of behavioral phenotypes are oftentimes faced with systematic differences in phenotype measurement across cohorts, introducing heterogeneity into the meta-analysis and reducing statistical power. This study investigated integrative data analysis (IDA) as an approach for jointly modeling the phenotype across multiple datasets. We put forth a bi-factor integration model (BFIM) that provides a single common phenotype score and accounts for sources of study-specific variability in the phenotype. In order to capitalize on this modeling strategy, a phenotype reference panel was utilized as a supplemental sample with complete data on all behavioral measures. A simulation study showed that a mega-analysis of genetic variant effects in a BFIM were more powerful than meta-analysis of genetic effects on a cohort-specific sum score of items. Saving the factor scores from the BFIM and using those as the outcome in meta-analysis was also more powerful than the sum score in most simulation conditions, but a small degree of bias was introduced by this approach. The reference panel was necessary to realize these power gains. An empirical demonstration used the BFIM to harmonize aggression scores in 9-year old children across the Netherlands Twin Register and the Child and Adolescent Twin Study in Sweden, providing a template for application of the BFIM to a range of different phenotypes. A supplemental data collection in the Netherlands Twin Register served as a reference panel for phenotype modeling across both cohorts. Our results indicate that model-based harmonization for the study of complex traits is a useful step within genetic consortia.

Published in Frontiers in Genetics

ISSN: 1664-8021 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Biology (General): Genetics
Website: http://journal.frontiersin.org/journal/genetics

About the journal

Abstract

Keywords