Deriving stratified effects from joint models investigating gene-environment interactions

Vincent Laville; Timothy Majarian; Paul S. de Vries; Amy R. Bentley; Mary F. Feitosa; Yun J. Sung; D. C. Rao; Alisa Manning; Hugues Aschard; on behalf of the CHARGE Gene-Lifestyle Interactions Working Group

doi:10.1186/s12859-020-03569-4

BMC Bioinformatics (Jun 2020)

Deriving stratified effects from joint models investigating gene-environment interactions

Vincent Laville,
Timothy Majarian,
Paul S. de Vries,
Amy R. Bentley,
Mary F. Feitosa,
Yun J. Sung,
D. C. Rao,
Alisa Manning,
Hugues Aschard,
on behalf of the CHARGE Gene-Lifestyle Interactions Working Group

Affiliations

Vincent Laville: Department of Computational Biology, USR 3756 CNRS, Institut Pasteur
Timothy Majarian: Program in Medical and Population Genetics, Broad Institute of MIT and Harvard
Paul S. de Vries: Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston
Amy R. Bentley: Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health
Mary F. Feitosa: Division of Biostatistics, Department of Genetics, Washington University School of Medecine
Yun J. Sung: Division of Biostatistics, Department of Genetics, Washington University School of Medecine
D. C. Rao: Division of Biostatistics, Department of Genetics, Washington University School of Medecine
Alisa Manning: Program in Medical and Population Genetics, Broad Institute of MIT and Harvard
Hugues Aschard: Department of Computational Biology, USR 3756 CNRS, Institut Pasteur
on behalf of the CHARGE Gene-Lifestyle Interactions Working Group

DOI: https://doi.org/10.1186/s12859-020-03569-4
Journal volume & issue: Vol. 21, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Models including an interaction term and performing a joint test of SNP and/or interaction effect are often used to discover Gene-Environment (GxE) interactions. When the environmental exposure is a binary variable, analyses from exposure-stratified models which consist of estimating genetic effect in unexposed and exposed individuals separately can be of interest. In large-scale consortia focusing on GxE interactions in which only the joint test has been performed, it may be challenging to get summary statistics from both exposure-stratified and marginal (i.e not accounting for interaction) models. Results In this work, we developed a simple framework to estimate summary statistics in each stratum of a binary exposure and in the marginal model using summary statistics from the “joint” model. We performed simulation studies to assess our estimators’ accuracy and examined potential sources of bias, such as correlation between genotype and exposure and differing phenotypic variances within exposure strata. Results from these simulations highlight the high theoretical accuracy of our estimators and yield insights into the impact of potential sources of bias. We then applied our methods to real data and demonstrate our estimators’ retained accuracy after filtering SNPs by sample size to mitigate potential bias. Conclusions These analyses demonstrated the accuracy of our method in estimating both stratified and marginal summary statistics from a joint model of gene-environment interaction. In addition to facilitating the interpretation of GxE screenings, this work could be used to guide further functional analyses. We provide a user-friendly Python script to apply this strategy to real datasets. The Python script and documentation are available at https://gitlab.pasteur.fr/statistical-genetics/j2s .

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal

Abstract

Keywords