Stats (Feb 2022)

Modeling Secondary Phenotypes Conditional on Genotypes in Case–Control Studies

  • Naomi C. Brownstein,
  • Jianwen Cai,
  • Shad Smith,
  • Luda Diatchenko,
  • Gary D. Slade,
  • Eric Bair

DOI
https://doi.org/10.3390/stats5010014
Journal volume & issue
Vol. 5, no. 1
pp. 203 – 214

Abstract

Read online

Traditional case–control genetic association studies examine relationships between case–control status and one or more covariates. It is becoming increasingly common to study secondary phenotypes and their association with the original covariates. The Orofacial Pain: Prospective Evaluation and Risk Assessment (OPPERA) project, a study of temporomandibular disorders (TMD), motivates this work. Numerous measures of interest are collected at enrollment, such as the number of comorbid pain conditions from which a participant suffers. Examining the potential genetic basis of these measures is of secondary interest. Assessing these associations is statistically challenging, as participants do not form a random sample from the population of interest. Standard methods may be biased and lack coverage and power. We propose a general method for the analysis of arbitrary phenotypes utilizing inverse probability weighting and bootstrapping for standard error estimation. The method may be applied to the complicated association tests used in next-generation sequencing studies, such as analyses of haplotypes with ambiguous phase. Simulation studies show that our method performs as well as competing methods when they are applicable and yield promising results for outcome types, such as time-to-event, to which other methods may not apply. The method is applied to the OPPERA baseline case–control genetic study.

Keywords