Mathematics (Feb 2020)

Robust Linear Trend Test for Low-Coverage Next-Generation Sequence Data Controlling for Covariates

  • Jung Yeon Lee,
  • Myeong-Kyu Kim,
  • Wonkuk Kim

DOI
https://doi.org/10.3390/math8020217
Journal volume & issue
Vol. 8, no. 2
p. 217

Abstract

Read online

Low-coverage next-generation sequencing experiments assisted by statistical methods are popular in a genetic association study. Next-generation sequencing experiments produce genotype data that include allele read counts and read depths. For low sequencing depths, the genotypes tend to be highly uncertain; therefore, the uncertain genotypes are usually removed or imputed before performing a statistical analysis. It may result in the inflated type I error rate and in a loss of statistical power. In this paper, we propose a mixture-based penalized score association test adjusting for non-genetic covariates. The proposed score test statistic is based on a sandwich variance estimator so that it is robust under the model misspecification between the covariates and the latent genotypes. The proposed method takes advantage of not requiring either external imputation or elimination of uncertain genotypes. The results of our simulation study show that the type I error rates are well controlled and the proposed association test have reasonable statistical power. As an illustration, we apply our statistic to pharmacogenomics data for drug responsiveness among 400 epilepsy patients.

Keywords