Frontiers in Genetics (Apr 2018)

Empirical Bayes Estimation of Semi-parametric Hierarchical Mixture Models for Unbiased Characterization of Polygenic Disease Architectures

  • Jo Nishino,
  • Jo Nishino,
  • Yuta Kochi,
  • Yuta Kochi,
  • Daichi Shigemizu,
  • Daichi Shigemizu,
  • Daichi Shigemizu,
  • Daichi Shigemizu,
  • Mamoru Kato,
  • Mamoru Kato,
  • Katsunori Ikari,
  • Katsunori Ikari,
  • Hidenori Ochi,
  • Hidenori Ochi,
  • Hidenori Ochi,
  • Hisashi Noma,
  • Hisashi Noma,
  • Kota Matsui,
  • Kota Matsui,
  • Takashi Morizono,
  • Keith A. Boroevich,
  • Tatsuhiko Tsunoda,
  • Tatsuhiko Tsunoda,
  • Tatsuhiko Tsunoda,
  • Tatsuhiko Tsunoda,
  • Shigeyuki Matsui,
  • Shigeyuki Matsui,
  • Shigeyuki Matsui

DOI
https://doi.org/10.3389/fgene.2018.00115
Journal volume & issue
Vol. 9

Abstract

Read online

Genome-wide association studies (GWAS) suggest that the genetic architecture of complex diseases consists of unexpectedly numerous variants with small effect sizes. However, the polygenic architectures of many diseases have not been well characterized due to lack of simple and fast methods for unbiased estimation of the underlying proportion of disease-associated variants and their effect-size distribution. Applying empirical Bayes estimation of semi-parametric hierarchical mixture models to GWAS summary statistics, we confirmed that schizophrenia was extremely polygenic [~40% of independent genome-wide SNPs are risk variants, most within odds ratio (OR = 1.03)], whereas rheumatoid arthritis was less polygenic (~4 to 8% risk variants, significant portion reaching OR = 1.05 to 1.1). For rheumatoid arthritis, stratified estimations revealed that expression quantitative loci in blood explained large genetic variance, and low- and high-frequency derived alleles were prone to be risk and protective, respectively, suggesting a predominance of deleterious-risk and advantageous-protective mutations. Despite genetic correlation, effect-size distributions for schizophrenia and bipolar disorder differed across allele frequency. These analyses distinguished disease polygenic architectures and provided clues for etiological differences in complex diseases.

Keywords