Cell Reports (Nov 2021)

NyuWa Genome resource: A deep whole-genome sequencing-based variation profile and reference panel for the Chinese population

  • Peng Zhang,
  • Huaxia Luo,
  • Yanyan Li,
  • You Wang,
  • Jiajia Wang,
  • Yu Zheng,
  • Yiwei Niu,
  • Yirong Shi,
  • Honghong Zhou,
  • Tingrui Song,
  • Quan Kang,
  • Tao Xu,
  • Shunmin He

Journal volume & issue
Vol. 37, no. 7
p. 110017

Abstract

Read online

Summary: The lack of haplotype reference panels and whole-genome sequencing resources specific to the Chinese population has greatly hindered genetic studies in the world’s largest population. Here, we present the NyuWa genome resource, based on deep (26.2×) sequencing of 2,999 Chinese individuals, and construct a NyuWa reference panel of 5,804 haplotypes and 19.3 million variants, which is a high-quality publicly available Chinese population-specific reference panel with thousands of samples. Compared with other panels, the NyuWa reference panel reduces the Han Chinese imputation error rate by a margin ranging from 30% to 51%. Population structure and imputation simulation tests support the applicability of one integrated reference panel for northern and southern Chinese. In addition, a total of 22,504 loss-of-function variants in coding and noncoding genes are identified, including 11,493 novel variants. These results highlight the value of the NyuWa genome resource in facilitating genetic research in Chinese and Asian populations.

Keywords