Impact of linkage disequilibrium heterogeneity along the genome on genomic prediction and heritability estimation

Duanyang Ren; Xiaodian Cai; Qing Lin; Haoqiang Ye; Jinyan Teng; Jiaqi Li; Xiangdong Ding; Zhe Zhang

doi:10.1186/s12711-022-00737-3

Genetics Selection Evolution (Jun 2022)

Impact of linkage disequilibrium heterogeneity along the genome on genomic prediction and heritability estimation

Duanyang Ren,
Xiaodian Cai,
Qing Lin,
Haoqiang Ye,
Jinyan Teng,
Jiaqi Li,
Xiangdong Ding,
Zhe Zhang

Affiliations

Duanyang Ren: Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University
Xiaodian Cai: Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University
Qing Lin: Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University
Haoqiang Ye: Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University
Jinyan Teng: Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University
Jiaqi Li: Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University
Xiangdong Ding: Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University
Zhe Zhang: Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University

DOI: https://doi.org/10.1186/s12711-022-00737-3
Journal volume & issue: Vol. 54, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Compared to medium-density single nucleotide polymorphism (SNP) data, high-density SNP data contain abundant genetic variants and provide more information for the genetic evaluation of livestock, but it has been shown that they do not confer any advantage for genomic prediction and heritability estimation. One possible reason is the uneven distribution of the linkage disequilibrium (LD) along the genome, i.e., LD heterogeneity among regions. The aim of this study was to effectively use genome-wide SNP data for genomic prediction and heritability estimation by using models that control LD heterogeneity among regions. Methods The LD-adjusted kinship (LDAK) and LD-stratified multicomponent (LDS) models were used to control LD heterogeneity among regions and were compared with the classical model that has no such control. Simulated and real traits of 2000 dairy cattle individuals with imputed high-density (770K) SNP data were used. Five types of phenotypes were simulated, which were controlled by very strongly, strongly, moderately, weakly and very weakly tagged causal variants, respectively. The performances of the models with high- and medium-density (50K) panels were compared to verify that the models that controlled LD heterogeneity among regions were more effective with high-density data. Results Compared to the medium-density panel, the use of the high-density panel did not improve and even decreased prediction accuracies and heritability estimates from the classical model for both simulated and real traits. Compared to the classical model, LDS effectively improved the accuracy of genomic predictions and unbiasedness of heritability estimates, regardless of the genetic architecture of the trait. LDAK applies only to traits that are mainly controlled by weakly tagged causal variants, but is still less effective than LDS for this type of trait. Compared with the classical model, LDS improved prediction accuracy by about 13% for simulated phenotypes and by 0.3 to ~ 10.7% for real traits with the high-density panel, and by ~ 1% for simulated phenotypes and by − 0.1 to ~ 6.9% for real traits with the medium-density panel. Conclusions Grouping SNPs based on regional LD to construct the LD-stratified multicomponent model can effectively eliminate the adverse effects of LD heterogeneity among regions, and greatly improve the efficiency of high-density SNP data for genomic prediction and heritability estimation.

Published in Genetics Selection Evolution

ISSN: 0999-193X (Print); 1297-9686 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Agriculture: Animal culture; Science: Biology (General): Genetics
Website: https://gsejournal.biomedcentral.com/

About the journal