Agriculture (Mar 2022)
Accounting for Missing Pedigree Information with Single-Step Random Regression Test-Day Models
Abstract
Genomic selection is widely used in dairy cattle breeding, but still, single-step models are rarely used in national dairy cattle evaluations. New computing methods have allowed the utilization of very large genomic data sets. However, an unsolved model problem is how to build genomic- (G) and pedigree- (A22) relationship matrices that satisfy the theoretical assumptions about the same scale and equal base populations. Incompatibility issues have also been observed in the manner in which the genetic groups are included in the model. In this study, we compared three approaches for accounting for missing pedigree information: (1) GT_H used the full Quaas and Pollak (QP) transformation for the genetic groups, including both the pedigree-based and the genomic-relationship matrices, (2) GT_A22 used the partial QP transformation that omitted QP transformation in G−1, and (3) GT_MF used the metafounder approach. In addition to the genomic models, (4) an official animal model with a unknown parent groups (UPG) from the QP transformation and (5) an animal model with the metafounder approach were used for comparison. These models were tested with Nordic Holstein test-day production data and models. The test-day data included 8.5 million cows with a total of 173.7 million records and 10.9 million animals in the pedigree, and there were 274,145 genotyped animals. All models used VanRaden method 1 in G and had a 30% residual polygenic proportion (RPG). The G matrices in GT_H and GT_A22 were scaled to have an average diagonal equal to that of A22. Comparisons between the models were based on Mendelian sampling terms and forward prediction validation using linear regression with solutions from the full- and reduced-data evaluations. Models GT_H and GT_A22 gave very similar results in terms of overprediction. The MF approach showed the lowest bias.
Keywords