The Plant Genome (Nov 2019)

Training Population Optimization for Genomic Selection

  • Inés Berro,
  • Bettina Lado,
  • Rafael S. Nalin,
  • Martin Quincke,
  • Lucía Gutiérrez

DOI
https://doi.org/10.3835/plantgenome2019.04.0028
Journal volume & issue
Vol. 12, no. 3

Abstract


Core Ideas

  • Training populations can be optimized for specific testing populations.
  • Optimized training populations are smaller, more related, and more predictive.
  • Stratified sampling with a relationship matrix weighted by marker effect is optimal.

The effectiveness of genomic selection in breeding programs depends on the phenotypic quality and depth, the prediction model, the number and type of molecular markers, and the size and composition of the training population (TR). Furthermore, population structure and diversity play a key role in the composition of the optimal training sets. Our goal was to compare strategies for optimizing the TR for specific testing populations (TE). A total of 1353 wheat (Triticum aestivum L.) and 644 rice (Oryza sativa L.) advanced lines were evaluated for grain yield in multiple environments. Several within-TR optimization strategies were compared to identify groups of individuals with increased predictive ability. Additionally, optimization strategies to choose individuals from the TR with higher predictive ability for a specific TE were compared. There is a benefit in considering both the population structure and the relationship between the TR and the TE when designing an optimal TR for genomic selection. A weighted relationship matrix with stratified sampling is the best strategy for forward predictions of quantitative traits in populations several generations apart.
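
The idea of combining a marker-effect-weighted relationship matrix with stratified sampling can be illustrated with a minimal Python sketch. This is not the authors' procedure: the function names, the weighting formula G_w = Z diag(w) Z' / sum(w), the use of non-negative weights such as absolute estimated marker effects, the proportional allocation per stratum, and the ranking by mean relationship to the TE are all assumptions made for illustration only.

```python
import numpy as np

def weighted_relationship(M, w):
    """Hypothetical weighted relationship matrix (an assumption, not the paper's formula).

    M: (n_lines, n_markers) genotype matrix, e.g. coded 0/1/2.
    w: non-negative marker weights, e.g. absolute estimated marker effects.
    """
    Z = M - M.mean(axis=0)            # center each marker column
    w = np.asarray(w, dtype=float)
    return (Z * w) @ Z.T / w.sum()    # G_w = Z diag(w) Z' / sum(w)

def stratified_training_set(G, te_idx, tr_idx, strata, n_select):
    """Pick, within each stratum, the TR candidates most related to the TE.

    strata holds one label per entry of tr_idx (e.g. a population-structure
    cluster). Each stratum contributes in proportion to its size, so the
    total can differ slightly from n_select because of rounding.
    """
    tr_idx = np.asarray(tr_idx)
    strata = np.asarray(strata)
    # Mean weighted relationship of every TR candidate to the TE lines.
    mean_rel = G[np.ix_(tr_idx, te_idx)].mean(axis=1)
    chosen = []
    for label in np.unique(strata):
        members = np.where(strata == label)[0]   # positions within tr_idx
        k = max(1, round(n_select * len(members) / len(tr_idx)))
        k = min(k, len(members))
        best = members[np.argsort(mean_rel[members])[::-1][:k]]
        chosen.extend(tr_idx[best].tolist())
    return sorted(chosen)
```

In such a sketch, the weights `w` would have to come from a model fitted beforehand (for example, a ridge-type marker regression on the full TR), and the strata from a separate clustering of the genotypes. Both steps are left out here.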