Frontiers in Plant Science (Aug 2023)

G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction

  • Qian Wang,
  • Qian Wang,
  • Shan Jiang,
  • Shan Jiang,
  • Tong Li,
  • Tong Li,
  • Zhixu Qiu,
  • Zhixu Qiu,
  • Jun Yan,
  • Jun Yan,
  • Ran Fu,
  • Ran Fu,
  • Chuang Ma,
  • Chuang Ma,
  • Xiangfeng Wang,
  • Xiangfeng Wang,
  • Shuqin Jiang,
  • Shuqin Jiang,
  • Qian Cheng,
  • Qian Cheng

DOI
https://doi.org/10.3389/fpls.2023.1207139
Journal volume & issue
Vol. 14

Abstract

Read online

Genotype-to-phenotype (G2P) prediction has become a mainstream paradigm to facilitate genomic selection (GS)-assisted breeding in the seed industry. Many methods have been introduced for building GS models, but their prediction precision may vary depending on species and specific traits. Therefore, evaluation of multiple models and selection of the appropriate one is crucial to effective GS analysis. Here, we present the G2P container developed for the Singularity platform, which not only contains a library of 16 state-of-the-art GS models and 13 evaluation metrics. G2P works as an integrative environment offering comprehensive, unbiased evaluation analyses of the 16 GS models, which may be run in parallel on high-performance computing clusters. Based on the evaluation outcome, G2P performs auto-ensemble algorithms that not only can automatically select the most precise models but also can integrate prediction results from multiple models. This functionality should further improve the precision of G2P prediction. Another noteworthy function is the refinement design of the training set, in which G2P optimizes the training set based on the genetic diversity analysis of a studied population. Although the training samples in the optimized set are fewer than in the original set, the prediction precision is almost equivalent to that obtained when using the whole set. This functionality is quite useful in practice, as it reduces the cost of phenotyping when constructing training population. The G2P container and source codes are freely accessible at https://g2p-env.github.io/.

Keywords