Frontiers in Plant Science (Dec 2022)

Genetic variation and marker−trait association affect the genomic selection prediction accuracy of soybean protein and oil content

  • Bo Sun,
  • Bo Sun,
  • Rui Guo,
  • Zhi Liu,
  • Xiaolei Shi,
  • Qing Yang,
  • Jiayao Shi,
  • Mengchen Zhang,
  • Chunyan Yang,
  • Shugang Zhao,
  • Jie Zhang,
  • Jianhan He,
  • Jiaoping Zhang,
  • Jiaoping Zhang,
  • Jianhui Su,
  • Qijian Song,
  • Long Yan

DOI
https://doi.org/10.3389/fpls.2022.1064623
Journal volume & issue
Vol. 13

Abstract

Read online

IntroductionGenomic selection (GS) is a potential breeding approach for soybean improvement.MethodsIn this study, GS was performed on soybean protein and oil content using the Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) based on 1,007 soybean accessions. The SoySNP50K SNP dataset of the accessions was obtained from the USDA-ARS, Beltsville, MD lab, and the protein and oil content of the accessions were obtained from GRIN.ResultsOur results showed that the prediction accuracy of oil content was higher than that of protein content. When the training population size was 100, the prediction accuracies for protein content and oil content were 0.60 and 0.79, respectively. The prediction accuracy increased with the size of the training population. Training populations with similar phenotype or with close genetic relationships to the prediction population exhibited better prediction accuracy. A greatest prediction accuracy for both protein and oil content was observed when approximately 3,000 markers with -log10(P) greater than 1 were included.DiscussionThis information will help improve GS efficiency and facilitate the application of GS.

Keywords