G3: Genes, Genomes, Genetics (Jun 2017)

A Variational Bayes Genomic-Enabled Prediction Model with Genotype × Environment Interaction

  • Osval A. Montesinos-López,
  • Abelardo Montesinos-López,
  • José Crossa,
  • José Cricelio Montesinos-López,
  • Francisco Javier Luna-Vázquez,
  • Josafhat Salinas-Ruiz,
  • José R. Herrera-Morales,
  • Raymundo Buenrostro-Mariscal

DOI
https://doi.org/10.1534/g3.117.041202
Journal volume & issue
Vol. 7, no. 6
pp. 1833 – 1853

Abstract

Read online

There are Bayesian and non-Bayesian genomic models that take into account G×E interactions. However, the computational cost of implementing Bayesian models is high, and becomes almost impossible when the number of genotypes, environments, and traits is very large, while, in non-Bayesian models, there are often important and unsolved convergence problems. The variational Bayes method is popular in machine learning, and, by approximating the probability distributions through optimization, it tends to be faster than Markov Chain Monte Carlo methods. For this reason, in this paper, we propose a new genomic variational Bayes version of the Bayesian genomic model with G×E using half-t priors on each standard deviation (SD) term to guarantee highly noninformative and posterior inferences that are not sensitive to the choice of hyper-parameters. We show the complete theoretical derivation of the full conditional and the variational posterior distributions, and their implementations. We used eight experimental genomic maize and wheat data sets to illustrate the new proposed variational Bayes approximation, and compared its predictions and implementation time with a standard Bayesian genomic model with G×E. Results indicated that prediction accuracies are slightly higher in the standard Bayesian model with G×E than in its variational counterpart, but, in terms of computation time, the variational Bayes genomic model with G×E is, in general, 10 times faster than the conventional Bayesian genomic model with G×E. For this reason, the proposed model may be a useful tool for researchers who need to predict and select genotypes in several environments.

Keywords