Benchmarking machine learning and parametric methods for genomic prediction of feed efficiency-related traits in Nellore cattle

Lucio F. M. Mota; Leonardo M. Arikawa; Samuel W. B. Santos; Gerardo A. Fernandes Júnior; Anderson A. C. Alves; Guilherme J. M. Rosa; Maria E. Z. Mercadante; Joslaine N. S. G. Cyrillo; Roberto Carvalheiro; Lucia G. Albuquerque

doi:10.1038/s41598-024-57234-4

Scientific Reports (Mar 2024)

Benchmarking machine learning and parametric methods for genomic prediction of feed efficiency-related traits in Nellore cattle

Lucio F. M. Mota,
Leonardo M. Arikawa,
Samuel W. B. Santos,
Gerardo A. Fernandes Júnior,
Anderson A. C. Alves,
Guilherme J. M. Rosa,
Maria E. Z. Mercadante,
Joslaine N. S. G. Cyrillo,
Roberto Carvalheiro,
Lucia G. Albuquerque

Affiliations

Lucio F. M. Mota: School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)
Leonardo M. Arikawa: School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)
Samuel W. B. Santos: School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)
Gerardo A. Fernandes Júnior: School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)
Anderson A. C. Alves: School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)
Guilherme J. M. Rosa: Department of Animal and Dairy Sciences, University of Wisconsin
Maria E. Z. Mercadante: Institute of Animal Science, Beef Cattle Research Center
Joslaine N. S. G. Cyrillo: Institute of Animal Science, Beef Cattle Research Center
Roberto Carvalheiro: School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)
Lucia G. Albuquerque: School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)

DOI: https://doi.org/10.1038/s41598-024-57234-4
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Genomic selection (GS) offers a promising opportunity for selecting more efficient animals to use consumed energy for maintenance and growth functions, impacting profitability and environmental sustainability. Here, we compared the prediction accuracy of multi-layer neural network (MLNN) and support vector regression (SVR) against single-trait (STGBLUP), multi-trait genomic best linear unbiased prediction (MTGBLUP), and Bayesian regression (BayesA, BayesB, BayesC, BRR, and BLasso) for feed efficiency (FE) traits. FE-related traits were measured in 1156 Nellore cattle from an experimental breeding program genotyped for ~ 300 K markers after quality control. Prediction accuracy (Acc) was evaluated using a forward validation splitting the dataset based on birth year, considering the phenotypes adjusted for the fixed effects and covariates as pseudo-phenotypes. The MLNN and SVR approaches were trained by randomly splitting the training population into fivefold to select the best hyperparameters. The results show that the machine learning methods (MLNN and SVR) and MTGBLUP outperformed STGBLUP and the Bayesian regression approaches, increasing the Acc by approximately 8.9%, 14.6%, and 13.7% using MLNN, SVR, and MTGBLUP, respectively. Acc for SVR and MTGBLUP were slightly different, ranging from 0.62 to 0.69 and 0.62 to 0.68, respectively, with empirically unbiased for both models (0.97 and 1.09). Our results indicated that SVR and MTGBLUBP approaches were more accurate in predicting FE-related traits than Bayesian regression and STGBLUP and seemed competitive for GS of complex phenotypes with various degrees of inheritance.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal