Poultry Science (Jan 2025)
An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers
Abstract
Machine learning (ML) methods have rapidly developed in various theoretical and practical research areas, including predicting genomic breeding values for large livestock animals. However, few studies have investigated the application of ML in broiler breeding. In this study, seven different ML methods—support vector regression (SVR), random forest (RF), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), kernel ridge regression (KRR) and multilayer perceptron (MLP) were employed to predict the genomic breeding values of laying traits, growth and carcass traits in a yellow-feathered broiler breeding population. The results indicated that classic methods, such as GBLUP and Bayesian, achieved superior prediction accuracy compared to ML methods in five of the eight traits. For half-eviscerated weight (HEW), ML methods showed an average improvement of 54.4% over GBLUP and Bayesian methods. Among the ML methods, SVR, RF, GBDT, and XGBoost exhibited improvements exceeding 60%, with respective values of 61.3%, 61.0%, 60.4%, and 60.7%; while MLP improved by 54.4% and LightGBM by 53.7%, KRR had the lowest improvement at 29.4%. For eviscerated weight (EW), ML methods still outperformed GBLUP and Bayesian methods. MLP gained the largest improvement at 19.0%, while SVR, RF, GBDT, XGBoost, LightGBM, and KRR improved by 15.0%, 16.5%, 9.5%, 7.0%, 1.6%, and 15.9%, respectively. Compared to default hyperparameters, the average improvement of ML methods with tuned hyperparameters was 34.0%, 32.9%, 27.0%, 19.3%, 26.8%, 13.2%, 18.9%, and 46.3%, respectively. The prediction accuracy of above algorithms could be optimized using genome-wide association study (GWAS) to select subsets of significant SNPs. This work provides valuable insights into genomic prediction, aiding genetic breeding in broilers.