Artificial Intelligence in Agriculture (Jan 2021)
Yield performance estimation of corn hybrids using machine learning algorithms
Abstract
Estimation of yield performance for crop products is a topic of interest in agriculture. In breeding programs, we cannot test all possible hybrids created by crossing two parents (inbred and tester) since it would be too time consuming and costly. In this paper, we exploit different machine learning algorithms including decision tree, gradient boosting machine, random forest, adaptive boosting, XGBoost and neural network to predict the yield of corn hybrids using data provided in the 2020 Syngenta Crop Challenge. The participants were asked to predict the yield of missing hybrids which were not tested before. Our results show that the prediction obtained by XGBoost is more accurate than other models with a root mean square error equal to 0.0524. Therefore, we use XGBoost model to estimate the yield performance for untested combinations of inbreds and testers. Using this approach, we identify hybrids with high predicted yield that can be bred to increase corn production.