Agriculture (Aug 2020)

Soybean Yield Estimation and Its Components: A Linear Regression Approach

  • Marcelo Chan Fu Wei,
  • José Paulo Molin

DOI
https://doi.org/10.3390/agriculture10080348
Journal volume & issue
Vol. 10, no. 8
p. 348

Abstract

Read online

Soybean yield estimation is either based on yield monitors or agro-meteorological and satellite imagery data, but they present several limiting factors regarding on-farm decision level. Aware that machine learning approaches have been largely applied to estimate soybean yield and the availability of data regarding soybean yield and its components (number of grains (NG) and thousand grains weight (TGW)), there is an opportunity to study their relationships. The objective was to explore the relationships between soybean yield and its components, generate equations to estimate yield and evaluate its prediction accuracy. The training dataset was composed of soybean yield and its components’ data from 2010 to 2019. Linear regression models based on NG, TGW and yield were fitted on the training dataset and applied to a validation dataset composed of 58 on-field collected samples. It was found that globally TGW and NG presented weak (r = 0.50) and strong (r = 0.92) linear relationships with yield, respectively. In addition to that, applying the fitted models to the validation dataset, model based on NG presented the highest accuracy, coefficient of determination (R2) of 0.70, mean absolute error (MAE) of 639.99 kg ha−1 and root mean squared error (RMSE) of 726.67 kg ha−1.

Keywords