Smart Agricultural Technology (Dec 2024)

Enhancing corn yield prediction: Optimizing data quality or model complexity?

  • Yuting Zhou,
  • Shengfang Ma,
  • Huihui Zhang,
  • Sathyanarayanan Aakur

Journal volume & issue
Vol. 9
p. 100671

Abstract

Read online

Field-scale corn yield prediction before harvest can assist farmers in better organizing their resources. Machine learning-based pipelines for analyzing remote sensing imagery offer an efficient solution to this problem. However, the cost of data acquisition and training requirements for machine or deep learning models depend on various factors, such as equipment (multispectral vs. RGB sensors) and the ability to predict yield from observations across growth stages. In this study, we aim to provide a comprehensive analysis of the effectiveness of traditional ensemble learning methods (Random Forest and Gradient Boosting) and deep learning models (ResNet 18, ResNet34, and ViT) in predicting corn yield across deficit and fully irrigated fields using UAV-based RGB and multispectral imagery. The performance of these models was examined across early, middle, and late growth stages, considering both computational complexity and accuracy. We also developed a novel shallow CNN framework called SimRes, inspired by the ResNet framework but tailored for streamlined efficiency and simplicity for yield prediction. Extensive quantitative analysis demonstrated that the customized SimRes performed as well as deep learning baselines but with faster computing times, while traditional approaches, such as Random Forests and Gradient Boosting exhibited marginally smaller R-squared values. Models utilizing multispectral data outperformed models using RGB, albeit with variations across growth stages. Deep learning methods performed better than ensemble learning methods in the early and late growth stages using RGB, while performance became comparable in the middle stage. These results underscore the importance of additional information or more complex models to enhance prediction accuracy alongside a trade-off between computational complexity and accuracy. This research provides valuable insights for optimizing corn yield prediction across different growth stages, informing agricultural management and harvest planning decisions.

Keywords