IEEE Access (Jan 2024)

Incorporating Meteorological Data and Pesticide Information to Forecast Crop Yields Using Machine Learning

  • Md Jiabul Hoque,
  • Md. Saiful Islam,
  • Jia Uddin,
  • Md. Abdus Samad,
  • Beatriz Sainz De Abajo,
  • Debora Libertad Ramirez Vargas,
  • Imran Ashraf

DOI
https://doi.org/10.1109/ACCESS.2024.3383309
Journal volume & issue
Vol. 12
pp. 47768 – 47786

Abstract

Read online

The agricultural sector is more vulnerable to the adverse effects of climate change and excessive pesticide application, posing a significant risk to global food security. Accurately predicting crop yields is essential for mitigating these risks and providing information on sustainable agricultural practices. This research presents a novel crop yield prediction system that utilizes a year’s worth of meteorological data, pesticide records, crop yield data, and machine learning techniques. We employed rigorous methods to gather, clean, and enhance data and then trained and evaluated three machine learning models: Gradient Boosting, K-Nearest Neighbors, and Multivariate Logistic Regression. We utilized the GridSearchCV method for hyper-parameter tweaking to identify the most suitable hyper-parameter throughout K-Fold cross-validation, aiming to improve the model’s performance by avoiding overfitting. The remarkable performance of the Gradient Boosting model, with an almost flawless coefficient of determination ( $R^{2}$ ) of 99.99%, demonstrates its promise for precise yield prediction. This research also examined the correlation between projected and actual crop yields and identified the ideal meteorological conditions. It paves the way for data-driven methods in sustainable agriculture and resource distribution, ultimately leading to a more secure future with respect to food availability and resilience to climate change.

Keywords