Scientific Reports (Oct 2022)

An integrated learning algorithm for early prediction of melon harvest

  • Chunyang Qian,
  • Taihang Du,
  • Shuguang Sun,
  • Wei Liu,
  • Haiguang Zheng,
  • Jianchun Wang

DOI
https://doi.org/10.1038/s41598-022-20799-z
Journal volume & issue
Vol. 12, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Different modeling techniques must be applied to manage production and statistical estimation to predict the expected harvest. By calculating advanced production methods and the rational valuation of different factors, we can accurately capture the variety of growth characteristics and the expected yield. This paper obtained 32 feature variables related to melons, including phenological features, shape features, and color features. The Gradient Boosted Decision Tree (GBDT) network and the Grid Search (GS) hyperparameter seeking method was applied to calculate the degree of importance of all melon fruits' characteristics and construct prediction models for three expected harvest indexes of melon yield, sugar content, and endocarp hardness. To facilitate growers to carry out prediction and estimation in the field without destroying the melon fruits. The reduced feature variables were selected as inputs. The GBDT model was used to provide a significant advantage in prediction compared to both Random Forest (RF) and Support Vector Regression (SVR) methods. In addition, to verify the feasibility of using only reduced feature variables as input for the evaluation work, this study also compares the predictive effects of the model when all feature variables and only reduced feature variables are used. The GBDT prediction model proposed in this paper predicted melon yield, sugar content, and hardness using reduced features as input, and the model R2 could reach more than 90%. Therefore, this method can effectively help growers carry out early non-destructive inspection and growth prediction of melons in the field.