Dynamic ensemble-based machine learning models for predicting pest populations

Ankit Kumar Singh; Md Yeasin; Ranjit Kumar Paul; A. K. Paul; Anita Sarkar

doi:10.3389/fams.2024.1435517

Frontiers in Applied Mathematics and Statistics (Dec 2024)

Dynamic ensemble-based machine learning models for predicting pest populations

Ankit Kumar Singh,
Md Yeasin,
Ranjit Kumar Paul,
A. K. Paul,
Anita Sarkar

Affiliations

Ankit Kumar Singh: The Graduate School, ICAR-Indian Agricultural Research Institute, New Delhi, India
Md Yeasin: ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
Ranjit Kumar Paul: ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
A. K. Paul: ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
Anita Sarkar: The Graduate School, ICAR-Indian Agricultural Research Institute, New Delhi, India

DOI: https://doi.org/10.3389/fams.2024.1435517
Journal volume & issue: Vol. 10

Abstract

Read online

Early prediction of pest occurrences can enhance crop production, reduce input costs, and minimize environmental damage. Advances in machine learning algorithms facilitate the development of efficient pest alert systems. Furthermore, ensemble algorithms help in the utilization of several models rather than being dependent on a single model. This study introduces a dynamic ensemble model with absolute log error (ALE) and logistic error functions using four machine learning models—artificial neural networks (ANNs), support vector regression (SVR), k-nearest neighbors (kNN), and random forests (RF). Various abiotic factors such as minimum and maximum temperature, rainfall, and morning and evening relative humidity were incorporated into the model as exogenous variables. The proposed algorithms were compared with fixed-weighted and unweighted ensemble methods, and candidate machine learning models, using the pest population data for yellow stem borer (YSB) from two regions of India. Error metrics include the root mean square log error (RMSLE), root relative square error (RRSE), and median absolute error (MDAE), along with the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) algorithm. This study concluded that the proposed dynamic ensemble algorithm demonstrated better predictive accuracy in forecasting YSB infestation in rice crops.

Published in Frontiers in Applied Mathematics and Statistics

ISSN: 2297-4687 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Applied mathematics. Quantitative methods; Science: Mathematics: Probabilities. Mathematical statistics
Website: http://journal.frontiersin.org/journal/applied-mathematics-and-statistics#

About the journal

Abstract

Keywords