Geomatics, Natural Hazards & Risk (Dec 2024)
A novel voting ensemble model empowered by metaheuristic feature selection for accurate flash flood susceptibility mapping
Abstract
This study addresses the challenges of flash flood susceptibility mapping in Yemen’s Qaa’Jahran Basin, characterized by complex terrain and limited hydro-meteorological data. To enhance predictive accuracy, we integrate metaheuristic feature selection with ensemble learning models. Initially, fifteen flash flood variables were retrieved using Geographic Information System (GIS) based remote sensing, setting the stage for a novel feature selection algorithm. Then, the Memo Search Algorithm (MSA), a metaheuristic approach is proposed to efficiently reduce feature space. Through comprehensive comparisons with established algorithms such as the Artificial Bee Colony (ABC) and Gray Wolf Optimizer (GWO), MSA refined the selection, identifying 'elevation’ and 'distance to streams’ as optimal factors. Statistical validations using the Friedman and Wilcoxon signed-rank tests confirmed the significant superiority of MSA over competing algorithms. Ensemble classifiers (bagging, boosting, stacking) were then applied to the reduced feature space. Comprehensive evaluation revealed the boosting ensemble with MSA outperformed traditional techniques reaching 98.75% accuracy, 0.9896 Area Under the Curve (AUC), and 98.95% the harmonic mean of the precision and recall (F1-score). Precision in identifying high-risk flash flood zones was underlined via spatial prediction, confirming the integrated framework’s ability to significantly improve forecast accuracy. The findings aid disaster management with powerful geographic mapping in data-poor regions. The proposed framework is adaptable globally for flash flood-prone areas with similar constraints. As climate change is expected to increase extreme rainfall events, communities globally will need robust data-driven methodologies for flash flood susceptibility mapping. The Key recommendations of the current study include investigating hybrid feature selection methods to better enhance predictive inputs and analyzing transferability across hydro-climatic zones.
Keywords