Information (Nov 2024)

Stacking Ensemble Technique Using Optimized Machine Learning Models with Boruta–XGBoost Feature Selection for Landslide Susceptibility Mapping: A Case of Kermanshah Province, Iran

  • Zeynab Yousefi,
  • Ali Asghar Alesheikh,
  • Ali Jafari,
  • Sara Torktatari,
  • Mohammad Sharif

DOI
https://doi.org/10.3390/info15110689
Journal volume & issue
Vol. 15, no. 11
p. 689

Abstract

Read online

Landslides cause significant human and financial losses in different regions of the world. A high-accuracy landslide susceptibility map (LSM) is required to reduce the adverse effects of landslides. Machine learning (ML) is a robust tool for LSM creation. ML models require large amounts of data to predict landslides accurately. This study has developed a stacking ensemble technique based on ML and optimization to enhance the accuracy of an LSM while considering small datasets. The Boruta–XGBoost feature selection was used to determine the optimal combination of features. Then, an intelligent and accurate analysis was performed to prepare the LSM using a dynamic and hybrid approach based on the Adaptive Fuzzy Inference System (ANFIS), Extreme Learning Machine (ELM), Support Vector Regression (SVR), and new optimization algorithms (Ladybug Beetle Optimization [LBO] and Electric Eel Foraging Optimization [EEFO]). After model optimization, a stacking ensemble learning technique was used to weight the models and combine the model outputs to increase the accuracy and reliability of the LSM. The weight combinations of the models were optimized using LBO and EEFO. The Root Mean Square Error (RMSE) and Area Under the Receiver Operating Characteristic Curve (AUC-ROC) parameters were used to assess the performance of these models. A landslide dataset from Kermanshah province, Iran, and 17 influencing factors were used to evaluate the proposed approach. Landslide inventory was 116 points, and the combined Voronoi and entropy method was applied for non-landslide point sampling. The results showed higher accuracy from the stacking ensemble technique with EEFO and LBO algorithms with AUC-ROC values of 94.81% and 94.84% and RMSE values of 0.3146 and 0.3142, respectively. The proposed approach can help managers and planners prepare accurate and reliable LSMs and, as a result, reduce the human and financial losses associated with landslide events.

Keywords