Geomatics, Natural Hazards & Risk (Dec 2024)

Ensemble learning landslide susceptibility assessment with optimized non-landslide samples selection

  • Jiangang Lu,
  • Yi He,
  • Lifeng Zhang,
  • Qing Zhang,
  • Binghai Gao,
  • Hesheng Chen,
  • Yumin Fang

DOI
https://doi.org/10.1080/19475705.2024.2378176
Journal volume & issue
Vol. 15, no. 1

Abstract

Read online

Non-landslide samples influence the outcomes of landslide susceptibility assessment. Existing studies did not fully consider the equilibrium between landslide and non-landslide samples in similar environments, resulting in poor reliability of landslide susceptibility assessment. This study proposed a non-landslide samples optimization method with a constraint of disaster-pregnant environment similarity to construct the balanced landslide and non-landslide samples. We employed the heterogeneous stacking and blending ensemble learning models to generate the landslide susceptibility assessment. This study focused on the Bailong River Basin with complex environment and frequent landslides as the study area. First, we extracted 12 landslide influencing factors based on multiple sources and analyzed the spatial distribution patterns of landslides. Second, we constructed similar environments based on the assessment units obtained from the curvature watershed method and selected an equal amount of both landslide and non-landslide samples in every different environment. Finally, three classic neural network models, namely multilayer perceptron, convolutional neural network, and gated recurrent unit were used as base models for stacking and blending ensemble learning models to assess landslide susceptibility. The findings suggested that the landslide susceptibility assessment results with optimized non-landslide samples were more reliable, especially the proposed method improved the prediction results in sample-sparse regions. The stacking ensemble constructed in this study demonstrated the highest area under the curve of 0.88 for the testing dataset, outperforming the blending ensemble and three base models. The issue of unreliable landslide susceptibility assessment results in sample-sparse regions within complex environments can be effectively addressed by considering the balanced sampling of non-landslide samples under the constraint of similar environments.

Keywords