Applied Sciences (Nov 2024)
Versatility Evaluation of Landslide Risk with Window Sizes and Sampling Techniques Based on Deep Learning
Abstract
Across the globe, landslides cause significant loss of life, injuries, and widespread damage to homes and infrastructure. Therefore, assessing and analyzing landslide hazards is crucial to human, environmental, cultural, economic, and social sustainability. This study utilizes ArcGIS 10.8 and Python 3.9 to create landslide databases for Niigata Prefecture (NIG), Iwate and Miyagi Prefectures (IWT-MYG), and Hokkaido (HKD), drawing on data obtained from the National Research Institute for Earth Science and Disaster Resilience, Japan. A distinguishing feature of this study is the application of a Convolutional Neural Network (CNN), which significantly outperforms traditional machine learning models in image-based pattern recognition by extracting contextual information from surrounding areas, a distinct advantage in image and pattern recognition tasks. Unlike conventional methods that often require manual feature selection and engineering, CNNs automate feature extraction, enabling a more nuanced understanding of complex patterns. By experimenting with CNN input window sizes ranging from 3 × 3 to 27 × 27 pixels and employing diverse sampling techniques, we demonstrate that larger windows enhance the model’s predictive accuracy by capturing a wider range of environmental interactions critical for effective landslide modeling. CNN models with 19 × 19 pixel windows typically yield the best overall performance, with CNN-19 achieving an AUC of 0.950, 0.982 and 0.969 for NIG, HKD, and IWT-MYG, respectively. Furthermore, we improve prediction reliability using oversampling and a random window-moving method. For instance, in the NIG region, the AUC of the oversampling CNN-19 is 0.983, while the downsampling AUC is 0.950). These techniques, less commonly applied in traditional machine learning approaches to landslide detection, help address the issue of data imbalance often seen in landslide datasets, where instances of landslides are far outnumbered by non-landslide occurrences. While challenges remain in enhancing the model’s generalization, this research makes significant progress in developing more robust and adaptable tools for landslide prediction, which are vital for ensuring environmental and societal resilience.
Keywords