GIScience & Remote Sensing (Dec 2024)
Heterogeneous transfer learning considering feature representation and environmental consistency for landslide spatial prediction
Abstract
The difficult, time-consuming, and imbalanced acquisition of landslide inventories in complex and heterogeneous large areas often results in limited predictive performance for most statistical landslide spatial prediction methods. Although partial transfer learning methods have produced reliable predictive results and successfully implemented knowledge transfer between data-rich and data-scarce areas, most of these methods generally only extract inadequate environmental features and lack the ability to interpret when, what, and how to effectively transfer knowledge to other regions with limited data. In this paper, a heterogeneous transfer learning method considering feature representations and environmental consistency, which features robustness, similarity, and transferability, is proposed for landslide spatial prediction. Specifically, we trained a stacked autoencoder (SAE) to extract more nonlinear features among environmental factors, and added an environmental similarity criterion to the transfer adaptation boosting (TrAdaBoost) algorithm to minimize feature dissimilarities and avoid negative transfer in different scenarios. To evaluate the robustness of the proposed method, we first selected two target areas (Lushan County and Luding County, China) and a source area (Wenchuan County, China) as case study areas. Then, we directly combined the source area and target area as an additional dataset without considering transfer learning to validate the significance and necessity of the proposed method. The area under the receiver operating characteristic curve (AUC) of the two target regions for the proposed method were 0.920 and 0.972, respectively, which were greater than those of the traditional TrAdaBoost (0.909 and 0.969, respectively), SAE (0.790 and 0.937, respectively), and random forest (0.915 and 0.966, respectively) methods. Furthermore, the AUC values of the SAE (0.851 and 0.900) and random forest (0.890 and 0.935) models based on the expanded datasets were also lower than those of the proposed method. Therefore, the experimental results show that the proposed method can be generalized well due to its efficient utilization and high adaptability. Moreover, the proposed method can not only be applied to emergency rescue and disaster prevention but can also offer a promising way to improve landslide predictions of models with incomplete landslide inventories.
Keywords