Agriculture (Sep 2024)
Soil Salinity Prediction in an Arid Area Based on Long Time-Series Multispectral Imaging
Abstract
Traditional soil salinity measurement methods are generally complex and labor-intensive, restricting the long-term monitoring of soil salinity, particularly in arid areas. In this context, the soil salt content (SSC) data from farms in the Heihe River Basin in Northwest China were collected in three consecutive years (2021, 2022, and 2023). In addition, the spectral reflectance and texture features of different sampling sites in the study area were extracted from long-term unmanned aerial vehicle (UAV) multispectral images to replace the red and near-infrared bands with a newly introduced red edge band. The spectral index was calculated in this study before using four sensitive variable combinations to predict soil salt contents. A Pearson correlation analysis was performed in this study to screen 57 sensitive features. In addition, 36 modeling scenarios were conducted based on the Extreme Gradient Boosting (XGBoost Implemented using R language 4.3.1), Backpropagation Neural Network (BPNN), and Random Forest (RF) algorithms. The most optimal algorithms for predicting the soil salt contents in farmland located in the Heihe River Basin, in the arid region of Northwest China, were determined. The results showed a higher prediction accuracy for the XGBoost algorithm than the RF and BPNN algorithms, accurately reflecting the actual soil salt contents in the arid area. On the other hand, the most accurate predicted soil salt contents were obtained in 2023 using the XGBoost algorithm, with coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE) ranges of 0.622–0.820, 0.086–0.157, and 0.078–0.134, respectively, whereas the most stable prediction results were obtained using the collected data in 2022. From the perspective of different sensitive variable input combinations, the implementation of the XGBoost algorithm using the spectral index–spectral reflectance–texture feature input combination resulted in comparatively higher prediction accuracies than those of the other variable combinations in 2022 and 2023. Specifically, the R2, RMSE, and MAE values obtained using the spectral index–spectral reflectance–texture feature input combination were 0.674, 0.133, and 0.086 in 2022 and 0.820, 0.165, and 0.134 in 2023, respectively. Therefore, our results demonstrated that the spectral index–spectral reflectance–texture feature was the optimal sensitive variable input combination for the machine learning algorithms, of which the XGBoost algorithm is the most optimal model for predicting soil salt contents. The results of this study provide a theoretical basis for the rapid and accurate prediction of soil salinity in arid areas.
Keywords