Scientific Reports (Mar 2025)
Spatial interpolation of cropland soil bulk density by increasing soil samples with filled missing values
Abstract
Abstract Large sample sizes are crucial for accurately capturing spatial changes in soil properties by spatial interpolation methods. However, soil bulk density (BD) data in historical datasets is often incomplete, and it’s uncertain if filled values enhance spatial interpolation accuracy. Using 2,883 cropland soil BD samples from the Sichuan Basin in China, we developed the best prediction models from traditional pedotransfer function (PTF), multiple linear regression (MLR), random forest (RF), and radial basis function neural network (RBFNN) to fill missing BD values for 1,336 samples. We then applied ordinary kriging (OK) and inverse distance weighting (IDW) to map soil BD, incorporating the filled BD as modeling points. The RBFNN model, tailored for each sub-watershed, yielded the highest accuracy in filling missing BD, with an increase in coefficient of determination (R 2) by 19.54–37.36% and reductions in mean absolute error (MAE), mean relative error (MRE) and root mean square error (RMSE) by 8.91–14.81%, 9.02–16.22% and 7.71–13.61%, respectively. Incorporating filled BD data reduced the MAE, MRE, and RMSE of OK and IDW by 4.17%, 4.36%, 4.96%, and 6.54%, 6.92%, 8.15%, respectively, significantly lowering spatial interpolation uncertainty. This methodology improves the accuracy of soil property mapping in regions with incomplete historical data.
Keywords