Journal of Integrative Agriculture (Aug 2024)

Improving model performance in mapping cropland soil organic matter using time-series remote sensing data

  • Xianglin Zhang,
  • Jie Xue,
  • Songchao Chen,
  • Zhiqing Zhuo,
  • Zheng Wang,
  • Xueyao Chen,
  • Yi Xiao,
  • Zhou Shi

Journal volume & issue
Vol. 23, no. 8
pp. 2820 – 2841

Abstract

Read online

Faced with increasing global soil degradation, spatially explicit data on cropland soil organic matter (SOM) provides crucial data for soil carbon pool accounting, cropland quality assessment and the formulation of effective management policies. As a spatial information prediction technique, digital soil mapping (DSM) has been widely used to spatially map soil information at different scales. However, the accuracy of digital SOM maps for cropland is typically lower than for other land cover types due to the inherent difficulty in precisely quantifying human disturbance. To overcome this limitation, this study systematically assessed a framework of “information extraction-feature selection-model averaging” for improving model performance in mapping cropland SOM using 462 cropland soil samples collected in Guangzhou, China in 2021. The results showed that using the framework of dynamic information extraction, feature selection and model averaging could efficiently improve the accuracy of the final predictions (R2: 0.48 to 0.53) without having obviously negative impacts on uncertainty. Quantifying the dynamic information of the environment was an efficient way to generate covariates that are linearly and nonlinearly related to SOM, which improved the R2 of random forest from 0.44 to 0.48 and the R2 of extreme gradient boosting from 0.37 to 0.43. Forward recursive feature selection (FRFS) is recommended when there are relatively few environmental covariates (500). The Granger-Ramanathan model averaging approach could improve the prediction accuracy and average uncertainty. When the structures of initial prediction models are similar, increasing in the number of averaging models did not have significantly positive effects on the final predictions. Given the advantages of these selected strategies over information extraction, feature selection and model averaging have a great potential for high-accuracy soil mapping at any scales, so this approach can provide more reliable references for soil conservation policy-making.

Keywords