Geoderma (Dec 2024)
A Moroccan soil spectral library use framework for improving soil property prediction: Evaluating a geostatistical approach
Abstract
A soil spectrum generated by any spectrometer requires a calibration model to estimate soil properties from it. To achieve best results, the assumption is that locally calibrated models offer more accurate predictions. However, achieving this higher accuracy comes with associated costs, complexity, and resource requirements, thus limiting widespread adoption. Furthermore, there is a lack of comprehensive frameworks for developing and utilizing soil spectral libraries (SSLs) to make predictions for specific samples. While calibration samples are necessary, there is the need to optimize SSL development through strategically determining the quantity, location, and timing of these samples based on the quality of the information in the library. This research aimed to develop a spatially optimized SSL and propose a use-framework tailored for predicting soil properties for a specific farmland context. Consequently, the Moroccan SSL (MSSL) was established utilizing a stratified spatially balanced sampling design, using six environmental covariates and FAO soil units. Subsequently, various criteria for calibration sample selection were explored, including a spatial autocorrelation of spectra principal component (PC) scores (spatial calibration sample selection), spectra similarity memory-based learner (MBL), and selection based on environmental covariate clustering. Twelve soil properties were used to evaluate these calibration sample selections to predict soil properties using the near infrared (NIR) and mid infrared (MIR) ranges. Among the methods assessed, we observed distinct precision improvements resulting from spatial sample selection and MBL compared to the use of the entire MSSL. Notably, the Lin’s Concordance Correlation Coefficient (CCC) values using the spatial calibration sample selection was improved for Olsen extractable phosphorus (OlsenP) by 41.3% and Mehlich III extractable phosphorus (P_M3) by 8.5% for the MIR spectra and for CEC by 25.6%, pH by 13.0% and total nitrogen (Tot_N) by 10.6% for the NIR spectra in reference to use of the entire MSSL. Utilizing the spatial autocorrelation of the spectra PC scores proved beneficial in identifying appropriate calibration samples for a new sample location, thereby enhancing prediction performance comparable to, or surpassing that of the use of the entire MSSL. This study signifies notable advancement in crafting targeted models tailored for specific samples within a vast and diverse SSL.