Geoderma (Mar 2024)
Quantification of soil organic carbon in particle size fractions using a near-infrared spectral library in West Africa
Abstract
Particle size fractionation enables a better understanding of soil organic carbon (C) dynamics since it separates fractions that differ in composition, residence time and function. However, this method is time-consuming and tedious; thus, its use has been greatly limited. Our objective was to evaluate the ability of an existing soil spectral library (SSL) from different regions of West Africa to predict the C amount in the fractions (gC kg−1 soil) of the samples in a new target set from Benin.The SSL included 181 samples from five countries, and the target set included 94 samples (depth ≤ 40 cm), most of which were coarse-textured; near-infrared reflectance (NIR) spectra were collected for 2 mm sieved samples (non-fractionated samples). The predicted variables were the C amounts in the non-fractionated soil and in the 50 µm fractions (F50, respectively). Different methods were tested to optimize the predictions: (i) SSL enrichment with 10 or 15 samples selected from the target set (spiking) and replicated six times (i.e. extra-weighted); (ii) locally weighted (local) partial least squares regression (PLSR), which is calibration by the spectral neighbours with the highest weights attributed to closest neighbours, and was compared to “global” (i.e., common) PLSR, where all calibration samples equally contribute; and (iii) spectrum pretreatments (e.g., smoothing, centring, derivatization). In addition, the intermediate precision of the conventional data (standard error of laboratory; SELint) was estimated through triplicate fractionation of three samples carried out by three operators (one per replicate).When the SSL alone was used for calibration, the predictions were inaccurate for the C amounts in the non-fractionated soil and in F50, with minimal benefit from the local PLSR over the global PLSR in general. For the non-fractionated soil, F50, the ratios of performance to the interquartile range in the validation set, RPIQVAL, were 1.6–1.8, 1.6–1.7, 1.9 and 1.9–2.1, respectively. Calibration with SSL spiked (i.e., completed with spiking samples) yielded an increase in RPIQVAL from 33 to 56% for the C amount in the non-fractionated soil and F50 (RPIQVAL reached 2.4–2.5, 2.2–2.3, 1.9–2.0 and 2.1–2.3, respectively), and the benefit of local PLSR was still limited. The SELint was based on a few samples and thus only provided a rough estimation; this estimate represented at least 65% of the prediction error for the C amounts in the fractions. Therefore, the SELint needs to be determined more extensively to both improve the model accuracy and refine the interpretation of the predictions based on NIR spectra. This library should be enriched with samples from other sites to represent other soil types.