Earth System Science Data (Jan 2024)

Reference maps of soil phosphorus for the pan-Amazon region

  • J. P. Darela-Filho,
  • J. P. Darela-Filho,
  • J. P. Darela-Filho,
  • A. Rammig,
  • K. Fleischer,
  • K. Fleischer,
  • T. Reichert,
  • L. F. Lugli,
  • C. A. Quesada,
  • L. C. C. Hurtarte,
  • L. C. C. Hurtarte,
  • M. D. de Paula,
  • D. M. Lapola

DOI
https://doi.org/10.5194/essd-16-715-2024
Journal volume & issue
Vol. 16
pp. 715 – 729

Abstract

Read online

Phosphorus (P) is recognized as an important driver of terrestrial primary productivity across biomes. Several recent developments in process-based vegetation models aim at the concomitant representation of the carbon (C), nitrogen (N), and P cycles in terrestrial ecosystems, building upon the ecological stoichiometry and the processes that govern nutrient availability in soils. Thus, understanding the spatial distribution of P forms in soil is fundamental to initializing and/or evaluating process-based models that include the biogeochemical cycle of P. One of the major constraints for the large-scale application of these models is the lack of data related to the spatial patterns of the various forms of P present in soils, given the sparse nature of in situ observations. We applied a model selection approach based on random forest regression models trained and tested for the prediction of different P forms (total, available, organic, inorganic, and occluded P) – obtained by the Hedley sequential extraction method. As input for the models, reference soil group and textural properties, geolocation, N and C contents, terrain elevation and slope, soil pH, and mean annual precipitation and temperature from 108 sites of the RAINFOR network were used. The selected models were then applied to predict the target P forms using several spatially explicit datasets containing contiguous estimated values across the area of interest. Here, we present a set of maps depicting the distribution of total, available, organic, inorganic, and occluded P forms in the topsoil profile (0–30 cm) of the pan-Amazon region in the spatial resolution of 5 arcmin. The random forest regression models presented a good level of mean accuracy for the total, available, organic, inorganic, and occluded P forms (77.37 %, 76,86 %, 75.14 %, 68.23 %, and 64.62% respectively). Our results confirm that the mapped area generally has very low total P concentration status, with a clear gradient of soil development and nutrient content. Total N was the most important variable for the prediction of all target P forms and the analysis of partial dependence indicates several features that are also related with soil concentration of all target P forms. We observed that gaps in the data used to train and test the random forest models, especially in the most elevated areas, constitute a problem to the methods applied here. However, most of the area could be mapped with a good level of accuracy. Also, the biases of gridded data used for model prediction are introduced in the P maps. Nonetheless, the final map of total P resembles the expected geographical patterns. Our maps may be useful for the parametrization and evaluation of process-based terrestrial ecosystem models as well as other types of models. Also, they can promote the testing of new hypotheses about the gradient and status of P availability and soil-vegetation feedback in the pan-Amazon region. The reference maps can be downloaded from https://doi.org/10.25824/redu/FROESE (Darela-Filho and Lapola, 2023).