Agrosystems, Geosciences & Environment (Jun 2024)
A data‐driven topsoil classification framework to support soil health assessment in Minnesota
Abstract
Abstract Soil health assessments aim to quantify soil health status using indicators linked to ecosystem services such as yield, nutrient cycling, water cycling, or carbon storage. Many indicators are related to soil biological processes, which can be challenging to interpret because they are sensitive not only to management, but also to nonmanagement variables such as soil inherent properties, topography, and climate. Existing studies address this challenge by grouping similar soils by taxonomy, geography, or a combination of these and other variables for soil health assessment. We investigated whether grouping soils based on multiple quantitative topsoil properties could be an alternative to taxonomic or geographic groups. We used an unsupervised classification algorithm, k‐means, to cluster publicly available soil and climate data for Minnesota. Clustering into eight conceptual groups (“clusters”) based on 10 topsoil properties was determined to be the optimal algorithm output. We evaluated the ability of our soil clusters and other grouping methods to explain variance in eight soil health indicators. We found the combination of Major Land Resource Area (MLRA) and soil cluster performed best, explaining as much or more variance than other groupings for five of the eight indicators. The clusters distinguish zones of topsoil variation at the field scale, and MLRAs account for broader scale variation in climate and other landscape factors. The approach we describe is flexible and could be applied at different locations and scales to produce conceptual soil groups and associated maps to support soil health test sampling and interpretation at the field scale.