Comparing sample bias correction methods for species distribution modeling using virtual species

Richard Inman; Janet Franklin; Todd Esque; Kenneth Nussear

doi:10.1002/ecs2.3422

Ecosphere (Mar 2021)

Comparing sample bias correction methods for species distribution modeling using virtual species

Richard Inman,
Janet Franklin,
Todd Esque,
Kenneth Nussear

Affiliations

Richard Inman: School of Geographical Sciences and Urban Planning Arizona State University PO Box 875302 Tempe Arizona85287USA
Janet Franklin: Department of Botany and Plant Sciences University of California – Riverside 900 University Ave Riverside California92521USA
Todd Esque: U.S. Geological Survey Western Ecological Research Center Henderson Field Station 160 N. Stephanie Street Henderson Nevada89074USA
Kenneth Nussear: Department of Geography University of Nevada – Reno 1664 N. Virginia Street Reno Nevada89557USA

DOI: https://doi.org/10.1002/ecs2.3422
Journal volume & issue: Vol. 12, no. 3
pp. n/a – n/a

Abstract

Read online

Abstract A key assumption in species distribution modeling (SDM) with presence‐background (PB) methods is that sampling of occurrence localities is unbiased and that any sampling bias is proportional to the background distribution of environmental covariates. This assumption is rarely met when SDM practitioners rely on federated museum records from natural history collections for geo‐located occurrences due to inherent sampling bias found in these collections. We use a simulation approach to explore the effectiveness of three methods developed to account for sampling bias in SDM with PB frameworks. Two of the methods rely on careful filtering of observation data—geographic thinning (G‐Filter) and environmental thinning (E‐Filter)—while a third, FactorBiasOut, creates selection weights for background data to bias locations toward areas where the observation dataset was sampled. While these methods have been assessed previously, evaluation has emphasized spatial predictions of habitat potential. Here, we dig deeper into the effectiveness of these methods by exploring how sampling bias not only affects predictions of habitat potential, but also our understanding of niche characteristics such as which explanatory variables and response curves best represent species–environment relationships. We simulate 100 virtual species ranging from generalist to specialist in their habitat preferences and introduce geographic and environmental bias at three intensity levels to measure the effectiveness of each correction method to (1) predict true probability of occurrence across a study area, (2) recover true species–environment relationships, and (3) identify true explanatory variables. We find that the FactorBiasOut most often showed the greatest improvement in recreating known distributions but did no better at correctly identifying environmental covariates or recreating species–environment relationships than G‐Filter or E‐Filter methods. Narrow niche species are most problematic for biased calibration datasets, such that correction methods can, in some cases, make predictions worse.

Published in Ecosphere

ISSN: 2150-8925 (Online)
Publisher: Wiley
Country of publisher: United States
LCC subjects: Science: Biology (General): Ecology
Website: https://esajournals.onlinelibrary.wiley.com/journal/21508925

About the journal

Abstract

Keywords