Statistical tests for non-independent partitions of large autocorrelated datasets

Anthony R. Ives; Likai Zhu; Fangfang Wang; Jun Zhu; Clay J. Morrow; Volker C. Radeloff

MethodsX (Jan 2022)

Statistical tests for non-independent partitions of large autocorrelated datasets

Anthony R. Ives,
Likai Zhu,
Fangfang Wang,
Jun Zhu,
Clay J. Morrow,
Volker C. Radeloff

Affiliations

Anthony R. Ives: Corresponding author.; Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
Likai Zhu: Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
Fangfang Wang: Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
Jun Zhu: Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
Clay J. Morrow: Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
Volker C. Radeloff: Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA

Journal volume & issue: Vol. 9
p. 101660

Abstract

Read online

Large sets of autocorrelated data are common in fields such as remote sensing and genomics. For example, remote sensing can produce maps of information for millions of pixels, and the information from nearby pixels will likely be spatially autocorrelated. Although there are well-established statistical methods for testing hypotheses using autocorrelated data, these methods become computationally impractical for large datasets.• The method developed here makes it feasible to perform F-tests, likelihood ratio tests, and t-tests for large autocorrelated datasets. The method involves subsetting the dataset into partitions, analyzing each partition separately, and then combining the separate tests to give an overall test.• The separate statistical tests on partitions are non-independent, because the points in different partitions are not independent. Therefore, combining separate analyses of partitions requires accounting for the non-independence of the test statistics among partitions.• The methods can be applied to a wide range of data, including not only purely spatial data but also spatiotemporal data. For spatiotemporal data, it is possible to estimate coefficients from time-series models at different spatial locations and then analyze the spatial distribution of the estimates. The spatial analysis can be simplified by estimating spatial autocorrelation directly from the spatial autocorrelation among time series.

Method for performing statistical tests using non-independent data partitions

Published in MethodsX

ISSN: 2215-0161 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Science
Website: http://www.journals.elsevier.com/methodsx/

About the journal

Abstract

Keywords