Developing the Raster Big Data Benchmark: A Comparison of Raster Analysis on Big Data Platforms

David Haynes; Philip Mitchell; Eric Shook

doi:10.3390/ijgi9110690

ISPRS International Journal of Geo-Information (Nov 2020)

Affiliations

David Haynes: Institute for Health Informatics, University of Minnesota, Minneapolis, MN 55455, USA
Philip Mitchell: Ali I. Al-Naimi Petroleum Engineering Research Center, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia
Eric Shook: Geography Environment and Society, University of Minnesota, Minneapolis, MN 55455, USA

DOI: https://doi.org/10.3390/ijgi9110690
Journal volume & issue: Vol. 9, no. 11, p. 690

Abstract

Technologies around the world produce and interact with geospatial data instantaneously, from mobile web applications to satellite imagery that is collected and processed across the globe daily. Big raster data allow researchers to integrate and uncover new knowledge about geospatial patterns and processes. However, we are at a critical moment, as we have an ever-growing number of big data platforms that are being co-opted to support spatial analysis. A gap in the literature is the lack of a robust assessment comparing the efficiency of raster data analysis on big data platforms. This research begins to address this issue by establishing a raster data benchmark that employs freely accessible datasets to provide a comprehensive performance evaluation and comparison of raster operations on big data platforms. The benchmark is critical for evaluating the performance of spatial operations on big data platforms. The benchmarking datasets and operations are applied to three big data platforms. We report computing times and performance bottlenecks so that GIScientists can make informed choices regarding the performance of each platform. Each platform is evaluated for five raster operations: pixel count, reclassification, raster add, focal averaging, and zonal statistics, using three different raster datasets.
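For readers unfamiliar with the five operations named in the abstract, the following is a minimal single-machine sketch of what each one computes on a small in-memory grid. It is illustrative only and is not the benchmark's implementation on any of the three platforms. The function names, the 3x3 window with truncated edges for focal averaging, and the choice of the mean as the zonal statistic are all assumptions made here for the example; the abstract does not specify them.

```python
from collections import defaultdict

# Hypothetical helper names; rasters are plain lists of rows.

def pixel_count(raster, value):
    """Count the cells whose value equals `value`."""
    return sum(row.count(value) for row in raster)

def reclassify(raster, mapping, default=0):
    """Replace each cell using `mapping`; unmapped values become `default`."""
    return [[mapping.get(v, default) for v in row] for row in raster]

def raster_add(a, b):
    """Cell-by-cell sum of two rasters with the same shape."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]

def focal_mean(raster):
    """3x3 moving-window mean (assumed window size).
    Edge cells average only the neighbors that exist."""
    rows, cols = len(raster), len(raster[0])
    out = []
    for r in range(rows):
        out_row = []
        for c in range(cols):
            window = [raster[i][j]
                      for i in range(max(r - 1, 0), min(r + 2, rows))
                      for j in range(max(c - 1, 0), min(c + 2, cols))]
            out_row.append(sum(window) / len(window))
        out.append(out_row)
    return out

def zonal_mean(values, zones):
    """Mean of `values` within each zone id of `zones` (mean is an assumed statistic)."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for value_row, zone_row in zip(values, zones):
        for v, z in zip(value_row, zone_row):
            totals[z] += v
            counts[z] += 1
    return {z: totals[z] / counts[z] for z in totals}

# Toy inputs, invented for this example.
values = [[1, 2, 3],
          [4, 5, 6],
          [7, 8, 9]]
zones = [[1, 1, 2],
         [1, 2, 2],
         [3, 3, 3]]
```

The operations differ in how much neighboring data each output cell needs: pixel count, reclassification, and raster add touch one cell at a time, focal averaging needs a neighborhood around each cell, and zonal statistics groups cells by a second raster. That difference is one plausible reason the same platform could perform differently across the five operations.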

Published in ISPRS International Journal of Geo-Information

ISSN: 2220-9964 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Geography. Anthropology. Recreation: Geography (General)
Website: http://www.mdpi.com/journal/ijgi
