Database Systems Journal (Jul 2015)
Approaches for parallel data loading and data querying
Abstract
This paper aims to bring contributions in data loading and data querying using products from the Apache Hadoop ecosystem. Currently, we talk about Big Data at up to zettabytes scale (10^21 bytes). Research in this area is usually interdisciplinary combining elements from statistics, system integration, parallel processing and cloud computing.