A survey on data storage and placement methodologies for Cloud-Big Data ecosystem

Somnath Mazumdar; Daniel Seybold; Kyriakos Kritikos; Yiannis Verginadis

doi:10.1186/s40537-019-0178-3

Journal of Big Data (Feb 2019)

A survey on data storage and placement methodologies for Cloud-Big Data ecosystem

Somnath Mazumdar,
Daniel Seybold,
Kyriakos Kritikos,
Yiannis Verginadis

Affiliations

Somnath Mazumdar: Simula Research Laboratory
Daniel Seybold: Ulm University
Kyriakos Kritikos: ICS-FORTH
Yiannis Verginadis: Institute of Communication and Computer Systems (ICCS)

DOI: https://doi.org/10.1186/s40537-019-0178-3
Journal volume & issue: Vol. 6, no. 1
pp. 1 – 37

Abstract

Read online

Abstract Currently, the data to be explored and exploited by computing systems increases at an exponential rate. The massive amount of data or so-called “Big Data” put pressure on existing technologies for providing scalable, fast and efficient support. Recent applications and the current user support from multi-domain computing, assisted in migrating from data-centric to knowledge-centric computing. However, it remains a challenge to optimally store and place or migrate such huge data sets across data centers (DCs). In particular, due to the frequent change of application and DC behaviour (i.e., resources or latencies), data access or usage patterns need to be analyzed as well. Primarily, the main objective is to find a better data storage location that improves the overall data placement cost as well as the application performance (such as throughput). In this survey paper, we are providing a state of the art overview of Cloud-centric Big Data placement together with the data storage methodologies. It is an attempt to highlight the actual correlation between these two in terms of better supporting Big Data management. Our focus is on management aspects which are seen under the prism of non-functional properties. In the end, the readers can appreciate the deep analysis of respective technologies related to the management of Big Data and be guided towards their selection in the context of satisfying their non-functional application requirements. Furthermore, challenges are supplied highlighting the current gaps in Big Data management marking down the way it needs to evolve in the near future.

Published in Journal of Big Data

ISSN: 2196-1115 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journalofbigdata.springeropen.com

About the journal

Abstract

Keywords