A Review of Polyglot Persistence in the Big Data World

Pwint Phyu Khine; Zhaoshun Wang

doi:10.3390/info10040141

Information (Apr 2019)

Affiliations

Pwint Phyu Khine: Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing (USTB), Beijing 100083, China
Zhaoshun Wang: Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing (USTB), Beijing 100083, China

Journal volume & issue: Vol. 10, no. 4, p. 141

Abstract

Big data and distributed systems are inevitably linked, because the characteristics of big data cannot easily be handled by a standalone, centralized approach. Among the concepts of distributed systems, the CAP theorem (Consistency, Availability, and Partition Tolerance) points to the prominent use of eventual consistency in distributed systems. This has prompted the need for types of databases beyond SQL (Structured Query Language) that offer scalability and availability. NoSQL (Not-Only SQL) databases, mostly with BASE properties (Basically Available, Soft State, and Eventual Consistency), are gaining ground in the big data era, while SQL databases are left trying to keep up with this paradigm shift. However, none of these databases is perfect, as no single model fits all requirements of data-intensive systems. Polyglot persistence, i.e., using different databases as appropriate for the different components within a single system, is becoming prevalent in data-intensive big data systems, as they are distributed and parallel by nature. This paper reflects on the characteristics of these databases from a conceptual point of view and describes a potential solution for distributed systems: the adoption of polyglot persistence in data-intensive systems in the big data era.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/
