IEEE Access (Jan 2021)

A Systematic Review of Data Models for the Big Data Problem

  • Faezeh Mostajabi,
  • Ali Asghar Safaei,
  • Amir Sahafi

DOI
https://doi.org/10.1109/ACCESS.2021.3112880
Journal volume & issue
Vol. 9
pp. 128889 – 128904

Abstract

Read online

Nowadays, data are generated in a continuous streaming manner as the inputs of various applications. The sources of such generated data can be wired or wireless sensor networks commonly used in various fields of geographical, traffic, Internet of Things (IoT), financial tickers, Web2 and Web3, e-commerce, social networks, and online communities. The high volume, high variety, and high velocity of data have recently posed the challenge of 3Vs to this field, also known as the Big Data Problem. The 3Vs dimensions of complexities for the big data entails high-speed storage, scalability of database systems, suitable data models, real-time responsiveness and so on. Data model, as the representation schema of data is an essential issue since many others (e.g., DBMS systems’ design, DB languages, etc.) rely on. So, the study of data models is a key and fundamental aspect in structuring, organizing, storing, and manipulating big data. It is also the essence in various areas of cloud migration, web-scale, and so forth. In this paper, we have systematically reviewed different types of data models, the rationale behind them, their applications and support capabilities, and the technologies to switch from one model to another. To address the user needs in various fields, a systematic review method is adopted to classify and present different types and characteristics of data models.

Keywords