Frontiers in Big Data (Oct 2023)

How big is Big Data? A comprehensive survey of data production, storage, and streaming in science and industry

  • Luca Clissa,
  • Luca Clissa,
  • Mario Lassnig,
  • Lorenzo Rinaldi,
  • Lorenzo Rinaldi

DOI
https://doi.org/10.3389/fdata.2023.1271639
Journal volume & issue
Vol. 6

Abstract

Read online

The contemporary surge in data production is fueled by diverse factors, with contributions from numerous stakeholders across various sectors. Comparing the volumes at play among different big data entities is challenging due to the scarcity of publicly available data. This survey aims to offer a comprehensive perspective on the orders of magnitude involved in yearly data generation by some public and private leading organizations, using an array of online sources for estimation. These estimates are based on meaningful, individual data production metrics and plausible per-unit sizes. The primary objective is to offer insights into the comparative scales of major big data players, their sources, and data production flows, rather than striving for precise measurements or incorporating the latest updates. The results are succinctly conveyed through a visual representation of the relative data generation volumes across these entities.

Keywords