Ain Shams Engineering Journal (Mar 2022)

Fault tolerance in big data storage and processing systems: A review on challenges and solutions

  • Muntadher Saadoon,
  • Siti Hafizah Ab. Hamid,
  • Hazrina Sofian,
  • Hamza H.M. Altarturi,
  • Zati Hakim Azizul,
  • Nur Nasuha

DOI
https://doi.org/10.1016/j.asej.2021.06.024
Journal volume & issue
Vol. 13, no. 2
p. 101538

Abstract

Read online

Big data systems are sufficiently stable to store and process a massive volume of rapidly changing data. However, big data systems are composed of large-scale hardware resources that make their subspecies easily fail. Fault tolerance is the main property of such systems because it maintains availability, reliability, and constant performance during faults. Achieving an efficient fault tolerance solution in a big data system is challenging because fault tolerance must meet some constraints related to the system performance and resource consumption. This study aims to provide a consistent understanding of fault tolerance in big data systems and highlights common challenges that hinder the improvement in fault tolerance efficiency. The fault tolerance solutions applied by previous studies intended to address the identified challenges are reviewed. The paper also presents a perceptive discussion of the findings derived from previous studies and proposes a list of future directions to address the fault tolerance challenges.

Keywords