Journal of Cloud Computing: Advances, Systems and Applications (Sep 2022)

CLQLMRS: improving cache locality in MapReduce job scheduling using Q-learning

  • Rana Ghazali,
  • Sahar Adabi,
  • Ali Rezaee,
  • Douglas G. Down,
  • Ali Movaghar

DOI
https://doi.org/10.1186/s13677-022-00322-5
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Scheduling of MapReduce jobs is an integral part of Hadoop and effective job scheduling has a direct impact on Hadoop performance. Data locality is one of the most important factors to be considered in order to improve efficiency, as it affects data transmission through the system. A number of researchers have suggested approaches for improving data locality, but few have considered cache locality. In this paper, we present a state-of-the-art job scheduler, CLQLMRS (Cache Locality with Q-Learning in MapReduce Scheduler) for improving both data locality and cache locality using reinforcement learning. The proposed algorithm is evaluated by various experiments in a heterogeneous environment. Experimental results show significantly decreased execution time compared with FIFO, Delay, and the Adaptive Cache Local scheduler.

Keywords