大数据 (Jul 2024)

System performance optimization practice for big data scenarios

  • Jibin WANG,
  • Hailong YANG,
  • Kai FENG,
  • Xin SUN,
  • Minda ZHANG,
  • Kelun LEI,
  • Zhiwen XIAO,
  • Yifei ZHANG,
  • Jiaxi WU

Journal volume & issue
Vol. 10
pp. 21 – 33

Abstract

Read online

In the existing large-scale distributed environments, there is still much room for improvement in the performance and computational efficiency of big data applications.However, performance analysis and optimization in large-scale environments requires a large number of human resources from domain experts.This paper proposes a general lowperformance query statement detection and optimization process for performance optimization in big data applications, summarizes four types of low-performance behaviors that significantly affect the performance of big data applications, and proposes specific optimization strategies for each type of low-performance behavior.Finally, through experimental evaluation, the effectiveness of the optimization scheme in actual large-scale cluster is verified.

Keywords