When wavelet decomposition meets external attention: a lightweight cloud server load prediction model

Zhen Zhang; Chen Xu; Jinyu Zhang; Zhe Zhu; Shaohua Xu

doi:10.1186/s13677-024-00698-6

Journal of Cloud Computing: Advances, Systems and Applications (Aug 2024)

When wavelet decomposition meets external attention: a lightweight cloud server load prediction model

Zhen Zhang,
Chen Xu,
Jinyu Zhang,
Zhe Zhu,
Shaohua Xu

Affiliations

Zhen Zhang: School of Computer Science and Engineering, Shandong University of Science and Technology
Chen Xu: School of Computer Science, Fudan University
Jinyu Zhang: School of Computer Science and Engineering, Shandong University of Science and Technology
Zhe Zhu: Shandong Artificial Intelligence Institute, Qilu University of Technology (Shandong Academy of Sciences)
Shaohua Xu: School of Computer Science and Engineering, Shandong University of Science and Technology

DOI: https://doi.org/10.1186/s13677-024-00698-6
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 19

Abstract

Read online

Abstract Load prediction tasks aim to predict the dynamic trend of future load based on historical performance sequences, which are crucial for cloud platforms to make timely and reasonable task scheduling. However, existing prediction models are limited while capturing complicated temporal patterns from the load sequences. Besides, the frequently adopted global weighting strategy (e.g., the self-attention mechanism) in temporal modeling schemes has quadratic computational complexity, hindering the immediate response of cloud servers in complex real-time scenarios. To address the above limitations, we propose a Wavelet decomposition-enhanced External Transformer (WETformer) to provide accurate yet efficient load prediction for cloud servers. Specifically, we first incorporate discrete wavelet transform to progressively extract long-term trends, highlighting the intrinsic attributes of temporal sequences. Then, we propose a lightweight multi-head External Attention (EA) mechanism to simultaneously consider the inter-element relationships within load sequences and the correlations across different sequences. Such an external component has linear computational complexity, mitigating the encoding redundancy prevalent and enhancing prediction efficiency. Extensive experiments conducted on Alibaba Cloud’s cluster tracking dataset demonstrate that WETformer achieves superior prediction accuracy and the shortest inference time compared to several state-of-the-art baseline methods.

Published in Journal of Cloud Computing: Advances, Systems and Applications

ISSN: 2192-113X (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journalofcloudcomputing.springeropen.com

About the journal

Abstract

Keywords