On the Stability of the Kubernetes Horizontal Autoscaler Control Loop

Berta Serracanta; Andor Lukacs; Alberto Rodriguez-Natal; Albert Cabellos; Gabor Retvari

doi:10.1109/ACCESS.2025.3526751

IEEE Access (Jan 2025)

On the Stability of the Kubernetes Horizontal Autoscaler Control Loop

Berta Serracanta,
Andor Lukacs,
Alberto Rodriguez-Natal,
Albert Cabellos,
Gabor Retvari

Affiliations

Berta Serracanta: ORCiD; Department of Computer Architecture, Universitat Politècnica de Catalunya, Barcelona, Spain
Andor Lukacs: ORCiD; Faculty of Mathematics and Computer Science, Babeş-Bolyai University, Cluj-Napoca, Romania
Alberto Rodriguez-Natal: ORCiD; Cisco, Madrid, Spain
Albert Cabellos: Department of Computer Architecture, Universitat Politècnica de Catalunya, Barcelona, Spain
Gabor Retvari: ORCiD; Department of Telecommunications and Artificial Intelligence, Budapest University of Technology and Economics, Budapest, Hungary

DOI: https://doi.org/10.1109/ACCESS.2025.3526751
Journal volume & issue: Vol. 13
pp. 7160 – 7166

Abstract

Read online

Kubernetes is a widely used platform for deploying and managing containerized applications due to its efficient elastic capabilities. The Horizontal Pod Autoscaler (HPA) in Kubernetes independently adjusts the number of pods for each service, yet these services often operate in an interconnected manner. This study aims to understand the effects of autoscaling events on a graph of interconnected services. To achieve this, we apply control theory to model the HPA’s behavior. We analyze the stability of this model, perform numerical simulations, and deploy a real testbed to evaluate the performance. Our findings demonstrate that the control theory-based model accurately predicts the HPA’s behavior, ensuring system stability with CPU utilization meeting desired thresholds and no traffic loss after a transitional period. The model provides insights into optimizing resource scheduling and improving application performance in Kubernetes environments. Additionally, we extend our model to the whole service graph to understand how individual scaling decisions influence the complex graphs of cloud applications.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords