Dynamic Sizing of Cloud-Native Telco Data Centers With Digital Twin and Reinforcement Learning

Angelos Pentelas; Dimitris Katsiros; Dimitra Paranou; George Doukas; Konstantinos Chondralis; Giorgos Giannopoulos; Evangelos Angelou; George Papastefanatos

doi:10.1109/ACCESS.2024.3421289

IEEE Access (Jan 2024)

Dynamic Sizing of Cloud-Native Telco Data Centers With Digital Twin and Reinforcement Learning

Angelos Pentelas,
Dimitris Katsiros,
Dimitra Paranou,
George Doukas,
Konstantinos Chondralis,
Giorgos Giannopoulos,
Evangelos Angelou,
George Papastefanatos

Affiliations

Angelos Pentelas: ORCiD; Intracom S.A. Telecom Solutions, Paiania, Greece
Dimitris Katsiros: Athena Research Center, Marousi, Greece
Dimitra Paranou: Athena Research Center, Marousi, Greece
George Doukas: Intracom S.A. Telecom Solutions, Paiania, Greece
Konstantinos Chondralis: Intracom S.A. Telecom Solutions, Paiania, Greece
Giorgos Giannopoulos: ORCiD; Athena Research Center, Marousi, Greece
Evangelos Angelou: ORCiD; Intracom S.A. Telecom Solutions, Paiania, Greece
George Papastefanatos: ORCiD; Athena Research Center, Marousi, Greece

DOI: https://doi.org/10.1109/ACCESS.2024.3421289
Journal volume & issue: Vol. 12
pp. 91462 – 91479

Abstract

Read online

Telco edge data centers (DCs) accommodate applications whose load fluctuates considerably during a day. This variability mandates swift and responsive resource adjustments to mitigate the risk of incurring unnecessary costs during off-peak periods, where a significant fraction of nodes may be under-utilized. Tackling this challenge is integral to optimizing operational efficiency and cost-effectiveness of telco edge DCs. To this end, this article aims to address the Dynamic Data Center Sizing (DDS) problem, which boils down to optimizing the number of active nodes as per current resource demand. The proposed DDS solution consists of two core modules, namely a forecasting module, which predicts resource demands, and a decision-making module, which acts upon predicted demands. The decision-making of DDS is implemented via the filtering and the rank-drain-observe (RDO) algorithms. Filtering is based on integer linear programming, and it computes the theoretically optimal state of the DC based on predicted resource demands. RDO is a heuristic that strives to realize the optimal DC state in an iterative and robust fashion. To expedite RDO in large-scale clusters, we further devise a Reinforcement Learning (RL)-enabled DDS variant (i.e., RL-DDS), in which RDO integrates an RL agent that computes optimized batches of nodes for concurrent deactivation. We propose an innovative solution based on the notion of Digital Twin to train the RL agent in emulation mode. DDS and RL-DDS are evaluated upon real-life testbeds resembling actual DCs. Results demonstrate significant cost reduction, ranging from 7% in conservative and relatively static scenarios, up to 38% in highly dynamic settings.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords