Reinforcement Learning Based Adaptive Blocklength and MCS for Optimizing Age Violation Probability

Aysenur Ozkaya; Ahsen Topbas; Elif Tugce Ceran

doi:10.1109/ACCESS.2023.3326748

IEEE Access (Jan 2023)

Reinforcement Learning Based Adaptive Blocklength and MCS for Optimizing Age Violation Probability

Aysenur Ozkaya,
Ahsen Topbas,
Elif Tugce Ceran

Affiliations

Aysenur Ozkaya: ORCiD; Aselsan Inc., Ankara, Turkey
Ahsen Topbas: ORCiD; Department of Electrical and Electronics Engineering, Middle East Technical University, Ankara, Turkey
Elif Tugce Ceran: ORCiD; Department of Electrical and Electronics Engineering, Middle East Technical University, Ankara, Turkey

DOI: https://doi.org/10.1109/ACCESS.2023.3326748
Journal volume & issue: Vol. 11
pp. 122411 – 122425

Abstract

Read online

As a measure of the freshness of data, Age of Information (AoI) has become an essential performance metric in status update applications with stringent timeliness constraints. This study employs adaptive strategies to minimize the novel, information freshness-based performance metric age violation probability (AVP), the probability of the instantaneous age exceeding a predefined constraint, in short packet communications (SPC). AVP can be considered one of the key performance indicators (KPIs) in 5G Ultra-Reliable Low Latency Communications (URLLC), and it is expected to gain more importance in 6G technologies, especially in extreme URLLC (xURLLC). Two distinct approaches are considered: the first focuses on adaptively selecting the blocklengths with either imperfect or missing channel state information exploiting finite blocklength theory approximations. The second involves dynamically choosing the modulation and coding scheme (MCS) to minimize the AVP under stringent timeliness constraints and non-asymptotic information theory bounds. In the context of adaptive blocklength selection, state-aggregated value iteration, Q-learning algorithms, and finite blocklength theory approximations are leveraged to adjust blocklengths to achieve low age violation probabilities adaptively. The simulation results highlight the effectiveness of these algorithms in minimizing age violation probabilities compared to the fixed blocklengths under varying channel conditions. Additionally, constructing a deep reinforcement learning (DRL) framework, we propose a deep Q-network policy for the dynamic selection of the modulation and coding scheme among the available MCSs defined for URLLC systems. Through comprehensive simulations, we demonstrate the superiority of the proposed adaptive methods over traditional benchmark methods.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords