Joint Beamforming, Power Control, and Interference Coordination: A Reinforcement Learning Approach Replacing Rewards With Examples

Jeng-Shin Sheu; Cheng-Kuei Huang; Chun-Lung Tsai

doi:10.1109/ACCESS.2023.3306518

IEEE Access (Jan 2023)

Affiliations

Jeng-Shin Sheu: Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan
Cheng-Kuei Huang: Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan
Chun-Lung Tsai: Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan

DOI: https://doi.org/10.1109/ACCESS.2023.3306518
Journal volume & issue: Vol. 11, pp. 88854–88868

Abstract

In this paper, we consider the problem of multi-cell interference coordination through joint beamforming and power control. Recent work has explored reinforcement learning (RL) methods for this complex optimization problem. Typically, a decentralized multi-agent framework is adopted, in which each base station operates as an independent RL agent. This distributed coordination has gained attention because, for single-agent RL models, it is difficult to design a reward function that effectively captures the condition of the entire cellular network. However, the distributed approach introduces its own challenges, particularly the non-stationarity of the multi-agent environment, since agents continually adapt their policies in response to one another. This non-stationarity necessitates information exchange among agents, because each agent's local observations are insufficient to capture the true state of the environment. Unfortunately, this information exchange incurs significant overhead, which limits data transmission capacity. To address these challenges, we propose a novel single-agent RL approach that eliminates the need for both information exchange and a conventional reward function. Instead, we use examples of success to guide the learning process. Simulation results show that the proposed approach outperforms the existing multi-agent method and a theoretical algorithm in terms of sum rate. Additionally, our approach ensures a uniform quality of service while maximizing the overall sum rate.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
