Multi-dimensional resource allocation strategy for LEO satellite communication uplinks based on deep reinforcement learning

Yu Hu; Feipeng Qiu; Fei Zheng; Jilong Zhao

doi:10.1186/s13677-024-00621-z

Journal of Cloud Computing: Advances, Systems and Applications (Mar 2024)

Multi-dimensional resource allocation strategy for LEO satellite communication uplinks based on deep reinforcement learning

Yu Hu,
Feipeng Qiu,
Fei Zheng,
Jilong Zhao

Affiliations

Yu Hu: Ministry of Education Key Laboratory of Cognitive Radio and Information Processing, Guilin University of Electronic Technology
Feipeng Qiu: Ministry of Education Key Laboratory of Cognitive Radio and Information Processing, Guilin University of Electronic Technology
Fei Zheng: Ministry of Education Key Laboratory of Cognitive Radio and Information Processing, Guilin University of Electronic Technology
Jilong Zhao: Ministry of Education Key Laboratory of Cognitive Radio and Information Processing, Guilin University of Electronic Technology

DOI: https://doi.org/10.1186/s13677-024-00621-z
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 15

Abstract

Read online

Abstract In the LEO satellite communication system, the resource utilization rate is very low due to the constrained resources on satellites and the non-uniform distribution of traffics. In addition, the rapid movement of LEO satellites leads to complicated and changeable networks, which makes it difficult for traditional resource allocation strategies to improve the resource utilization rate. To solve the above problem, this paper proposes a resource allocation strategy based on deep reinforcement learning. The strategy takes the weighted sum of spectral efficiency, energy efficiency and blocking rate as the optimization objective, and constructs a joint power and channel allocation model. The strategy allocates channels and power according to the number of channels, the number of users and the type of business. In the reward decision mechanism, the maximum reward is obtained by maximizing the increment of the optimization target. However, during the optimization process, the decision always focuses on the optimal allocation for current users, and ignores QoS for new users. To avoid the situation, current service beams are integrated with high- traffic beams, and states of beams are refactored to maximize long-term benefits to improve system performance. Simulation experiments show that in scenarios with a high number of users, the proposed resource allocation strategy reduces the blocking rate by at least 5% compared to reinforcement learning methods, effectively enhancing resource utilization.

Published in Journal of Cloud Computing: Advances, Systems and Applications

ISSN: 2192-113X (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journalofcloudcomputing.springeropen.com

About the journal

Abstract

Keywords