Tongxin xuebao (Jun 2020)

Improved satellite resource allocation algorithm based on DRL and MOP

  • Pei ZHANG,
  • Shuaijun LIU,
  • Zhiguo MA,
  • Xiaohui WANG,
  • Junde SONG

Journal volume & issue
Vol. 41
pp. 51 – 60

Abstract

Read online

In view of the multi-objective optimization (MOP) problem of sequential decision-making for resource allocations in multi-beam satellite systems,a deep reinforcement learning(DRL) based DRL-MOP algorithm was proposed to improve the system performance and user satisfaction degree.With considering the normalized weighted sum of spectrum efficiency,energy efficiency,and satisfaction index as the optimization goal,the dynamically changing system environments and user arrival model were built by the proposed algorithm,and the optimization of the accumulative performance in satellite systems based on DRL and MOP was realized.Simulation results show that the proposed algorithm can solve the MOP problem with rapid convergence ability and low complexity,and it is obviously superior to other algorithms in terms of system performance and user satisfaction optimization.

Keywords