IET Networks (Sep 2020)
QoE‐aware Q‐learning resource allocation for NOMA wireless multimedia communications
Abstract
Hybrid control traffic and multimedia flow over emerging non‐orthogonal multiple access (NOMA) could provide both very low latency and very high bandwidth. In this study, a Q‐learning‐based resource allocation scheme is proposed to improve the quality of experience (QoE) for NOMA user equipment (UE) in downlink wireless multimedia communications. In the proposed framework, the utility is modelled as the QoE with regard to communication resource cost, where UE acts as the agent in the reinforcement Q‐learning. UE observes the wireless channel states and takes resource allocation actions based on the immediate reward of QoE gain and communication cost. In addition, benefiting from the NOMA communications, the authors propose to solve the multiple agent reinforcement learning problems with the simplified sequential single agent reinforcement learning (SARL) approach. The numerical simulation results demonstrate the efficiency of the proposed Q‐QoE resource allocation framework and prove that the UE would obtain desirable QoE performance with the SARL scheme.
Keywords