An Intelligent SDWN Routing Algorithm Based on Network Situational Awareness and Deep Reinforcement Learning

Jinqiang Li; Miao Ye; Linqiang Huang; Xiaofang Deng; Hongbing Qiu; Yong Wang; Qiuxiang Jiang

doi:10.1109/ACCESS.2023.3302178

IEEE Access (Jan 2023)

Affiliations

Jinqiang Li: School of Information and Communication, Guilin University of Electronic Technology, Guilin, China
Miao Ye: School of Information and Communication, Guilin University of Electronic Technology, Guilin, China
Linqiang Huang: School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, China
Xiaofang Deng: School of Information and Communication, Guilin University of Electronic Technology, Guilin, China
Hongbing Qiu: School of Information and Communication, Guilin University of Electronic Technology, Guilin, China
Yong Wang: School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, China
Qiuxiang Jiang: Guangxi Key Laboratory of Wireless Broadband Communication and Signal Processing, Guilin University of Electronic Technology, Guilin, China

DOI: https://doi.org/10.1109/ACCESS.2023.3302178
Journal volume & issue: Vol. 11, pp. 83322–83342

Abstract

To address the challenges of obtaining network state information, flexibly forwarding data, and improving the communication quality of service (QoS) in wireless network transmission environments in response to dynamic changes in network topology, this paper introduces an intelligent routing algorithm based on deep reinforcement learning (DRL) with network situational awareness under a software-defined wireless networking (SDWN) architecture. First, comprehensive network traffic information is collected under the SDWN architecture, and a graph convolutional network-gated recurrent unit (GCN-GRU) prediction mechanism is used to perceive future traffic trends. Second, a proximal policy optimization (PPO) DRL-based data forwarding mechanism is designed in the knowledge plane. The predicted network traffic matrix and topology information matrix are treated as the DRL environment, while next-hop adjacent nodes are treated as executable actions, and action selection policies are designed for different network conditions. To guide the learning and improvement of the DRL agent’s routing strategy, reward functions of different forms are designed by utilizing network link information and different penalty mechanisms. Additionally, importance sampling steps and gradient clipping methods are employed during gradient updating to enhance the convergence speed and stability of the designed intelligent routing method. Experimental results show that this solution outperforms traditional routing methods in network throughput, delay, packet loss rate, and wireless node distance. Compared to value-function-based Dueling Deep Q-Network (DQN) routing, the convergence of the proposed method is significantly faster and more stable. Simultaneously, hardware storage consumption is reduced, and real-time routing decisions can be made using the current network state information. The source code can be accessed at https://github.com/GuetYe/DRL-PPONSA.
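The two mechanisms the abstract names, next-hop neighbors as actions and an importance-sampled update with clipping, can be illustrated with a minimal sketch. It assumes the standard PPO clipped surrogate objective; the abstract does not give the paper's exact loss or reward form, so this is not the authors' implementation. The function names `masked_next_hop_probs` and `ppo_clipped_loss`, the `clip_eps` value, and the use of an adjacency row as an action mask are all hypothetical choices made for illustration.

```python
import numpy as np


def masked_next_hop_probs(logits, adjacency_row):
    """Softmax over next-hop candidates, restricted to adjacent nodes.

    logits: policy scores, one per node (hypothetical policy output).
    adjacency_row: 1 where a node is a neighbor of the current node, else 0.
    """
    logits = np.asarray(logits, dtype=float)
    mask = np.asarray(adjacency_row) > 0
    if not mask.any():
        raise ValueError("current node has no adjacent nodes")
    # Non-neighbors get -inf so their probability becomes exactly zero.
    masked = np.where(mask, logits, -np.inf)
    exp = np.exp(masked - masked[mask].max())
    return exp / exp.sum()


def ppo_clipped_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Standard PPO clipped surrogate loss (to be minimized).

    The importance-sampling ratio pi_new / pi_old reweights samples
    collected under the old policy; clipping bounds the update size.
    """
    new_logp = np.asarray(new_logp, dtype=float)
    old_logp = np.asarray(old_logp, dtype=float)
    advantages = np.asarray(advantages, dtype=float)
    ratio = np.exp(new_logp - old_logp)
    unclipped = ratio * advantages
    clipped = np.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Take the pessimistic (smaller) objective, then negate to get a loss.
    return -np.mean(np.minimum(unclipped, clipped))
```

In this sketch, a node with two neighbors and equal scores would choose each neighbor with probability 0.5, and a sample whose probability doubled under the new policy contributes at most `1 + clip_eps` times its positive advantage to the objective.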

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
