Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms

Yuting Xu; Chao Wang; Jiakai Liang; Keqiang Yue; Wenjun Li; Shilian Zheng; Zhijin Zhao

doi:10.3390/e24101441

Entropy (Oct 2022)

Affiliations

Yuting Xu: Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China
Chao Wang: Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China
Jiakai Liang: Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China
Keqiang Yue: Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China
Wenjun Li: Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China
Shilian Zheng: Science and Technology on Communication Information Security Control Laboratory, The No. 011 Research Center, Jiaxing 314033, China
Zhijin Zhao: The School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China

DOI: https://doi.org/10.3390/e24101441
Journal volume & issue: Vol. 24, no. 10, p. 1441

Abstract

With the development of artificial intelligence, intelligent communication jamming decision making has become an important research direction in cognitive electronic warfare. In this paper, we consider a complex intelligent jamming decision scenario in which both communicating parties adjust their physical layer parameters to avoid jamming in a non-cooperative setting, and the jammer achieves accurate jamming by interacting with the environment. However, when the situations become complex and numerous, traditional reinforcement learning fails to converge and requires a large number of interactions, which is fatal and unrealistic in a real warfare environment. To solve this problem, we propose a soft actor-critic (SAC) algorithm based on deep reinforcement learning and maximum entropy. In the proposed algorithm, we add an improved Wolpertinger architecture to the original SAC algorithm in order to reduce the number of interactions and improve the accuracy of the algorithm. The results show that the proposed algorithm performs well in various jamming scenarios and achieves accurate, fast, and continuous jamming of both communicating parties.
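
The abstract does not say how the Wolpertinger architecture is improved, so the following is only a minimal sketch of the basic Wolpertinger idea as it is generally described: an actor proposes a continuous "proto-action", the k nearest discrete actions are retrieved, and a critic picks the candidate with the highest estimated value. The function name `wolpertinger_select`, the two-dimensional action embeddings, and the toy critic are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def wolpertinger_select(proto_action, action_embeddings, q_value, k=3):
    """Return the index of the discrete action chosen for a proto-action.

    proto_action      -- continuous vector produced by an actor (assumed given)
    action_embeddings -- array of shape (n_actions, dim), one row per discrete action
    q_value           -- callable scoring one embedding (stand-in for a critic)
    k                 -- number of nearest neighbours to refine over
    """
    # Euclidean distance from the proto-action to every discrete action.
    dists = np.linalg.norm(action_embeddings - proto_action, axis=1)
    # Keep the k closest discrete actions as candidates.
    k = min(k, len(action_embeddings))
    candidates = np.argsort(dists)[:k]
    # Let the critic choose the best candidate.
    scores = np.array([q_value(action_embeddings[i]) for i in candidates])
    return int(candidates[np.argmax(scores)])


# Hypothetical discrete jamming actions embedded in a 2-D parameter space.
embeddings = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
proto = np.array([0.1, 0.2])


# Toy critic (assumption): prefers a large second coordinate.
def toy_critic(a):
    return a[1]


chosen = wolpertinger_select(proto, embeddings, toy_critic, k=2)
```

With k = 1 the selection reduces to a plain nearest-neighbour lookup. Larger k lets the critic override the actor's proposal, at the cost of more critic evaluations per step.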

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy
