Bio-Inspired Optimization Algorithm Associated with Reinforcement Learning for Multi-Objective Operating Planning in Radioactive Environment
Shihan Kong,
Fang Wu,
Hao Liu,
Wei Zhang,
Jinan Sun,
Jian Wang,
Junzhi Yu
Affiliations
Shihan Kong
The State Key Laboratory for Turbulence and Complex Systems, Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China
Fang Wu
SPIC Nuclear Energy Co., Ltd., Beijing 100029, China
Hao Liu
The College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
Wei Zhang
The College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
Jinan Sun
National Engineering Research Center for Software Engineering, Peking University, Beijing 100871, China
Jian Wang
The Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Junzhi Yu
The State Key Laboratory for Turbulence and Complex Systems, Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China
This paper aims to solve the multi-objective operating planning problem in the radioactive environment. First, a more complicated radiation dose model is constructed, considering difficulty levels at each operating point. Based on this model, the multi-objective operating planning problem is converted to a variant traveling salesman problem (VTSP). Second, with respect to this issue, a novel combinatorial algorithm framework, namely hyper-parameter adaptive genetic algorithm (HPAGA), integrating bio-inspired optimization with reinforcement learning, is proposed, which allows for adaptive adjustment of the hyperparameters of GA so as to obtain optimal solutions efficiently. Third, comparative studies demonstrate the superior performance of the proposed HPAGA against classical evolutionary algorithms for various TSP instances. Additionally, a case study in the simulated radioactive environment implies the potential application of HPAGA in the future.