Two-Population Ant Colony Algorithm Based on Heuristic Reinforcement Learning

LIU Zhongqiang, YOU Xiaoming, LIU Sheng

doi:10.3778/j.issn.1673-9418.1903068

Jisuanji kexue yu tansuo (Mar 2020)

Two-Population Ant Colony Algorithm Based on Heuristic Reinforcement Learning

LIU Zhongqiang, YOU Xiaoming, LIU Sheng

Affiliations

LIU Zhongqiang, YOU Xiaoming, LIU Sheng: 1. College of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China 2. School of Management, Shanghai University of Engineering Science, Shanghai 201620, China

DOI: https://doi.org/10.3778/j.issn.1673-9418.1903068
Journal volume & issue: Vol. 14, no. 3
pp. 460 – 469

Abstract

Read online

A heterogeneous two-population ant colony algorithm based on heuristic reinforcement learning is pro-posed to solve the problem that traditional ant colony algorithm is prone to fall into local optimum and convergence speed is slow when solving TSP. Ant colony can be divided into main population and sub-population.The main population is responsible for solution construction and pheromone update. The sub-population replaces the solution set of the main population while constructing the solution. At the beginning of the algorithm, heuristic operator is used to control the communication frequency of two populations adaptively, and the solution exchange mode is controlled by the deviation degree coefficient. At the early stage, the optimal solution of the subpopulation is allowed to replace the random solution of the main population, increasing the diversity of solutions. Meanwhile, the reinforcement learning mechanism is introduced to reward the pheromones in the optimal path of the main population adaptively, so as to increase the probability of the optimal public path selected later. In the later stage, the optimal solution of the sub-population is controlled to replace the worst solution of the main population, the quantity of pheromones in the optimal path is strengthened, and the pheromones in the optimal path of the main population are rewarded, so as to further improve the convergence speed of the algorithm. The simulation results show that the algorithm can effectively jump out of the local optimum, and the quality of the solution is obviously improved in the large-scale test set.

Published in Jisuanji kexue yu tansuo

ISSN: 1673-9418 (Print)
Publisher: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://fcst.ceaj.org

About the journal

Abstract

Keywords