IET Intelligent Transport Systems (Dec 2022)
A sharing deep reinforcement learning method for efficient vehicle platooning control
Abstract
Abstract The combination of reinforcement learning and platooning control has been widely studied, which has been considered to be altruistic on the basis of safety. However, the platooning control method based on reinforcement learning is not mature. This paper proposes a platoon sharing deep deterministic policy gradient algorithm (PSDDPG) for a multi‐vehicle network which overcomes the problem of low efficiency of continuous action space exploration. On the basis of the noise of deep deterministic policy gradient (DDPG) algorithm, additional platoon noise is added to enhance the diversity of training samples during exploration and avoid using a single‐vehicle data, which could weaken model robustness. The method of using the time sequence information as an input and replay buffer backup is proposed to prevent the problems of insufficient exploration and low sampling efficiency from deteriorating the training effect. The robustness of the proposed algorithm is verified by tests using the Carla simulator. Also, the platoon merging, overtaking, cruising following and obstacle avoidance control of a vehicle platoon is realized under typical traffic flow conditions. The experimental results show that the PSDDPG algorithm can provide an efficient control strategy and thus has the potential to reduce energy consumption and improve road efficiency.