Deep Reinforcement Learning Based Dynamic Proportional-Integral (PI) Gain Auto-Tuning Method for a Robot Driver System

Joonghoo Park; Heejung Kim; Kyunghun Hwang; Sejoon Lim

doi:10.1109/ACCESS.2022.3159785

IEEE Access (Jan 2022)


Affiliations

Joonghoo Park: Graduate School of Automotive Engineering, Kookmin University, Seongbuk-gu, Seoul, South Korea
Heejung Kim: Graduate School of Automotive Engineering, Kookmin University, Seongbuk-gu, Seoul, South Korea
Kyunghun Hwang: Electrification Energy Efficiency & Drivability Team 3, Hyundai Motor Company, Hwaseong, South Korea
Sejoon Lim: Department of Automobile and IT Convergence, Kookmin University, Seongbuk-gu, Seoul, South Korea

DOI: https://doi.org/10.1109/ACCESS.2022.3159785
Journal volume & issue: Vol. 10, pp. 31043–31057

Abstract


To meet increasingly stringent fuel economy regulations, automakers around the world are designing modules such as engines, motors, transmissions, and batteries to be as efficient as possible. To verify the effect of these designs on overall vehicle fuel efficiency, a vehicle equipped with each module is placed on a chassis dynamometer and driven to follow a target vehicle speed while the actual fuel efficiency is measured. These tests have traditionally been performed by human operators, who are now being replaced by robots (physical or software) to ensure the accuracy and reliability of the test results. Although the conventional proportional-integral (PI) controller has a simple structure and is easy to implement, its optimal gains must be found again whenever test conditions such as the vehicle or drive cycle change, which is difficult and time-consuming. In this study, we propose a PI controller gain adjustment algorithm using deep reinforcement learning. In a simulation environment, the reinforcement learning agent learns to dynamically modify the PI gain values of the acceleration/deceleration pedal to better follow the target vehicle speed. Perturbation is applied in each training episode to reduce the difference between the simulation and the real testing environment. Upon completion of training, the trained agent performs an adjustment process that generates a reference gain table, which we then use to perform a real test. The performance of the proposed system was evaluated using a Hyundai Tucson HEV (NX4) on an AVL chassis dynamometer. We also compared the performance of our proposed algorithm with that of traditional fuzzy logic-based PI controllers. The experimental results show that the proposed control system achieved a performance improvement of around 46.8% over the conventional PI control system in terms of root mean square error.
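The idea of driving a PI pedal controller from a reference gain table, and scoring it by root mean square error, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how the gain table is indexed, so the table keyed by target speed, the class name `ScheduledPI`, the pedal range of -1 (full brake) to 1 (full accelerator), and the toy first-order vehicle model in `simulate` are all assumptions made for illustration.

```python
import math


def rmse(target, actual):
    """Root mean square error between target and measured speed traces."""
    return math.sqrt(sum((t - a) ** 2 for t, a in zip(target, actual)) / len(target))


class ScheduledPI:
    """PI pedal controller whose gains come from a lookup table (hypothetical layout)."""

    def __init__(self, gain_table, dt):
        # gain_table: list of (speed_upper_bound, kp, ki), sorted by upper bound.
        # Indexing by target speed is an assumption, not taken from the paper.
        self.gain_table = gain_table
        self.dt = dt
        self.integral = 0.0  # accumulated ki * error * dt

    def gains(self, target_speed):
        """Return (kp, ki) for the first table row whose bound covers the speed."""
        for upper, kp, ki in self.gain_table:
            if target_speed <= upper:
                return kp, ki
        _, kp, ki = self.gain_table[-1]
        return kp, ki

    def step(self, target_speed, actual_speed):
        """Return a pedal command in [-1, 1]; negative means brake (assumed convention)."""
        kp, ki = self.gains(target_speed)
        error = target_speed - actual_speed
        # Integrating ki * error avoids a jump in the output when ki changes.
        self.integral += ki * error * self.dt
        command = kp * error + self.integral
        return max(-1.0, min(1.0, command))


def simulate(controller, targets, dt, accel_gain=3.0, drag=0.05):
    """Toy first-order vehicle model, used only to exercise the controller."""
    speed = 0.0
    speeds = []
    for target in targets:
        command = controller.step(target, speed)
        speed = max(0.0, speed + (accel_gain * command - drag * speed) * dt)
        speeds.append(speed)
    return speeds


if __name__ == "__main__":
    table = [(50.0, 0.10, 0.01), (float("inf"), 0.20, 0.02)]
    targets = [min(0.5 * k, 60.0) for k in range(600)]  # ramp to 60, then hold
    speeds = simulate(ScheduledPI(table, dt=0.1), targets, dt=0.1)
    print(f"RMSE: {rmse(targets, speeds):.3f}")
```

In the method the abstract describes, the table entries would come from the trained agent's adjustment process rather than being hand-picked as they are here.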

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
