Time-in-action RL

Jiangcheng Zhu; Zhepei Wang; Zhepei Wang; Douglas Mcilwraith; Chao Wu; Chao Xu; Yike Guo

doi:10.1049/iet-csr.2018.0001

IET Cyber-systems and Robotics (Feb 2019)

Time-in-action RL

Jiangcheng Zhu,
Zhepei Wang,
Zhepei Wang,
Douglas Mcilwraith,
Chao Wu,
Chao Xu,
Yike Guo

Affiliations

Jiangcheng Zhu: Institute of Cyber-Systems and Control, Department of Control Science and Engineering, Zhejiang University
Zhepei Wang: Institute of Cyber-Systems and Control, Department of Control Science and Engineering, Zhejiang University
Zhepei Wang: Institute of Cyber-Systems and Control, Department of Control Science and Engineering, Zhejiang University
Douglas Mcilwraith: Data Science Institute, Department of Computing, Imperial College London
Chao Wu: Zhejiang University
Chao Xu: Institute of Cyber-Systems and Control, Department of Control Science and Engineering, Zhejiang University
Yike Guo: Data Science Institute, Department of Computing, Imperial College London

DOI: https://doi.org/10.1049/iet-csr.2018.0001

Abstract

Read online

The authors propose a novel reinforcement learning (RL) framework, where agent behaviour is governed by traditional control theory. This integrated approach, called time-in-action RL, enables RL to be applicable to many real-world systems, where underlying dynamics are known in their control theoretical formalism. The key insight to facilitate this integration is to model the explicit time function, mapping the state-action pair to the time accomplishing the action by its underlying controller. In their framework, they describe an action by its value (action value), and the time that it takes to perform (action time). An action-value results from the policy of RL regarding a state. Action time is estimated by an explicit time model learnt from the measured activities of the underlying controller. RL value network is then trained with embedded time model to predict action time. This approach is tested using a variant of Atari Pong and proved to be convergent.

Published in IET Cyber-systems and Robotics

ISSN: 2631-6315 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Science: Science (General): Cybernetics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://ietresearch.onlinelibrary.wiley.com/journal/26316315

About the journal

Abstract

Keywords