Deep Reinforcement Learning for Model Predictive Controller Based on Disturbed Single Rigid Body Model of Biped Robots

Landong Hou; Bin Li; Weilong Liu; Yiming Xu; Shuhui Yang; Xuewen Rong

doi:10.3390/machines10110975

Machines (Oct 2022)

Deep Reinforcement Learning for Model Predictive Controller Based on Disturbed Single Rigid Body Model of Biped Robots

Landong Hou,
Bin Li,
Weilong Liu,
Yiming Xu,
Shuhui Yang,
Xuewen Rong

Affiliations

Landong Hou: School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Bin Li: School of Mathematics and Statistics, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Weilong Liu: School of Mathematics and Statistics, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Yiming Xu: School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Shuhui Yang: School of Mathematics and Statistics, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Xuewen Rong: School of Control Science and Engineering, Shandong University, Jinan 250100, China

DOI: https://doi.org/10.3390/machines10110975
Journal volume & issue: Vol. 10, no. 11
p. 975

Abstract

Read online

This paper modifies the single rigid body (SRB) model, and considers the swinging leg as the disturbances to the centroid acceleration and rotational acceleration of the SRB model. This paper proposes deep reinforcement learning (DRL)-based model predictive control (MPC) to resist the disturbances of the swinging leg. The DRL predicts the swing leg disturbances, and then MPC gives the optimal ground reaction forces according to the predicted disturbances. We use the proximal policy optimization (PPO) algorithm among the DRL methods since it is a very stable and widely applicable algorithm. It is an on-policy algorithm based on the actor–critic framework. The simulation results show that the improved SRB model and the PPO-based MPC method can accurately predict the disturbances of the swinging leg to the SRB model and resist the disturbance, making the locomotion more robust.

Published in Machines

ISSN: 2075-1702 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Mechanical engineering and machinery
Website: http://www.mdpi.com/journal/machines

About the journal

Abstract

Keywords