Efficient Reinforcement Learning for 3D Jumping Monopods

Riccardo Bussola; Michele Focchi; Andrea Del Prete; Daniele Fontanelli; Luigi Palopoli

doi:10.3390/s24154981

Sensors (Aug 2024)

Efficient Reinforcement Learning for 3D Jumping Monopods

Riccardo Bussola,
Michele Focchi,
Andrea Del Prete,
Daniele Fontanelli,
Luigi Palopoli

Affiliations

Riccardo Bussola: Dipartimento di Ingegneria and Scienza Dell’Informazione (DISI), University of Trento, 38123 Trento, Italy
Michele Focchi: Dipartimento di Ingegneria and Scienza Dell’Informazione (DISI), University of Trento, 38123 Trento, Italy
Andrea Del Prete: Dipartimento di Ingegneria Industriale (DII), University of Trento, 38123 Trento, Italy
Daniele Fontanelli: Dipartimento di Ingegneria Industriale (DII), University of Trento, 38123 Trento, Italy
Luigi Palopoli: Dipartimento di Ingegneria and Scienza Dell’Informazione (DISI), University of Trento, 38123 Trento, Italy

DOI: https://doi.org/10.3390/s24154981
Journal volume & issue: Vol. 24, no. 15
p. 4981

Abstract

Read online

We consider a complex control problem: making a monopod accurately reach a target with a single jump. The monopod can jump in any direction at different elevations of the terrain. This is a paradigm for a much larger class of problems, which are extremely challenging and computationally expensive to solve using standard optimization-based techniques. Reinforcement learning (RL) is an interesting alternative, but an end-to-end approach in which the controller must learn everything from scratch can be non-trivial with a sparse-reward task like jumping. Our solution is to guide the learning process within an RL framework leveraging nature-inspired heuristic knowledge. This expedient brings widespread benefits, such as a drastic reduction of learning time, and the ability to learn and compensate for possible errors in the low-level execution of the motion. Our simulation results reveal a clear advantage of our solution against both optimization-based and end-to-end RL approaches.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords