Geometric Reinforcement Learning for Robotic Manipulation

Naseem Alhousani; Matteo Saveriano; Ibrahim Sevinc; Talha Abdulkuddus; Hatice Kose; Fares J. Abu-Dakka

doi:10.1109/ACCESS.2023.3322654

IEEE Access (Jan 2023)

Geometric Reinforcement Learning for Robotic Manipulation

Naseem Alhousani,
Matteo Saveriano,
Ibrahim Sevinc,
Talha Abdulkuddus,
Hatice Kose,
Fares J. Abu-Dakka

Affiliations

Naseem Alhousani: ORCiD; Faculty of Computer and Informatics Engineering, Istanbul Technical University, Maslak, Sarıyer, İstanbul, Turkey
Matteo Saveriano: ORCiD; Department of Industrial Engineering (DII), University of Trento, Trento, Italy
Ibrahim Sevinc: MCFLY Robot Teknolojileri A.Ş, Sarıyer, İstanbul, Turkey
Talha Abdulkuddus: ORCiD; ILITRON Enerji ve Bilgi Teknolojileri A.Ş, Kağıthane, İstanbul, Turkey
Hatice Kose: ORCiD; Faculty of Computer and Informatics Engineering, Istanbul Technical University, Maslak, Sarıyer, İstanbul, Turkey
Fares J. Abu-Dakka: ORCiD; Munich Institute of Robotics and Machine Intelligence (MIRMI), Technical University of Munich, Munich, Germany

DOI: https://doi.org/10.1109/ACCESS.2023.3322654
Journal volume & issue: Vol. 11
pp. 111492 – 111505

Abstract

Read online

Reinforcement learning (RL) is a popular technique that allows an agent to learn by trial and error while interacting with a dynamic environment. The traditional Reinforcement Learning (RL) approach has been successful in learning and predicting Euclidean robotic manipulation skills such as positions, velocities, and forces. However, in robotics, it is common to encounter non-Euclidean data such as orientation or stiffness, and failing to account for their geometric nature can negatively impact learning accuracy and performance. In this paper, to address this challenge, we propose a novel framework for RL that leverages Riemannian geometry, which we call Geometric Reinforcement Learning ( $\mathcal {G}$ -RL), to enable agents to learn robotic manipulation skills with non-Euclidean data. Specifically, $\mathcal {G}$ -RL utilizes the tangent space in two ways: a tangent space for parameterization and a local tangent space for mapping to a non-Euclidean manifold. The policy is learned in the parameterization tangent space, which remains constant throughout the training. The policy is then transferred to the local tangent space via parallel transport and projected onto the non-Euclidean manifold. The local tangent space changes over time to remain within the neighborhood of the current manifold point, reducing the approximation error. Therefore, by introducing a geometrically grounded pre- and post-processing step into the traditional RL pipeline, our $\mathcal {G}$ -RL framework enables several model-free algorithms designed for Euclidean space to learn from non-Euclidean data without modifications. Experimental results, obtained both in simulation and on a real robot, support our hypothesis that $\mathcal {G}$ -RL is more accurate and converges to a better solution than approximating non-Euclidean data.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords