Decentralized Multi-Agent Control of a Manipulator in Continuous Task Learning

Asad Ali Shahid; Jorge Said Vidal Sesin; Damjan Pecioski; Francesco Braghin; Dario Piga; Loris Roveda

doi:10.3390/app112110227

Applied Sciences (Nov 2021)

Affiliations

Asad Ali Shahid: Istituto Dalle Molle di Studi Sull’Intelligenza Artificiale (IDSIA), Scuola Universitaria Professionale della Svizzera Italiana (SUPSI), Università della Svizzera Italiana (USI), CH-6962 Lugano-Viganello, Switzerland
Jorge Said Vidal Sesin: Department of Mechanical Engineering, Politecnico di Milano, 20156 Milano, Italy
Damjan Pecioski: Department of Mechanical Engineering, Politecnico di Milano, 20156 Milano, Italy
Francesco Braghin: Department of Mechanical Engineering, Politecnico di Milano, 20156 Milano, Italy
Dario Piga: Istituto Dalle Molle di Studi Sull’Intelligenza Artificiale (IDSIA), Scuola Universitaria Professionale della Svizzera Italiana (SUPSI), Università della Svizzera Italiana (USI), CH-6962 Lugano-Viganello, Switzerland
Loris Roveda: Istituto Dalle Molle di Studi Sull’Intelligenza Artificiale (IDSIA), Scuola Universitaria Professionale della Svizzera Italiana (SUPSI), Università della Svizzera Italiana (USI), CH-6962 Lugano-Viganello, Switzerland

DOI: https://doi.org/10.3390/app112110227
Journal volume & issue: Vol. 11, no. 21
p. 10227

Abstract

Many real-world tasks require multiple agents to work together. In robotics, the term "multi-agent" usually refers to multiple manipulators collaborating to solve a given task, each controlled by a single agent. However, with the increasing development of modular and reconfigurable robots, it is also important to investigate multi-agent controllers that learn to manage a manipulator’s degrees of freedom (DoF) in separate clusters for the execution of a given application (e.g., to cope with faults or, in part, with new kinematic configurations). Within this context, this paper focuses on decentralizing the learning and (re)execution of the robot control action for a generic multi-DoF manipulator. The proposed framework employs a multi-agent paradigm, and the paper investigates how this paradigm affects the control action learning process. Multiple variations of the multi-agent framework are proposed and tested, and their performance is compared with that of a centralized (i.e., single-agent) control action learning framework previously proposed by some of the authors. As a case study, a manipulation task (grasping and lifting) of an object unknown to the robot controller is considered for validation, employing a Franka Emika Panda robot. The proposed multi-agent framework is implemented and tested in the MuJoCo environment. The results show that the proposed decentralized approach accelerates the early stage of learning with respect to the single-agent framework while also reducing the computational effort. In fact, when the controller is decentralized, the variables in the action space can be efficiently separated into several groups assigned to several agents. This splits the original complex problem into multiple simpler ones, improving the task learning process.
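
The idea of separating the action space into groups handled by different agents can be illustrated with a minimal Python sketch. It is not the authors' implementation: the contiguous clustering rule, the number of agents, the `PlaceholderAgent` class, and the observation size are all hypothetical, and the only fact assumed about the robot is that the Panda arm has 7 joints (the gripper is ignored here). Each agent produces commands only for its own cluster of joints, and the sub-actions are assembled into one full command vector.

```python
import random

N_DOF = 7  # arm joints of the Panda; gripper left out of this sketch


def split_dofs(n_dof, n_agents):
    """Partition joint indices 0..n_dof-1 into contiguous clusters (assumed rule)."""
    base, extra = divmod(n_dof, n_agents)
    clusters, start = [], 0
    for i in range(n_agents):
        size = base + (1 if i < extra else 0)  # first clusters absorb the remainder
        clusters.append(list(range(start, start + size)))
        start += size
    return clusters


class PlaceholderAgent:
    """Hypothetical stand-in for a learned policy controlling one joint cluster."""

    def __init__(self, joint_ids, seed):
        self.joint_ids = joint_ids
        self.rng = random.Random(seed)

    def act(self, observation):
        # One normalized command in [-1, 1] per controlled joint.
        return [self.rng.uniform(-1.0, 1.0) for _ in self.joint_ids]


def assemble_action(agents, observation, n_dof):
    """Merge each agent's sub-action into a single full-arm action vector."""
    action = [0.0] * n_dof
    for agent in agents:
        for joint_id, command in zip(agent.joint_ids, agent.act(observation)):
            action[joint_id] = command
    return action


clusters = split_dofs(N_DOF, n_agents=3)
agents = [PlaceholderAgent(ids, seed=i) for i, ids in enumerate(clusters)]
observation = [0.0] * 10  # dummy shared observation of arbitrary size
full_action = assemble_action(agents, observation, N_DOF)
print(clusters)
print(len(full_action))
```

With three agents, each policy in this sketch outputs only two or three values instead of seven, which is the sense in which decentralization reduces the size of each individual learning problem.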

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
