Keep the Human in the Loop: Arguments for Human Assistance in the Synthesis of Simulation Data for Robot Training

Carina Liebers; Pranav Megarajan; Jonas Auda; Tim C. Stratmann; Max Pfingsthorn; Uwe Gruenefeld; Stefan Schneegass

doi:10.3390/mti8030018

Multimodal Technologies and Interaction (Mar 2024)

Affiliations

Carina Liebers: HCI Group, Faculty of Computer Science, University of Duisburg-Essen, Schuetzenbahn 70, 45127 Essen, Germany
Pranav Megarajan: OFFIS—Institute for IT, 26121 Oldenburg, Germany
Jonas Auda: HCI Group, Faculty of Computer Science, University of Duisburg-Essen, Schuetzenbahn 70, 45127 Essen, Germany
Tim C. Stratmann: OFFIS—Institute for IT, 26121 Oldenburg, Germany
Max Pfingsthorn: OFFIS—Institute for IT, 26121 Oldenburg, Germany
Uwe Gruenefeld: HCI Group, Faculty of Computer Science, University of Duisburg-Essen, Schuetzenbahn 70, 45127 Essen, Germany
Stefan Schneegass: HCI Group, Faculty of Computer Science, University of Duisburg-Essen, Schuetzenbahn 70, 45127 Essen, Germany

DOI: https://doi.org/10.3390/mti8030018
Journal volume & issue: Vol. 8, no. 3, p. 18

Abstract

Robot training often takes place in simulated environments, particularly with reinforcement learning. Therefore, multiple training environments are generated using domain randomization to ensure transferability to real-world applications and compensate for unknown real-world states. We propose improving domain randomization by involving human application experts in various stages of the training process. Experts can provide valuable judgments on simulation realism, identify missing properties, and verify robot execution. Our human-in-the-loop workflow describes how they can enhance the process in five stages: validating and improving real-world scans, correcting virtual representations, specifying application-specific object properties, verifying and influencing simulation environment generation, and verifying robot training. We outline examples and highlight research opportunities. Furthermore, we present a case study in which we implemented different prototypes, demonstrating the potential of human experts in the given stages. Our early insights indicate that human input can benefit robot training at different stages.

Published in Multimodal Technologies and Interaction

ISSN: 2414-4088 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology; Science
Website: http://www.mdpi.com/journal/mti
