Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading

Nataliya Strokina; Wenyan Yang; Joni Pajarinen; Nikolay Serbenyuk; Joni Kämäräinen; Reza Ghabcheloo

doi:10.3389/frobt.2022.838059

Frontiers in Robotics and AI (May 2022)

Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading

Nataliya Strokina,
Wenyan Yang,
Joni Pajarinen,
Nikolay Serbenyuk,
Joni Kämäräinen,
Reza Ghabcheloo

Affiliations

Nataliya Strokina: Computing Sciences, Tampere University, Tampere, Finland
Wenyan Yang: Computing Sciences, Tampere University, Tampere, Finland
Joni Pajarinen: Department of Electrical Engineering and Automation, Aalto University, Espoo, Finland
Nikolay Serbenyuk: Automation Technology and Mechanical Engineering, Tampere University, Tampere, Finland
Joni Kämäräinen: Computing Sciences, Tampere University, Tampere, Finland
Reza Ghabcheloo: Automation Technology and Mechanical Engineering, Tampere University, Tampere, Finland

DOI: https://doi.org/10.3389/frobt.2022.838059
Journal volume & issue: Vol. 9

Abstract

Read online

One of the key challenges in implementing reinforcement learning methods for real-world robotic applications is the design of a suitable reward function. In field robotics, the absence of abundant datasets, limited training time, and high variation of environmental conditions complicate the task further. In this paper, we review reward learning techniques together with visual representations commonly used in current state-of-the-art works in robotics. We investigate a practical approach proposed in prior work to associate the reward with the stage of the progress in task completion based on visual observation. This approach was demonstrated in controlled laboratory conditions. We study its potential for a real-scale field application, autonomous pile loading, tested outdoors in three seasons: summer, autumn, and winter. In our framework, the cumulative reward combines the predictions about the process stage and the task completion (terminal stage). We use supervised classification methods to train prediction models and investigate the most common state-of-the-art visual representations. We use task-specific contrastive features for terminal stage prediction.

Published in Frontiers in Robotics and AI

ISSN: 2296-9144 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Mechanical engineering and machinery; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/robotics-and-ai

About the journal

Abstract

Keywords