Combining brain-computer interfaces with deep reinforcement learning for robot training: a feasibility study in a simulation environment

Mathias Vukelić; Michael Bui; Anna Vorreuther; Katharina Lingelbach

doi:10.3389/fnrgo.2023.1274730

Frontiers in Neuroergonomics (Nov 2023)

Combining brain-computer interfaces with deep reinforcement learning for robot training: a feasibility study in a simulation environment

Mathias Vukelić,
Michael Bui,
Anna Vorreuther,
Katharina Lingelbach

Affiliations

Mathias Vukelić: Applied Neurocognitive Systems, Fraunhofer Institute for Industrial Engineering (IAO), Stuttgart, Germany
Michael Bui: Applied Neurocognitive Systems, Fraunhofer Institute for Industrial Engineering (IAO), Stuttgart, Germany
Anna Vorreuther: Applied Neurocognitive Systems, Institute of Human Factors and Technology Management (IAT), University of Stuttgart, Stuttgart, Germany
Katharina Lingelbach: Applied Neurocognitive Systems, Fraunhofer Institute for Industrial Engineering (IAO), Stuttgart, Germany

DOI: https://doi.org/10.3389/fnrgo.2023.1274730
Journal volume & issue: Vol. 4

Abstract

Read online

Deep reinforcement learning (RL) is used as a strategy to teach robot agents how to autonomously learn complex tasks. While sparsity is a natural way to define a reward in realistic robot scenarios, it provides poor learning signals for the agent, thus making the design of good reward functions challenging. To overcome this challenge learning from human feedback through an implicit brain-computer interface (BCI) is used. We combined a BCI with deep RL for robot training in a 3-D physical realistic simulation environment. In a first study, we compared the feasibility of different electroencephalography (EEG) systems (wet- vs. dry-based electrodes) and its application for automatic classification of perceived errors during a robot task with different machine learning models. In a second study, we compared the performance of the BCI-based deep RL training to feedback explicitly given by participants. Our findings from the first study indicate the use of a high-quality dry-based EEG-system can provide a robust and fast method for automatically assessing robot behavior using a sophisticated convolutional neural network machine learning model. The results of our second study prove that the implicit BCI-based deep RL version in combination with the dry EEG-system can significantly accelerate the learning process in a realistic 3-D robot simulation environment. Performance of the BCI-based trained deep RL model was even comparable to that achieved by the approach with explicit human feedback. Our findings emphasize the usage of BCI-based deep RL methods as a valid alternative in those human-robot applications where no access to cognitive demanding explicit human feedback is available.

Published in Frontiers in Neuroergonomics

ISSN: 2673-6195 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry: Neurology. Diseases of the nervous system
Website: https://www.frontiersin.org/journals/neuroergonomics

About the journal

Abstract

Keywords