Design and evaluation of a global workspace agent embodied in a realistic multimodal environment

Rousslan Fernand Julien Dossa; Kai Arulkumaran; Arthur Juliani; Shuntaro Sasai; Ryota Kanai

doi:10.3389/fncom.2024.1352685

Frontiers in Computational Neuroscience (Jun 2024)

Affiliations

Rousslan Fernand Julien Dossa: Araya Inc., Tokyo, Japan
Kai Arulkumaran: Araya Inc., Tokyo, Japan
Arthur Juliani: Microsoft Research, New York, NY, United States
Shuntaro Sasai: Araya Inc., Tokyo, Japan
Ryota Kanai: Araya Inc., Tokyo, Japan

DOI: https://doi.org/10.3389/fncom.2024.1352685
Journal volume & issue: Vol. 18

Abstract

As the apparent intelligence of artificial neural networks (ANNs) advances, they are increasingly likened to the functional networks and information processing capabilities of the human brain. Such comparisons have typically focused on particular modalities, such as vision or language. The next frontier is to use the latest advances in ANNs to design and investigate scalable models of higher-level cognitive processes, such as conscious information access, which have historically lacked concrete and specific hypotheses for scientific evaluation. In this work, we propose and then empirically assess an embodied agent with a structure based on global workspace theory (GWT) as specified in the recently proposed “indicator properties” of consciousness. In contrast to prior works on GWT which utilized single modalities, our agent is trained to navigate 3D environments based on realistic audiovisual inputs. We find that the global workspace architecture performs better and more robustly at smaller working memory sizes, as compared to a standard recurrent architecture. Beyond performance, we perform a series of analyses on the learned representations of our architecture and share findings that point to task complexity and regularization being essential for feature learning and the development of meaningful attentional patterns within the workspace.

Published in Frontiers in Computational Neuroscience

ISSN: 1662-5188 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/computational_neuroscience
