Modeling Theory of Mind in Dyadic Games Using Adaptive Feedback Control

Ismael T. Freire; Xerxes D. Arsiwalla; Jordi-Ysard Puigbò; Paul Verschure

doi:10.3390/info14080441

Information (Aug 2023)

Modeling Theory of Mind in Dyadic Games Using Adaptive Feedback Control

Ismael T. Freire,
Xerxes D. Arsiwalla,
Jordi-Ysard Puigbò,
Paul Verschure

Affiliations

Ismael T. Freire: Donders Institute for Brain, Cognition and Behaviour, Radboud University, 6525 AJ Nijmegen, The Netherlands
Xerxes D. Arsiwalla: Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018 Barcelona, Spain
Jordi-Ysard Puigbò: Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018 Barcelona, Spain
Paul Verschure: Donders Institute for Brain, Cognition and Behaviour, Radboud University, 6525 AJ Nijmegen, The Netherlands

DOI: https://doi.org/10.3390/info14080441
Journal volume & issue: Vol. 14, no. 8
p. 441

Abstract

Read online

A major challenge in cognitive science and AI has been to understand how intelligent autonomous agents might acquire and predict the behavioral and mental states of other agents in the course of complex social interactions. How does such an agent model the goals, beliefs, and actions of other agents it interacts with? What are the computational principles to model a Theory of Mind (ToM)? Deep learning approaches to address these questions fall short of a better understanding of the problem. In part, this is due to the black-box nature of deep networks, wherein computational mechanisms of ToM are not readily revealed. Here, we consider alternative hypotheses seeking to model how the brain might realize a ToM. In particular, we propose embodied and situated agent models based on distributed adaptive control theory to predict the actions of other agents in five different game-theoretic tasks (Harmony Game, Hawk-Dove, Stag Hunt, Prisoner’s Dilemma, and Battle of the Exes). Our multi-layer control models implement top-down predictions from adaptive to reactive layers of control and bottom-up error feedback from reactive to adaptive layers. We test cooperative and competitive strategies among seven different agent models (cooperative, greedy, tit-for-tat, reinforcement-based, rational, predictive, and internal agents). We show that, compared to pure reinforcement-based strategies, probabilistic learning agents modeled on rational, predictive, and internal phenotypes perform better in game-theoretic metrics across tasks. The outlined autonomous multi-agent models might capture systems-level processes underlying a ToM and suggest architectural principles of ToM from a control-theoretic perspective.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords