Committing to the wrong artificial delegate in a collective-risk dilemma is better than directly committing mistakes

Inês Terrucha; Elias Fernández Domingos; Pieter Simoens; Tom Lenaerts

doi:10.1038/s41598-024-61153-9

Scientific Reports (May 2024)

Affiliations

Inês Terrucha: Department of Information Technology-IDLab, Ghent University-IMEC
Elias Fernández Domingos: Artificial Intelligence Lab, Computer Science Department, Vrije Universiteit Brussel
Pieter Simoens: Department of Information Technology-IDLab, Ghent University-IMEC
Tom Lenaerts: Artificial Intelligence Lab, Computer Science Department, Vrije Universiteit Brussel

DOI: https://doi.org/10.1038/s41598-024-61153-9
Journal volume & issue: Vol. 14, no. 1, pp. 1–13

Abstract

While autonomous artificial agents are assumed to perfectly execute the strategies they are programmed with, the humans who design them may make mistakes. These mistakes may lead to a misalignment between the humans’ intended goals and their agents’ observed behavior, a problem of value alignment. Such an alignment problem may have particularly strong consequences when these autonomous systems are used in social contexts that involve some form of collective risk. By means of an evolutionary game theoretical model, we investigate whether errors in the configuration of artificial agents change the outcome of a collective-risk dilemma, in comparison to a scenario with no delegation. Delegation is distinguished here from no-delegation simply by the moment at which a mistake occurs: either when programming or choosing the agent (in the case of delegation) or when executing the actions at each round of the game (in the case of no-delegation). We find that, while errors decrease the success rate, it is better to delegate and commit to a somewhat flawed strategy, perfectly executed by an autonomous agent, than to commit execution errors directly. Our model also shows that, in the long term, delegation strategies should be favored over no-delegation if given the choice.
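The distinction the abstract draws between the two error timings can be illustrated with a toy simulation. The sketch below is not the paper's evolutionary game theoretical model: it omits payoffs, the risk of loss, and evolutionary dynamics, and every name and number in it (`STRATEGIES`, `play`, `success_rate`, ten rounds, a group of six, the threshold) is a hypothetical choice made only for illustration. It shows one way to apply an error once, when the agent is chosen, versus at every round of play.

```python
import random

ROUNDS = 10  # assumed number of rounds (illustrative only)

# Hypothetical strategy set: fixed per-round plans, 1 = contribute one unit.
STRATEGIES = {
    "always": [1] * ROUNDS,
    "half": [1, 0] * (ROUNDS // 2),
    "never": [0] * ROUNDS,
}


def play(intended, error, delegate, threshold, rng):
    """Return True if the group's total contribution reaches the threshold."""
    total = 0
    for name in intended:
        if delegate:
            # Delegation: one possible mistake, made once when choosing the agent.
            if rng.random() < error:
                name = rng.choice([s for s in STRATEGIES if s != name])
            actions = STRATEGIES[name]  # the agent then executes without error
        else:
            # No delegation: the intended action may be flipped at every round.
            actions = [1 - a if rng.random() < error else a
                       for a in STRATEGIES[name]]
        total += sum(actions)
    return total >= threshold


def success_rate(intended, error, delegate, threshold, games=2000, seed=0):
    """Fraction of simulated games in which the threshold is reached."""
    rng = random.Random(seed)
    wins = sum(play(intended, error, delegate, threshold, rng)
               for _ in range(games))
    return wins / games


# Example: six players who all intend to contribute in half of the rounds.
group = ["half"] * 6
for delegate in (True, False):
    print("delegate" if delegate else "no-delegate",
          success_rate(group, error=0.1, delegate=delegate, threshold=30))
```

Whatever this toy prints depends entirely on the assumed strategy set, threshold, and error rate, so it should not be read as reproducing the paper's findings; it only makes the two error timings concrete.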

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
