A Bayesian Network Approach to Explainable Reinforcement Learning with Distal Information

Rudy Milani; Maximilian Moll; Renato De Leone; Stefan Pickl

doi:10.3390/s23042013

Sensors (Feb 2023)

A Bayesian Network Approach to Explainable Reinforcement Learning with Distal Information

Rudy Milani,
Maximilian Moll,
Renato De Leone,
Stefan Pickl

Affiliations

Rudy Milani: Faculty of Computer Science, Universitaet der Bundeswehr Muenchen, Werner-Heisenberg-Weg 39, 85577 Neubiberg, Germany
Maximilian Moll: Faculty of Computer Science, Universitaet der Bundeswehr Muenchen, Werner-Heisenberg-Weg 39, 85577 Neubiberg, Germany
Renato De Leone: School of Science and Technology, University of Camerino, via Madonna delle Carceri 9, 62032 Camerino, Italy
Stefan Pickl: Faculty of Computer Science, Universitaet der Bundeswehr Muenchen, Werner-Heisenberg-Weg 39, 85577 Neubiberg, Germany

DOI: https://doi.org/10.3390/s23042013
Journal volume & issue: Vol. 23, no. 4
p. 2013

Abstract

Read online

Nowadays, Artificial Intelligence systems have expanded their competence field from research to industry and daily life, so understanding how they make decisions is becoming fundamental to reducing the lack of trust between users and machines and increasing the transparency of the model. This paper aims to automate the generation of explanations for model-free Reinforcement Learning algorithms by answering “why” and “why not” questions. To this end, we use Bayesian Networks in combination with the NOTEARS algorithm for automatic structure learning. This approach complements an existing framework very well and demonstrates thus a step towards generating explanations with as little user input as possible. This approach is computationally evaluated in three benchmarks using different Reinforcement Learning methods to highlight that it is independent of the type of model used and the explanations are then rated through a human study. The results obtained are compared to other baseline explanation models to underline the satisfying performance of the framework presented in terms of increasing the understanding, transparency and trust in the action chosen by the agent.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords