Counterfactual Models for Fair and Adequate Explanations

Nicholas Asher; Lucas De Lara; Soumya Paul; Chris Russell

doi:10.3390/make4020014

Machine Learning and Knowledge Extraction (Mar 2022)

Counterfactual Models for Fair and Adequate Explanations

Nicholas Asher,
Lucas De Lara,
Soumya Paul,
Chris Russell

Affiliations

Nicholas Asher: Institut de Recherche en Informatique de Toulouse, Université Paul Sabatier, 31062 Toulouse, France
Lucas De Lara: Institut de Mathématiques de Toulouse, Université Paul Sabatier, 31062 Toulouse, France
Soumya Paul: Telindus, 18 rue du Puits Romain, L-8070 Luxembourg, Luxembourg
Chris Russell: Amazon Research, 72072 Tübingen, Germany

DOI: https://doi.org/10.3390/make4020014
Journal volume & issue: Vol. 4, no. 2
pp. 316 – 349

Abstract

Read online

Recent efforts have uncovered various methods for providing explanations that can help interpret the behavior of machine learning programs. Exact explanations with a rigorous logical foundation provide valid and complete explanations, but they have an epistemological problem: they are often too complex for humans to understand and too expensive to compute even with automated reasoning methods. Interpretability requires good explanations that humans can grasp and can compute. We take an important step toward specifying what good explanations are by analyzing the epistemically accessible and pragmatic aspects of explanations. We characterize sufficiently good, or fair and adequate, explanations in terms of counterfactuals and what we call the conundra of the explainee, the agent that requested the explanation. We provide a correspondence between logical and mathematical formulations for counterfactuals to examine the partiality of counterfactual explanations that can hide biases; we define fair and adequate explanations in such a setting. We provide formal results about the algorithmic complexity of fair and adequate explanations. We then detail two sophisticated counterfactual models, one based on causal graphs, and one based on transport theories. We show transport based models have several theoretical advantages over the competition as explanation frameworks for machine learning algorithms.

Published in Machine Learning and Knowledge Extraction

ISSN: 2504-4990 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware
Website: https://www.mdpi.com/journal/make

About the journal

Abstract

Keywords