A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data

Raphael Mazzine Barbosa de Oliveira; David Martens

doi:10.3390/app11167274

Applied Sciences (Aug 2021)

A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data

Raphael Mazzine Barbosa de Oliveira,
David Martens

Affiliations

Raphael Mazzine Barbosa de Oliveira: Department of Engineering Management, University of Antwerp, Prinsstraat 13, 2000 Antwerpen, Belgium
David Martens: Department of Engineering Management, University of Antwerp, Prinsstraat 13, 2000 Antwerpen, Belgium

DOI: https://doi.org/10.3390/app11167274
Journal volume & issue: Vol. 11, no. 16
p. 7274

Abstract

Read online

Counterfactual explanations are viewed as an effective way to explain machine learning predictions. This interest is reflected by a relatively young literature with already dozens of algorithms aiming to generate such explanations. These algorithms are focused on finding how features can be modified to change the output classification. However, this rather general objective can be achieved in different ways, which brings about the need for a methodology to test and benchmark these algorithms. The contributions of this work are manifold: First, a large benchmarking study of 10 algorithmic approaches on 22 tabular datasets is performed, using nine relevant evaluation metrics; second, the introduction of a novel, first of its kind, framework to test counterfactual generation algorithms; third, a set of objective metrics to evaluate and compare counterfactual results; and, finally, insight from the benchmarking results that indicate which approaches obtain the best performance on what type of dataset. This benchmarking study and framework can help practitioners in determining which technique and building blocks most suit their context, and can help researchers in the design and evaluation of current and future counterfactual generation algorithms. Our findings show that, overall, there’s no single best algorithm to generate counterfactual explanations as the performance highly depends on properties related to the dataset, model, score, and factual point specificities.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords