Global Translation of Classification Models

Mohammad Al-Merri; Zina Ben Miled

doi:10.3390/info13050246

Information (May 2022)

Global Translation of Classification Models

Mohammad Al-Merri,
Zina Ben Miled

Affiliations

Mohammad Al-Merri: Electrical and Computer Engineering Department, Indiana University-Purdue University, 723 W. Michigan St., SL 160, Indianapolis, IN 46202, USA
Zina Ben Miled: Electrical and Computer Engineering Department, Indiana University-Purdue University, 723 W. Michigan St., SL 160, Indianapolis, IN 46202, USA

DOI: https://doi.org/10.3390/info13050246
Journal volume & issue: Vol. 13, no. 5
p. 246

Abstract

Read online

The widespread and growing usage of machine learning models, particularly for critical areas such as law, predicate the need for global interpretability. Models that cannot be audited are vulnerable to biases inherited from the datasets that were used to develop them. Moreover, locally interpretable models are vulnerable to adversarial attacks. To address this issue, the present paper proposes a new methodology that can translate any existing machine learning model into a globally interpretable one. MTRE-PAN is a hybrid SVM-decision tree architecture that leverages the interpretability of linear hyperplanes by creating a set of polygons that delimit the decision boundaries of the target model. Moreover, the present paper introduces two new metrics: certain and boundary model parities. These metrics can be used to accurately evaluate the performance of the interpretable model near the decision boundaries. These metrics are used to compare MTRE-PAN to a previously proposed interpretable architecture called TRE-PAN. As in the case of TRE-PAN, MTRE-PAN aims at providing global interpretability. The comparisons are performed over target models developed using three benchmark datasets: Abalone, Census and Diabetes data. The results show that MTRE-PAN generates interpretable models that have a lower number of leaves and a higher agreement with the target models, especially around the most important regions in the feature space, namely the decision boundaries.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords