A Responsible Machine Learning Workflow with Focus on Interpretable Models, Post-hoc Explanation, and Discrimination Testing

Navdeep Gill; Patrick Hall; Kim Montgomery; Nicholas Schmidt

doi:10.3390/info11030137

Information (Feb 2020)

A Responsible Machine Learning Workflow with Focus on Interpretable Models, Post-hoc Explanation, and Discrimination Testing

Navdeep Gill,
Patrick Hall,
Kim Montgomery,
Nicholas Schmidt

Affiliations

Navdeep Gill: H2O.ai, Mountain View 94043, CA, USA
Patrick Hall: H2O.ai, Mountain View 94043, CA, USA
Kim Montgomery: H2O.ai, Mountain View 94043, CA, USA
Nicholas Schmidt: BLDS, LLC, Philadelphia 19103, PA, USA

DOI: https://doi.org/10.3390/info11030137
Journal volume & issue: Vol. 11, no. 3
p. 137

Abstract

Read online

This manuscript outlines a viable approach for training and evaluating machine learning systems for high-stakes, human-centered, or regulated applications using common Python programming tools. The accuracy and intrinsic interpretability of two types of constrained models, monotonic gradient boosting machines and explainable neural networks, a deep learning architecture well-suited for structured data, are assessed on simulated data and publicly available mortgage data. For maximum transparency and the potential generation of personalized adverse action notices, the constrained models are analyzed using post-hoc explanation techniques including plots of partial dependence and individual conditional expectation and with global and local Shapley feature importance. The constrained model predictions are also tested for disparate impact and other types of discrimination using measures with long-standing legal precedents, adverse impact ratio, marginal effect, and standardized mean difference, along with straightforward group fairness measures. By combining interpretable models, post-hoc explanations, and discrimination testing with accessible software tools, this text aims to provide a template workflow for machine learning applications that require high accuracy and interpretability and that mitigate risks of discrimination.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords