IDC: quantitative evaluation benchmark of interpretation methods for deep text classification models

Mohammed Khaleel; Lei Qi; Wallapak Tavanapong; Johnny Wong; Adisak Sukul; David A. M. Peterson

doi:10.1186/s40537-022-00583-6

Journal of Big Data (Mar 2022)

IDC: quantitative evaluation benchmark of interpretation methods for deep text classification models

Mohammed Khaleel,
Lei Qi,
Wallapak Tavanapong,
Johnny Wong,
Adisak Sukul,
David A. M. Peterson

Affiliations

Mohammed Khaleel: Department of Computer Science, Iowa State University
Lei Qi: Department of Computer Science, Iowa State University
Wallapak Tavanapong: Department of Computer Science, Iowa State University
Johnny Wong: Department of Computer Science, Iowa State University
Adisak Sukul: Department of Computer Science, Iowa State University
David A. M. Peterson: Department of Political Science, Iowa State University

DOI: https://doi.org/10.1186/s40537-022-00583-6
Journal volume & issue: Vol. 9, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Recent advances in deep neural networks have achieved outstanding success in natural language processing tasks. Interpretation methods that provide insight into the decision-making process of these models have received an influx of research attention because of the success and the black-box nature of the deep text classification models. Evaluation of these methods has been based on changes in classification accuracy or prediction confidence when removing important words identified by these methods. There are no measurements of the actual difference between the predicted important words and humans’ interpretation of ground truth because of the lack of interpretation ground truth. A large publicly available interpretation ground truth has the potential to advance the development of interpretation methods. Manual labeling important words for each document to create a large interpretation ground truth is very time-consuming. This paper presents (1) IDC, a new benchmark for quantitative evaluation of interpretation methods for deep text classification models, and (2) evaluation of six interpretation methods using the benchmark. The IDC benchmark consists of: (1) Three methods that generate three pseudo-interpretation ground truth datasets. (2) Three performance metrics: interpretation recall, interpretation precision, and Cohen’s kappa inter-agreement. Findings: IDC-generated interpretation ground truth agrees with human annotators on sampled movie reviews. IDC identifies Layer-wise Relevance Propagation and the gradient-by-input methods as the winning interpretation methods in this study.

Published in Journal of Big Data

ISSN: 2196-1115 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journalofbigdata.springeropen.com

About the journal

Abstract

Keywords