Generating and evaluating explanations of attended and error‐inducing input regions for VQA models

Arijit Ray; Michael Cogswell; Xiao Lin; Kamran Alipour; Ajay Divakaran; Yi Yao; Giedrius Burachas

doi:10.1002/ail2.51

Applied AI Letters (Dec 2021)

Generating and evaluating explanations of attended and error‐inducing input regions for VQA models

Arijit Ray,
Michael Cogswell,
Xiao Lin,
Kamran Alipour,
Ajay Divakaran,
Yi Yao,
Giedrius Burachas

Affiliations

Arijit Ray: Center for Vision Technologies SRI International Princeton New Jersey USA
Michael Cogswell: Center for Vision Technologies SRI International Princeton New Jersey USA
Xiao Lin: Center for Vision Technologies SRI International Princeton New Jersey USA
Kamran Alipour: Department of Computer Science University of California, San Diego La Jolla California USA
Ajay Divakaran: Center for Vision Technologies SRI International Princeton New Jersey USA
Yi Yao: Center for Vision Technologies SRI International Princeton New Jersey USA
Giedrius Burachas: Center for Vision Technologies SRI International Princeton New Jersey USA

DOI: https://doi.org/10.1002/ail2.51
Journal volume & issue: Vol. 2, no. 4
pp. n/a – n/a

Abstract

Read online

Abstract Attention maps, a popular heatmap‐based explanation method for Visual Question Answering, are supposed to help users understand the model by highlighting portions of the image/question used by the model to infer answers. However, we see that users are often misled by current attention map visualizations that point to relevant regions despite the model producing an incorrect answer. Hence, we propose Error Maps that clarify the error by highlighting image regions where the model is prone to err. Error maps can indicate when a correctly attended region may be processed incorrectly leading to an incorrect answer, and hence, improve users' understanding of those cases. To evaluate our new explanations, we further introduce a metric that simulates users' interpretation of explanations to evaluate their potential helpfulness to understand model correctness. We finally conduct user studies to see that our new explanations help users understand model correctness better than baselines by an expected 30% and that our proxy helpfulness metrics correlate strongly (ρ>0.97) with how well users can predict model correctness.

Published in Applied AI Letters

ISSN: 2689-5595 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://onlinelibrary.wiley.com/journal/26895595

About the journal

Abstract

Keywords