A detailed study of interpretability of deep neural network based top taggers

Ayush Khot; Mark S Neubauer; Avik Roy

doi:10.1088/2632-2153/ace0a1

Machine Learning: Science and Technology (Jan 2023)

Affiliations

Ayush Khot: Department of Physics & National Center for Supercomputing Applications (NCSA), University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States of America
Mark S Neubauer: Department of Physics & National Center for Supercomputing Applications (NCSA), University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States of America
Avik Roy: Department of Physics & National Center for Supercomputing Applications (NCSA), University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States of America

DOI: https://doi.org/10.1088/2632-2153/ace0a1
Journal volume & issue: Vol. 4, no. 3
p. 035003

Abstract

Recent developments in the methods of explainable artificial intelligence (XAI) allow researchers to explore the inner workings of deep neural networks (DNNs), revealing crucial information about input–output relationships and realizing how data connects with machine learning models. In this paper we explore interpretability of DNN models designed to identify jets coming from top quark decay in high energy proton–proton collisions at the Large Hadron Collider. We review a subset of existing top tagger models and explore different quantitative methods to identify which features play the most important roles in identifying the top jets. We also investigate how and why feature importance varies across different XAI metrics, how correlations among features impact their explainability, and how latent space representations encode information as well as correlate with physically meaningful quantities. Our studies uncover some major pitfalls of existing XAI methods and illustrate how they can be overcome to obtain consistent and meaningful interpretation of these models. We additionally illustrate the activity of hidden layers as neural activation pattern diagrams and demonstrate how they can be used to understand how DNNs relay information across the layers and how this understanding can help to make such models significantly simpler by allowing effective model reoptimization and hyperparameter tuning. These studies not only facilitate a methodological approach to interpreting models but also unveil new insights about what these models learn. Incorporating these observations into augmented model design, we propose the particle flow interaction network model and demonstrate how interpretability-inspired model augmentation can improve top tagging performance.

Published in Machine Learning: Science and Technology

ISSN: 2632-2153 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://iopscience.iop.org/journal/2632-2153
