An Information Theoretic Interpretation to Deep Neural Networks

Xiangxiang Xu; Shao-Lun Huang; Lizhong Zheng; Gregory W. Wornell

doi:10.3390/e24010135

Entropy (Jan 2022)

Affiliations

Xiangxiang Xu: Data Science and Information Technology Research Center, Tsinghua–Berkeley Shenzhen Institute, Shenzhen 518055, China
Shao-Lun Huang: Data Science and Information Technology Research Center, Tsinghua–Berkeley Shenzhen Institute, Shenzhen 518055, China
Lizhong Zheng: Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Gregory W. Wornell: Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA

Journal volume & issue: Vol. 24, no. 1, p. 135

Abstract

With the unprecedented performance achieved by deep learning, it is commonly believed that deep neural networks (DNNs) attempt to extract informative features for learning tasks. To formalize this intuition, we apply local information geometric analysis and establish an information-theoretic framework for feature selection, which demonstrates the information-theoretic optimality of DNN features. Moreover, we conduct a quantitative analysis to characterize the impact of network structure on the feature extraction process of DNNs. Our investigation naturally leads to a performance metric for evaluating the effectiveness of extracted features, called the H-score, which illustrates the connection between the practical training process of DNNs and the information-theoretic framework. Finally, we validate our theoretical results with experiments on synthesized data and the ImageNet dataset.

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy
