A Geometric Perspective on Information Plane Analysis

Mina Basirat; Bernhard C. Geiger; Peter M. Roth

doi:10.3390/e23060711

Entropy (Jun 2021)

A Geometric Perspective on Information Plane Analysis

Mina Basirat,
Bernhard C. Geiger,
Peter M. Roth

Affiliations

Mina Basirat: Institute of Computer Graphics and Vision, Graz University of Technology, Inffeldgasse 16/II, 8010 Graz, Austria
Bernhard C. Geiger: Know-Center GmbH, Inffeldgasse 13, 8010 Graz, Austria
Peter M. Roth: International AI Future Lab, Technical University of Munich (TUM), Willy-Messerschmitt-Straße 1, 85521 Taufkirchen, Germany

DOI: https://doi.org/10.3390/e23060711
Journal volume & issue: Vol. 23, no. 6
p. 711

Abstract

Read online

Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradicting results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels.

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy

About the journal

Abstract

Keywords