Exact and Soft Successive Refinement of the Information Bottleneck

Hippolyte Charvin; Nicola Catenacci Volpi; Daniel Polani

doi:10.3390/e25091355

Entropy (Sep 2023)

Exact and Soft Successive Refinement of the Information Bottleneck

Hippolyte Charvin,
Nicola Catenacci Volpi,
Daniel Polani

Affiliations

Hippolyte Charvin: School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, UK
Nicola Catenacci Volpi: School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, UK
Daniel Polani: School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, UK

DOI: https://doi.org/10.3390/e25091355
Journal volume & issue: Vol. 25, no. 9
p. 1355

Abstract

Read online

The information bottleneck (IB) framework formalises the essential requirement for efficient information processing systems to achieve an optimal balance between the complexity of their representation and the amount of information extracted about relevant features. However, since the representation complexity affordable by real-world systems may vary in time, the processing cost of updating the representations should also be taken into account. A crucial question is thus the extent to which adaptive systems can leverage the information content of already existing IB-optimal representations for producing new ones, which target the same relevant features but at a different granularity. We investigate the information-theoretic optimal limits of this process by studying and extending, within the IB framework, the notion of successive refinement, which describes the ideal situation where no information needs to be discarded for adapting an IB-optimal representation’s granularity. Thanks in particular to a new geometric characterisation, we analytically derive the successive refinability of some specific IB problems (for binary variables, for jointly Gaussian variables, and for the relevancy variable being a deterministic function of the source variable), and provide a linear-programming-based tool to numerically investigate, in the discrete case, the successive refinement of the IB. We then soften this notion into a quantification of the loss of information optimality induced by several-stage processing through an existing measure of unique information. Simple numerical experiments suggest that this quantity is typically low, though not entirely negligible. These results could have important implications for (i) the structure and efficiency of incremental learning in biological and artificial agents, (ii) the comparison of IB-optimal observation channels in statistical decision problems, and (iii) the IB theory of deep neural networks.

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy

About the journal

Abstract

Keywords