Differentiable PAC–Bayes Objectives with Partially Aggregated Neural Networks

Felix Biggs; Benjamin Guedj

doi:10.3390/e23101280

Entropy (Sep 2021)

Affiliations

Felix Biggs: Centre for Artificial Intelligence, Department of Computer Science, University College London, London WC1V 6LJ, UK
Benjamin Guedj: Centre for Artificial Intelligence, Department of Computer Science, University College London, London WC1V 6LJ, UK

DOI: https://doi.org/10.3390/e23101280
Journal volume & issue: Vol. 23, no. 10, p. 1280

Abstract

We make two related contributions motivated by the challenge of training stochastic neural networks, particularly in a PAC–Bayesian setting: (1) we show how averaging over an ensemble of stochastic neural networks enables a new class of partially-aggregated estimators, proving that these lead to unbiased lower-variance output and gradient estimators; (2) we reformulate a PAC–Bayesian bound for signed-output networks to derive, in combination with the above, a directly optimisable, differentiable objective and a generalisation guarantee, without using a surrogate loss or loosening the bound. We show empirically that this leads to competitive generalisation guarantees and compares favourably to other methods for training such networks. Finally, we note that the above leads to a simpler PAC–Bayesian training scheme for sign-activation networks than previous work.

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy
